Viwav2vec2-base-100h Open-source Speech Model - Pre-trained on Vietnamese data to facilitate fine-tuning of downstream tasks

Viwav2vec2 Base 100h

Developed by dragonSwing

A base Wav2Vec2 model pretrained on 100 hours of unlabeled Vietnamese speech audio from the VLSP dataset, requiring fine-tuning for downstream tasks.

Speech Recognition

Transformers

License: Apache-2.0 (open source)

Tags: #Vietnamese Speech Recognition #16kHz Audio Adaptation #Unsupervised Pretraining

Downloads 19

Release Time: 3/2/2022

Model Overview

This is a pretrained Vietnamese speech model based on the Wav2Vec2 architecture. It was pretrained on speech audio sampled at 16 kHz and is intended as a starting point for fine-tuning on downstream tasks such as automatic speech recognition.

Model Features

Vietnamese Speech Pretraining

Specifically pretrained on Vietnamese speech data, suitable for Vietnamese speech processing tasks.

16kHz Sampling Support

The model was pretrained on speech audio sampled at 16 kHz, so input audio must also be sampled at 16 kHz. Resample any audio recorded at a different rate before passing it to the model.
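The sampling-rate requirement can be illustrated with a small resampling helper. This is a minimal sketch that uses plain linear interpolation in NumPy; the function name `resample_linear` is hypothetical and not part of the model's tooling. Linear interpolation applies no anti-aliasing filter, so a real pipeline would more likely rely on a band-limited resampler from an audio library.

```python
import numpy as np

TARGET_SR = 16_000  # sampling rate the model was pretrained on


def resample_linear(waveform, orig_sr, target_sr=TARGET_SR):
    """Resample a mono waveform to target_sr by linear interpolation.

    Illustrative only: no low-pass filtering is applied, so downsampling
    from a higher rate can introduce aliasing.
    """
    waveform = np.asarray(waveform, dtype=np.float32)
    if orig_sr == target_sr:
        return waveform
    # Number of output samples that covers the same duration.
    n_out = int(round(len(waveform) / orig_sr * target_sr))
    old_times = np.arange(len(waveform)) / orig_sr
    new_times = np.arange(n_out) / target_sr
    return np.interp(new_times, old_times, waveform).astype(np.float32)


# One second of a 440 Hz tone recorded at 44.1 kHz, converted to 16 kHz.
tone = np.sin(2 * np.pi * 440 * np.arange(44_100) / 44_100)
tone_16k = resample_linear(tone, orig_sr=44_100)
```

After this step `tone_16k` holds one second of audio at the rate the model expects.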

Based on Wav2Vec2 Architecture

Uses the Wav2Vec2 architecture proposed by Facebook, which learns speech representations directly from raw audio.

Model Capabilities

Speech Feature Extraction

Vietnamese Speech Recognition (after fine-tuning)
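Feature extraction with the pretrained encoder might look like the sketch below. Several things here are assumptions rather than facts from this page: the Hub identifier `dragonSwing/viwav2vec2-base-100h` is inferred from the model name and developer, the convolutional kernel and stride values are those of the standard Wav2Vec2 base configuration, and the helper names `expected_num_frames` and `extract_features` are hypothetical. The heavy imports sit inside the function so the frame-count helper can be used on its own.

```python
# Assumed Hugging Face Hub id, inferred from the model name and developer.
MODEL_ID = "dragonSwing/viwav2vec2-base-100h"

# Convolutional feature encoder of the standard Wav2Vec2 base config
# (assumed to apply to this checkpoint).
CONV_KERNELS = (10, 3, 3, 3, 3, 2, 2)
CONV_STRIDES = (5, 2, 2, 2, 2, 2, 2)


def expected_num_frames(num_samples):
    """Number of feature frames the encoder produces for num_samples."""
    length = num_samples
    for kernel, stride in zip(CONV_KERNELS, CONV_STRIDES):
        length = (length - kernel) // stride + 1
    return length


def extract_features(waveform, model_id=MODEL_ID):
    """Return hidden states of shape (1, frames, hidden_size).

    waveform: 1-D float array of mono audio sampled at 16 kHz.
    Requires torch and transformers, and downloads the checkpoint.
    """
    import torch
    from transformers import Wav2Vec2FeatureExtractor, Wav2Vec2Model

    # Built by hand in case the repository ships no preprocessor config.
    feature_extractor = Wav2Vec2FeatureExtractor(
        feature_size=1,
        sampling_rate=16_000,
        padding_value=0.0,
        do_normalize=True,
        return_attention_mask=False,
    )
    model = Wav2Vec2Model.from_pretrained(model_id)
    model.eval()

    inputs = feature_extractor(waveform, sampling_rate=16_000, return_tensors="pt")
    with torch.no_grad():
        outputs = model(inputs.input_values)
    return outputs.last_hidden_state


# One second of 16 kHz audio maps to 49 frames, roughly one per 20 ms.
frames_per_second = expected_num_frames(16_000)
```

Calling `extract_features` on a one-second clip should, under these assumptions, return a tensor whose second dimension matches `frames_per_second`.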

Use Cases

Speech Technology

Vietnamese Automatic Speech Recognition

Fine-tune this model on labeled Vietnamese speech to build a speech-to-text system.
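The first steps of such a fine-tuning setup could be sketched as follows, assuming a character-level CTC approach, which is a common choice for Wav2Vec2 but is not stated on this page. The Hub identifier is inferred from the model name, and `build_char_vocab` and `load_for_ctc` are hypothetical helper names. A model pretrained on unlabeled audio has no output vocabulary of its own, so one is built from the training transcripts.

```python
import unicodedata

# Assumed Hugging Face Hub id, inferred from the model name and developer.
MODEL_ID = "dragonSwing/viwav2vec2-base-100h"


def build_char_vocab(transcripts):
    """Build a character-to-id mapping for CTC from training transcripts.

    Text is NFC-normalized so each accented Vietnamese letter is one
    code point. Spaces are replaced by the "|" word delimiter.
    """
    chars = set()
    for text in transcripts:
        normalized = unicodedata.normalize("NFC", text.lower())
        chars.update(ch for ch in normalized if ch != " ")
    vocab = {ch: idx for idx, ch in enumerate(sorted(chars))}
    for special in ("|", "[UNK]", "[PAD]"):
        vocab[special] = len(vocab)
    return vocab


def load_for_ctc(vocab, model_id=MODEL_ID):
    """Load the pretrained encoder with a new, untrained CTC head.

    Requires transformers and downloads the checkpoint.
    """
    from transformers import Wav2Vec2ForCTC

    return Wav2Vec2ForCTC.from_pretrained(
        model_id,
        vocab_size=len(vocab),
        pad_token_id=vocab["[PAD]"],  # also serves as the CTC blank token
        ctc_loss_reduction="mean",
    )


# Toy transcripts; a real run would use the labeled training set.
vocab = build_char_vocab(["xin chào", "cảm ơn"])
```

The resulting model would still need to be trained on paired audio and transcripts before it produces usable text.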

