Viwav2vec2 Base 100h
V
Viwav2vec2 Base 100h
Developed by dragonSwing
A base Wav2Vec2 model pretrained on 100 hours of unlabeled Vietnamese speech audio from the VLSP dataset, requiring fine-tuning for downstream tasks.
Downloads 19
Release Time : 3/2/2022
Model Overview
This is a Vietnamese speech pretrained model based on the Wav2Vec2 architecture, trained with 16kHz sampled speech data, suitable for downstream tasks such as automatic speech recognition.
Model Features
Vietnamese Speech Pretraining
Specifically pretrained on Vietnamese speech data, suitable for Vietnamese speech processing tasks.
16kHz Sampling Support
The model is trained with 16kHz sampled speech data; ensure input data has the same sampling rate during use.
Based on Wav2Vec2 Architecture
Utilizes the Wav2Vec2 architecture proposed by Facebook, capable of learning speech structures from raw audio.
Model Capabilities
Speech Feature Extraction
Vietnamese Speech Recognition
Use Cases
Speech Technology
Vietnamese Automatic Speech Recognition
Achieve Vietnamese speech-to-text functionality by fine-tuning this model.
Featured Recommended AI Models