V

Viwav2vec2 Base 100h

Developed by dragonSwing
A base Wav2Vec2 model pretrained on 100 hours of unlabeled Vietnamese speech audio from the VLSP dataset, requiring fine-tuning for downstream tasks.
Downloads 19
Release Time : 3/2/2022

Model Overview

This is a Vietnamese speech pretrained model based on the Wav2Vec2 architecture, trained with 16kHz sampled speech data, suitable for downstream tasks such as automatic speech recognition.

Model Features

Vietnamese Speech Pretraining
Specifically pretrained on Vietnamese speech data, suitable for Vietnamese speech processing tasks.
16kHz Sampling Support
The model is trained with 16kHz sampled speech data; ensure input data has the same sampling rate during use.
Based on Wav2Vec2 Architecture
Utilizes the Wav2Vec2 architecture proposed by Facebook, capable of learning speech structures from raw audio.

Model Capabilities

Speech Feature Extraction
Vietnamese Speech Recognition

Use Cases

Speech Technology
Vietnamese Automatic Speech Recognition
Achieve Vietnamese speech-to-text functionality by fine-tuning this model.
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase