
Wav2vec2 Base Vietnamese

Developed by dragonSwing
A Vietnamese speech recognition model based on the Wav2Vec2 architecture and fine-tuned on the VLSP dataset. It expects speech input sampled at 16 kHz.
Downloads 16
Release Time: 3/2/2022

Model Overview

This model is an automatic speech recognition (ASR) system for Vietnamese. It is based on Facebook's Wav2Vec2 architecture and was fine-tuned on 100 hours of annotated speech, so it can be used directly for speech-to-text tasks.
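The following is a minimal sketch of how such a model might be loaded and run with the Hugging Face `transformers` library. The Hub identifier `dragonSwing/wav2vec2-base-vietnamese` is an assumption inferred from the developer and model names on this page, and the sketch assumes the checkpoint ships a standard Wav2Vec2 processor and CTC head. The `transcribe` helper is a hypothetical name introduced here for illustration.

```python
# Assumed Hub identifier, inferred from the developer and model name; verify before use.
MODEL_ID = "dragonSwing/wav2vec2-base-vietnamese"
TARGET_SR = 16_000  # the model expects 16 kHz input


def transcribe(waveform, sampling_rate=TARGET_SR, model_id=MODEL_ID):
    """Transcribe a mono waveform (sequence of floats) with greedy decoding."""
    if sampling_rate != TARGET_SR:
        raise ValueError(
            f"expected {TARGET_SR} Hz audio, got {sampling_rate} Hz; resample first"
        )

    # Heavy dependencies are imported lazily so the rate check works without them.
    import torch
    from transformers import Wav2Vec2ForCTC, Wav2Vec2Processor

    processor = Wav2Vec2Processor.from_pretrained(model_id)
    model = Wav2Vec2ForCTC.from_pretrained(model_id)
    model.eval()

    inputs = processor(waveform, sampling_rate=sampling_rate, return_tensors="pt")
    with torch.no_grad():
        logits = model(inputs.input_values).logits

    # Pick the most likely token per frame; the processor collapses repeats and blanks.
    predicted_ids = torch.argmax(logits, dim=-1)
    return processor.batch_decode(predicted_ids)[0]
```

A caller would load audio with any library that yields a float array at 16 kHz and pass it to `transcribe`. Loading the model once and reusing it across calls would be preferable in a real application; it is reloaded here only to keep the sketch short.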

Model Features

Vietnamese optimization
Specially trained and optimized for Vietnamese speech characteristics
No language model required
Can be used directly without additional language model support
Efficient processing
Supports 16kHz sampled speech input, suitable for real-time applications
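To illustrate what decoding without a language model can look like, here is a small sketch of greedy CTC decoding. It assumes the model uses a CTC output head, as fine-tuned Wav2Vec2 models typically do. The toy vocabulary and the `greedy_ctc_decode` helper are hypothetical and are not taken from the model's actual tokenizer.

```python
def greedy_ctc_decode(frame_ids, id_to_char, blank_id=0):
    """Collapse per-frame argmax ids into text: merge repeats, then drop blanks."""
    chars = []
    previous = None
    for token_id in frame_ids:
        # Emit a character only when the id changes and is not the blank symbol.
        if token_id != previous and token_id != blank_id:
            chars.append(id_to_char[token_id])
        previous = token_id
    return "".join(chars)


# Hypothetical toy vocabulary; id 0 plays the role of the CTC blank.
toy_vocab = {0: "<blank>", 1: "x", 2: "i", 3: "n", 4: " "}

# Per-frame best ids, as they might come out of an argmax over the logits.
frames = [1, 1, 0, 2, 2, 0, 3, 3]
decoded = greedy_ctc_decode(frames, toy_vocab)
```

The blank symbol is what allows genuinely repeated characters to survive: the frames `[3, 0, 3]` decode to two characters, while `[3, 3]` decode to one.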

Model Capabilities

Vietnamese speech recognition
Speech-to-text
Automatic speech recognition

Use Cases

Speech transcription
Convert Vietnamese speech content into text
Achieves a word error rate (WER) of 31.35% on the Common Voice test set
Smart assistants
Vietnamese voice command recognition
Used for human-computer interaction in Vietnamese smart voice assistants
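For readers unfamiliar with the WER figure quoted above, the sketch below shows how word error rate is commonly computed: the word-level edit distance between a reference transcript and the model output, divided by the number of reference words. The `word_error_rate` helper and the sample sentences are illustrative assumptions, not the evaluation script used for this model.

```python
def word_error_rate(reference, hypothesis):
    """Word-level Levenshtein distance divided by the reference length."""
    ref_words = reference.split()
    hyp_words = hypothesis.split()
    if not ref_words:
        raise ValueError("reference transcript must contain at least one word")

    # previous[j] holds the edit distance between the reference prefix seen
    # so far and the first j hypothesis words.
    previous = list(range(len(hyp_words) + 1))
    for i, ref_word in enumerate(ref_words, start=1):
        current = [i]
        for j, hyp_word in enumerate(hyp_words, start=1):
            substitution = previous[j - 1] + (0 if ref_word == hyp_word else 1)
            deletion = previous[j] + 1
            insertion = current[j - 1] + 1
            current.append(min(substitution, deletion, insertion))
        previous = current

    return previous[-1] / len(ref_words)


# Illustrative sentences only: one of five reference words is missing.
reference_text = "tôi đang học tiếng việt"
hypothesis_text = "tôi đang học tiếng"
wer = word_error_rate(reference_text, hypothesis_text)
```

Because insertions count as errors, WER can exceed 100% when the hypothesis is much longer than the reference.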