W

Wav2vec2 Large Xlsr Vietnamese

Developed by CuongLD
This is a Vietnamese fine-tuned speech recognition model based on facebook/wav2vec2-large-xlsr-53, trained using the Common Voice and Infore_25h datasets.
Downloads 37
Release Time : 3/2/2022

Model Overview

This model is specifically designed for Vietnamese speech recognition tasks, supporting 16kHz sampling rate audio input.

Model Features

Multi-dataset training
Trained using both Common Voice and Infore_25h datasets to enhance model generalization.
16kHz sampling rate support
Specially optimized for 16kHz sampling rate audio input recognition.
No language model required
Can be used directly without additional language model support.

Model Capabilities

Vietnamese speech recognition
Automatic speech-to-text

Use Cases

Speech transcription
Vietnamese speech transcription
Convert Vietnamese speech content into text
WER of 58.63% on Common Voice Vietnamese test set
Voice assistants
Vietnamese voice command recognition
Basic speech recognition component for Vietnamese voice assistants
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
Š 2025AIbase