Wav2vec2 Large Xls R 300m Vietnamese Colab
W
Wav2vec2 Large Xls R 300m Vietnamese Colab
Developed by Jungwonchang
This model is a Vietnamese speech recognition model fine-tuned on the Common Voice dataset based on facebook/wav2vec2-xls-r-300m
Downloads 22
Release Time : 3/17/2022
Model Overview
This is a Vietnamese-optimized speech recognition model based on the wav2vec2 architecture, suitable for Vietnamese speech-to-text tasks
Model Features
Vietnamese optimization
Specially fine-tuned for Vietnamese to improve speech recognition accuracy
Based on XLS-R architecture
Utilizes Facebook's XLS-R large-scale cross-lingual speech representation learning architecture
Medium scale
A balanced model with 300 million parameters, considering both performance and efficiency
Model Capabilities
Vietnamese speech recognition
Speech-to-text
Automatic speech transcription
Use Cases
Speech transcription
Vietnamese meeting minutes
Automatically convert Vietnamese meeting recordings into text transcripts
Voice assistant
Provide speech recognition capabilities for Vietnamese voice assistants
Education
Language learning applications
Help learners practice Vietnamese pronunciation and listening
Featured Recommended AI Models
Š 2025AIbase