W

Wav2vec2 Large Xls R 300m Vietnamese Colab

Developed by Jungwonchang
This model is a Vietnamese speech recognition model fine-tuned on the Common Voice dataset based on facebook/wav2vec2-xls-r-300m
Downloads 22
Release Time : 3/17/2022

Model Overview

This is a Vietnamese-optimized speech recognition model based on the wav2vec2 architecture, suitable for Vietnamese speech-to-text tasks

Model Features

Vietnamese optimization
Specially fine-tuned for Vietnamese to improve speech recognition accuracy
Based on XLS-R architecture
Utilizes Facebook's XLS-R large-scale cross-lingual speech representation learning architecture
Medium scale
A balanced model with 300 million parameters, considering both performance and efficiency

Model Capabilities

Vietnamese speech recognition
Speech-to-text
Automatic speech transcription

Use Cases

Speech transcription
Vietnamese meeting minutes
Automatically convert Vietnamese meeting recordings into text transcripts
Voice assistant
Provide speech recognition capabilities for Vietnamese voice assistants
Education
Language learning applications
Help learners practice Vietnamese pronunciation and listening
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
Š 2025AIbase