Wav2vec2 Xls R 300m Demo Colab
This model is a fine-tuned speech recognition model based on facebook/wav2vec2-xls-r-300m on the common_voice dataset
Downloads 16
Release Time : 3/2/2022
Model Overview
A fine-tuned model for speech recognition tasks, based on the wav2vec2-xls-r-300m architecture, trained on the common_voice dataset
Model Features
Efficient Fine-tuning
Fine-tuned based on the pre-trained wav2vec2-xls-r-300m model, achieving good results on the common_voice dataset
Excellent Performance
Achieved a word error rate of 1.0377 on the evaluation set, demonstrating outstanding performance
Fast Inference
Can process 25.239 samples per second, with relatively fast inference speed
Model Capabilities
Speech Recognition
Audio to Text
Use Cases
Speech Transcription
Speech to Text
Convert speech content into text records
Word error rate 1.0377
Voice Assistants
Voice Command Recognition
Recognize user voice commands
Featured Recommended AI Models
Š 2025AIbase