Wav2vec2 Large Xls R 300m Russian Colab Beam Search Test
This model is a Russian speech recognition model fine-tuned on the Common Voice dataset based on facebook/wav2vec2-xls-r-300m, achieving a word error rate of 0.468 on the evaluation set.
Downloads 18
Release Time : 4/7/2022
Model Overview
This is a speech recognition model optimized for Russian, fine-tuned based on the wav2vec2-xls-r-300m architecture, suitable for Russian speech-to-text tasks.
Model Features
Russian Optimization
Specifically fine-tuned for Russian speech data, improving the accuracy of Russian recognition.
Low Word Error Rate
Achieved a word error rate of 0.468 on the evaluation set, demonstrating good performance.
Fine-tuned from Large Model
Fine-tuned from the facebook/wav2vec2-xls-r-300m large model, inheriting its powerful speech feature extraction capabilities.
Model Capabilities
Russian Speech Recognition
Speech-to-Text
Automatic Speech Recognition
Use Cases
Speech Transcription
Russian Meeting Minutes
Automatically transcribe Russian meeting recordings into text
Accuracy approximately 53.2% (word error rate 0.468)
Russian Voice Assistant
Used as the speech recognition module for Russian voice assistants
Education
Russian Learning Aid
Assist Russian learners in checking pronunciation accuracy
Featured Recommended AI Models
Š 2025AIbase