W

Wav2vec2 Large Xlsr 53 Russian

Developed by jonatasgrosman
A Russian speech recognition model fine-tuned from facebook/wav2vec2-large-xlsr-53, supporting 16kHz sampled audio input
Downloads 3.9M
Release Time : 3/2/2022

Model Overview

This is an automatic speech recognition (ASR) model optimized for Russian, fine-tuned on the XLSR-53 architecture, demonstrating excellent performance on the Common Voice Russian dataset

Model Features

High-performance Russian recognition
Achieves 13.3% word error rate and 2.88% character error rate on the Common Voice Russian test set
Language model enhancement support
When combined with a language model, the word error rate can be reduced to 9.57% and character error rate to 2.24%
Multi-dataset training
Trained and validated using Common Voice 6.1 and CSS10 datasets
16kHz sampling rate support
Optimized for 16kHz sampled audio input

Model Capabilities

Russian speech-to-text
Long audio processing (supports chunk processing)
Real-time speech recognition

Use Cases

Speech transcription
Russian speech transcription
Convert Russian speech content to text
Achieves 13.3% word error rate on the Common Voice test set
Voice assistants
Russian voice command recognition
Recognize Russian voice commands
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase