
Wav2vec2 Large Xlsr 53 Russian

Developed by anton-l
A Russian automatic speech recognition (ASR) model fine-tuned from facebook/wav2vec2-large-xlsr-53. It achieves a 17.39% word error rate (WER) on the Common Voice Russian test set.
Downloads 735
Release date: 3/2/2022

Model Overview

An automatic speech recognition model for Russian, built on the Wav2Vec2 architecture. It converts Russian speech to text.

Model Features

High-accuracy Russian recognition
Achieves a 17.39% word error rate (WER) on the Common Voice Russian test set
No language model required
Can be used directly without additional language model support
Based on XLSR pre-training
Fine-tuned from a large-scale cross-lingual speech representation (XLSR) pre-trained model
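
Running without a language model typically means the text is read straight off the acoustic model's per-frame predictions with greedy CTC decoding: take the most likely token for each frame, merge consecutive repeats, then drop the blank token. The sketch below illustrates that rule on a toy example. The vocabulary, the blank id of 0, and the function name are assumptions made for illustration; the real model ships its own vocabulary and tokenizer.

```python
from itertools import groupby


def ctc_greedy_decode(frame_ids, id_to_char, blank_id=0):
    """Turn per-frame argmax token ids into text using the CTC collapse rule."""
    # Step 1: merge runs of identical consecutive ids into a single id.
    collapsed = [token for token, _ in groupby(frame_ids)]
    # Step 2: drop blanks; a blank between two equal ids keeps them distinct.
    return "".join(id_to_char[t] for t in collapsed if t != blank_id)


# Hypothetical toy vocabulary (not the model's real one).
vocab = {0: "", 1: "д", 2: "а", 3: " "}
frames = [1, 1, 0, 2, 2, 2, 0, 0, 3, 1, 0, 2]
print(ctc_greedy_decode(frames, vocab))  # да да
```

Because no external language model rescoring is involved, the output depends only on the acoustic model, which keeps deployment simple at the cost of some accuracy on rare or ambiguous words.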

Model Capabilities

Russian speech recognition
Speech-to-text
16kHz audio processing
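
A minimal sketch of how these capabilities might be exercised with the Hugging Face transformers library, assuming torch, torchaudio, and transformers are installed. The Hub identifier below is an assumption inferred from the developer and model names on this page, and the function name is hypothetical. Audio is downmixed to mono and resampled to 16 kHz before inference, since that is the rate the model processes.

```python
TARGET_SAMPLE_RATE = 16_000  # the model processes 16 kHz audio
# Assumed Hub id, inferred from the developer and model name; verify before use.
MODEL_ID = "anton-l/wav2vec2-large-xlsr-53-russian"


def transcribe(path, model_id=MODEL_ID):
    """Transcribe one audio file to Russian text with greedy decoding."""
    # Heavy dependencies are imported here so this module loads without them.
    import torch
    import torchaudio
    from transformers import Wav2Vec2ForCTC, Wav2Vec2Processor

    processor = Wav2Vec2Processor.from_pretrained(model_id)
    model = Wav2Vec2ForCTC.from_pretrained(model_id)
    model.eval()

    waveform, sample_rate = torchaudio.load(path)  # shape: (channels, samples)
    waveform = waveform.mean(dim=0)  # downmix to mono
    if sample_rate != TARGET_SAMPLE_RATE:
        waveform = torchaudio.functional.resample(
            waveform, sample_rate, TARGET_SAMPLE_RATE
        )

    inputs = processor(
        waveform.numpy(), sampling_rate=TARGET_SAMPLE_RATE, return_tensors="pt"
    )
    with torch.no_grad():
        logits = model(**inputs).logits  # shape: (batch, frames, vocab)
    predicted_ids = torch.argmax(logits, dim=-1)  # greedy, no language model
    return processor.batch_decode(predicted_ids)[0]
```

Calling transcribe on a local recording (for example a hypothetical file named sample.wav) downloads the weights on first use and returns the transcript as a string.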

Use Cases

Speech transcription
Russian meeting minutes
Automatically transcribe Russian meetings or interviews
WER of 17.39% on the Common Voice Russian test set (roughly 82.61% word-level accuracy, taken as 100% minus WER)
Voice assistant
Provide speech recognition capabilities for Russian voice assistants
Accessibility technology
Real-time caption generation
Generate real-time captions for Russian video content
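
To judge whether the quoted WER holds up on your own meeting or caption audio, it helps to measure it on a few hand-transcribed samples. WER is the word-level edit distance (substitutions, deletions, and insertions) divided by the number of reference words. The following is a minimal sketch, not the evaluation script behind the 17.39% figure; the function name is hypothetical, and it assumes both strings are already normalized (lowercased, punctuation removed).

```python
def word_error_rate(reference, hypothesis):
    """Word-level Levenshtein distance divided by the reference length."""
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the processed reference
    # prefix and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]  # deleting all i reference words
        for j, hyp_word in enumerate(hyp, start=1):
            cost = 0 if ref_word == hyp_word else 1
            curr.append(min(
                prev[j] + 1,         # deletion
                curr[j - 1] + 1,     # insertion
                prev[j - 1] + cost,  # match or substitution
            ))
        prev = curr
    return prev[-1] / len(ref)


# One missing word out of five reference words.
print(word_error_rate("привет как у тебя дела", "привет как тебя дела"))  # 0.2
```

Because insertions count as errors, WER can exceed 100%, so reading 100% minus WER as accuracy is only a rough approximation.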