Wav2vec2 Xls R 1b Russian

Developed by jonatasgrosman
A Russian speech recognition model fine-tuned from XLS-R 1B and trained on datasets including Common Voice 8.0
Downloads 765
Release Time: 3/2/2022

Model Overview

This is an automatic speech recognition (ASR) model for Russian, fine-tuned from Facebook's XLS-R 1B model. It expects audio input sampled at 16 kHz.
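As an illustration of how such a model is typically loaded, here is a minimal sketch using the Hugging Face `transformers` ASR pipeline. The Hub identifier `jonatasgrosman/wav2vec2-xls-r-1b-russian` is an assumption inferred from the developer name and title on this page, and `transcribe` is a hypothetical helper name. Decoding an audio file from a path also assumes `ffmpeg` is installed.

```python
# Assumed Hub identifier, inferred from the developer and model name above.
MODEL_ID = "jonatasgrosman/wav2vec2-xls-r-1b-russian"


def transcribe(audio_path, model_id=MODEL_ID):
    """Return the transcript of one audio file (hypothetical helper)."""
    # Imported lazily so the module can be loaded without transformers installed.
    from transformers import pipeline

    # Builds an ASR pipeline; the first call downloads the model weights.
    asr = pipeline("automatic-speech-recognition", model=model_id)

    # When given a file path, the pipeline decodes the audio and resamples it
    # to the sampling rate the model's feature extractor expects.
    result = asr(audio_path)
    return result["text"]
```

Calling `transcribe("sample_ru.wav")`, where `sample_ru.wav` stands in for any local Russian recording, would return the recognized text as a string.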

Model Features

High-performance Russian recognition
Achieves 9.82% WER and 2.3% CER on the Common Voice 8.0 test set
Language model enhancement
With language model integration, WER drops to 7.08% and CER to 1.87% on the same test set
Multi-dataset training
Trained on multiple datasets including Common Voice 8.0, Golos, and Multilingual TEDx
Robust performance
Achieves 14.23% WER on the Robust Speech Event test data
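For readers unfamiliar with the metrics quoted above, word error rate (WER) and character error rate (CER) are both edit distances normalized by the length of the reference transcript. The following is a minimal sketch of that definition, not the evaluation script used for this model. It assumes whitespace tokenization for WER and counts spaces as characters for CER; real evaluations usually also normalize case and punctuation first.

```python
def edit_distance(ref, hyp):
    """Levenshtein distance between two sequences (words or characters)."""
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        curr = [i]
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # substitution or match
        prev = curr
    return prev[-1]


def wer(reference, hypothesis):
    """Word error rate: word-level edits divided by reference word count."""
    ref_words = reference.split()
    if not ref_words:
        raise ValueError("reference must contain at least one word")
    return edit_distance(ref_words, hypothesis.split()) / len(ref_words)


def cer(reference, hypothesis):
    """Character error rate: character-level edits divided by reference length."""
    if not reference:
        raise ValueError("reference must not be empty")
    return edit_distance(reference, hypothesis) / len(reference)


# One wrong word out of three; one wrong character out of fifteen.
reference = "привет как дела"
hypothesis = "привет как дело"
print(f"WER = {wer(reference, hypothesis):.3f}")
print(f"CER = {cer(reference, hypothesis):.3f}")
```

A WER of 9.82% therefore means roughly one word-level edit per ten reference words. CER is usually much lower because a single misrecognized word often differs by only a character or two.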

Model Capabilities

Russian speech recognition
Speech-to-text
Processes audio sampled at 16 kHz
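Because the model works on 16 kHz audio, recordings at other sampling rates need to be resampled first. The sketch below shows the idea with plain linear interpolation in pure Python. It is illustrative only: `resample_linear` is a hypothetical helper, and it applies no anti-aliasing filter, so for real use a library routine such as `librosa.load(path, sr=16000)` or `torchaudio.functional.resample` is the better choice.

```python
def resample_linear(samples, src_rate, dst_rate=16_000):
    """Resample a mono signal to dst_rate by linear interpolation.

    Illustrative only: no low-pass filtering is applied before downsampling.
    """
    if src_rate <= 0 or dst_rate <= 0:
        raise ValueError("sampling rates must be positive")
    if not samples:
        return []
    if src_rate == dst_rate:
        return [float(s) for s in samples]

    n_out = max(1, round(len(samples) * dst_rate / src_rate))
    step = src_rate / dst_rate  # source samples advanced per output sample
    last = len(samples) - 1
    out = []
    for k in range(n_out):
        pos = k * step
        i = int(pos)
        if i >= last:
            # Past the final source sample: hold the last value.
            out.append(float(samples[last]))
            continue
        frac = pos - i
        out.append(samples[i] * (1.0 - frac) + samples[i + 1] * frac)
    return out


# One second of a 48 kHz ramp becomes 16,000 samples at 16 kHz.
one_second = [float(n) for n in range(48_000)]
print(len(resample_linear(one_second, 48_000)))
```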

Use Cases

Speech transcription
Russian speech-to-text
Convert Russian speech content into text
Transcription accuracy of 9.82% WER on the Common Voice 8.0 test set
Voice assistants
Russian voice command recognition
Recognizes spoken commands in Russian voice assistants or control systems
Fast and accurate command understanding