Stt Ru Conformer Ctc Large

Developed by NVIDIA
A large Conformer-CTC model for Russian automatic speech recognition with about 120 million parameters, trained on approximately 1,636 hours of Russian speech data.
Downloads 452
Release date: 11/1/2022

Model Overview

This model transcribes Russian speech into lowercase Cyrillic text with spaces. It uses the Conformer architecture trained with the CTC loss function and is suited to high-quality speech-to-text applications.
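Because the output is lowercase Cyrillic with spaces, reference transcripts usually need to be brought into the same form before they are compared with model output. The following is a minimal sketch of such a normalizer. The function name and the exact character set (the 33 lowercase Russian letters plus space) are assumptions made for illustration; the model's real vocabulary is not listed here.

```python
import re

# Assumption: the target alphabet is the lowercase Russian letters
# (а-я plus ё) and the space character.
_NOT_ALLOWED = re.compile(r"[^а-яё ]")


def normalize_transcript(text: str) -> str:
    """Lowercase the text, replace every character outside the assumed
    alphabet with a space, and collapse runs of whitespace."""
    lowered = text.lower()
    cleaned = _NOT_ALLOWED.sub(" ", lowered)
    return " ".join(cleaned.split())


if __name__ == "__main__":
    print(normalize_transcript("Привет, мир!"))
```

Digits and Latin letters are simply dropped in this sketch; spelling numbers out as words would need a separate step.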

Model Features

High-performance Russian recognition
Reaches low word error rates (WER) on multiple Russian test sets, for example 4.28% on the Common Voice 10.0 test set
Large-scale training data
Trained on approximately 1,636 hours of Russian speech data, including datasets from various sources
Non-autoregressive architecture
Uses the Conformer-CTC architecture, which produces the whole transcript in a single forward pass instead of generating it token by token
Supports multiple application scenarios
Suitable for both close-talking and far-field speech recognition, with good results in both the crowd and farfield test scenarios
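To illustrate what single-pass, non-autoregressive decoding means for a CTC model, here is a minimal sketch of greedy CTC decoding: take the best-scoring token in each frame, merge consecutive repeats, then drop the blank token. The function name, the toy character vocabulary, and the blank index are assumptions for illustration only; this is not the model's own decoder or tokenizer.

```python
from typing import List, Optional, Sequence


def ctc_greedy_decode(
    frame_scores: Sequence[Sequence[float]],
    vocab: Sequence[str],
    blank_id: int,
) -> str:
    """Greedy CTC decoding over per-frame scores.

    frame_scores[t][k] is the score of token k at frame t. All frames are
    scored by one forward pass, so decoding needs no feedback loop.
    """
    pieces: List[str] = []
    previous: Optional[int] = None
    for frame in frame_scores:
        # Index of the highest-scoring token in this frame.
        best = max(range(len(frame)), key=lambda k: frame[k])
        # Emit only when the token changes and is not the blank.
        if best != previous and best != blank_id:
            pieces.append(vocab[best])
        previous = best
    return "".join(pieces)


if __name__ == "__main__":
    toy_vocab = ["д", "а", " "]  # hypothetical vocabulary; blank is index 3
    scores = [
        [0.9, 0.0, 0.0, 0.1],  # д
        [0.8, 0.1, 0.0, 0.1],  # д (repeat, merged)
        [0.0, 0.1, 0.0, 0.9],  # blank
        [0.1, 0.7, 0.1, 0.1],  # а
    ]
    print(ctc_greedy_decode(scores, toy_vocab, blank_id=3))
```

A blank between two identical tokens keeps them separate, which is how CTC represents doubled letters.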

Model Capabilities

Russian speech recognition
Real-time speech-to-text
Supports 16 kHz mono audio input
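Since the model expects 16 kHz mono audio, it can help to check a recording's format before sending it for transcription. The sketch below uses Python's standard `wave` module and therefore only handles uncompressed PCM WAV files; the function name is hypothetical, and resampling or downmixing is left to whatever audio tool is already in use.

```python
import wave


def is_16k_mono(path: str) -> bool:
    """Return True if the PCM WAV file at `path` is 16 kHz, single channel."""
    with wave.open(path, "rb") as wav:
        return wav.getframerate() == 16000 and wav.getnchannels() == 1


def write_silence(path: str, sample_rate: int, channels: int, seconds: float = 0.1) -> None:
    """Write a short silent 16-bit PCM WAV file (used here only for the demo)."""
    n_frames = int(sample_rate * seconds)
    with wave.open(path, "wb") as wav:
        wav.setnchannels(channels)
        wav.setsampwidth(2)  # 2 bytes per sample = 16-bit PCM
        wav.setframerate(sample_rate)
        wav.writeframes(b"\x00\x00" * n_frames * channels)


if __name__ == "__main__":
    import os
    import tempfile

    with tempfile.TemporaryDirectory() as tmp:
        sample = os.path.join(tmp, "sample.wav")
        write_silence(sample, sample_rate=16000, channels=1)
        print(is_16k_mono(sample))
```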

Use Cases

Speech transcription
Meeting minutes
Automatically transcribe Russian meeting recordings into text records
Highly accurate transcription results
Voice assistant
Provide speech recognition capabilities for Russian voice assistants
Low-latency interaction experience
Media processing
Video subtitle generation
Automatically generate subtitles for Russian video content
Subtitles with accuracy exceeding 95%
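Accuracy figures like these are usually derived from the word error rate, with word accuracy taken as 1 minus WER. As a rough sketch of how WER can be measured on one's own recordings, the function below computes the word-level edit distance between a reference transcript and a model transcript. The function name is an assumption, and both inputs are assumed to be already normalized to lowercase text with single spaces.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level Levenshtein distance divided by the reference length."""
    ref_words = reference.split()
    hyp_words = hypothesis.split()
    if not ref_words:
        raise ValueError("reference transcript must contain at least one word")

    # previous[j] = edit distance between the processed reference prefix
    # and the first j hypothesis words.
    previous = list(range(len(hyp_words) + 1))
    for i, ref_word in enumerate(ref_words, start=1):
        current = [i]
        for j, hyp_word in enumerate(hyp_words, start=1):
            substitution = previous[j - 1] + (ref_word != hyp_word)
            deletion = previous[j] + 1
            insertion = current[j - 1] + 1
            current.append(min(substitution, deletion, insertion))
        previous = current
    return previous[-1] / len(ref_words)


if __name__ == "__main__":
    wer = word_error_rate("добрый день всем коллегам", "добрый день коллегам")
    print(f"WER: {wer:.2%}, word accuracy: {1 - wer:.2%}")
```

Corpus-level WER is normally computed by summing edit distances and reference lengths over all utterances rather than by averaging per-utterance rates.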