S

Stt Kk Ru Fastconformer Hybrid Large

Developed by nvidia
NVIDIA FastConformer-Hybrid Large (kk-ru) is a speech recognition model that can transcribe Kazakh and Russian speech into lowercase text.
Downloads 930
Release Time : 9/10/2024

Model Overview

This model is based on the FastConformer Transducer - CTC architecture and is a hybrid model. It is trained by combining two loss functions, Token - and - Duration Transducer and CTC, and is suitable for speech recognition tasks in Kazakh and Russian.

Model Features

Multilingual support
Supports speech recognition in Kazakh and Russian.
Hybrid model architecture
Trained by combining two loss functions, Token - and - Duration Transducer and CTC, to improve model performance.
High performance
Performs well on multiple test sets with a low word error rate (WER).

Model Capabilities

Speech recognition
Multilingual transcription
Non-streaming speech processing

Use Cases

Speech transcription
Kazakh speech transcription
Transcribe Kazakh speech into text.
The WER on the KSC2 test set (read speech) is 4.43%.
Russian speech transcription
Transcribe Russian speech into text.
The WER on the MCV12 test set is 6.29%.
Featured Recommended AI Models
ยฉ 2025AIbase