S

Stt Rw Conformer Transducer Large

Developed by nvidia
This is a large Conformer-Transducer model for Kinyarwanda speech recognition, which can transcribe speech into lowercase Latin letters, supporting spaces and apostrophes.
Downloads 116
Release Time : 7/31/2022

Model Overview

This model is used to transcribe speech into lowercase Latin letters containing spaces and apostrophes and is trained on approximately 2000 hours of Kinyarwanda speech data.

Model Features

High-accuracy transcription
It can accurately transcribe speech into lowercase Latin letters, supporting spaces and apostrophes.
Large model architecture
Based on the non-autoregressive 'large' variant of Conformer, with approximately 120 million parameters and powerful performance.
Ease of use
It can be used in the NeMo toolkit for convenient inference and fine-tuning.

Model Capabilities

Speech recognition
Speech transcription
Support for Kinyarwanda

Use Cases

Speech transcription
Audio file transcription
Transcribe Kinyarwanda speech files into text
The accuracy is relatively high, and the WER on the test set is 16.19%.
Featured Recommended AI Models
ยฉ 2025AIbase