๐ KinyaWhisper
KinyaWhisper is a fine - tuned OpenAI's Whisper model for Kinyarwanda automatic speech recognition, offering a reproducible baseline for low - resource indigenous languages.
๐ Quick Start
KinyaWhisper is a fine - tuned version of OpenAIโs Whisper model for automatic speech recognition (ASR) in Kinyarwanda. It was trained on 102 manually labeled .wav files and serves as a reproducible baseline for speech recognition in low - resource, indigenous languages.
๐ป Usage Examples
Basic Usage
from transformers import WhisperProcessor, WhisperForConditionalGeneration
import torchaudio
model = WhisperForConditionalGeneration.from_pretrained("benax - rw/KinyaWhisper")
processor = WhisperProcessor.from_pretrained("benax - rw/KinyaWhisper")
waveform, sample_rate = torchaudio.load("your_audio.wav")
inputs = processor(waveform.squeeze(), sampling_rate = sample_rate, return_tensors = "pt")
predicted_ids = model.generate(inputs["input_features"])
transcription = processor.batch_decode(predicted_ids, skip_special_tokens = True)[0]
print("๐ฃ๏ธ Transcription:", transcription)
๐ง Technical Details
Property |
Details |
Model Type |
openai/whisper - small |
Epochs |
80 |
Batch size |
4 |
Learning rate |
1e - 5 |
Optimizer |
Adam |
Final loss |
0.00024 |
WER |
51.85% |
โ ๏ธ Important Note
The model was trained on a small dataset (102 samples). It performs best on short, clear Kinyarwanda utterances and may struggle with longer or noisy audio. This is an early - stage educational model, not yet suitable for production use.
๐ Documentation
Model Index
- Name: KinyaWhisper
- Results:
- Task:
- Name: Automatic Speech Recognition
- Type: automatic - speech - recognition
- Dataset:
- Name: KinyaWhisper Custom Dataset
- Type: custom
- Config: kinyarwanda
- Metrics:
- Name: WER
- Type: wer
- Value: 51.85
Citation
If you use this model, please cite:
@misc{baziramwabo2025kinyawhisper,
author = {Gabriel Baziramwabo},
title = {KinyaWhisper: Fine - Tuning Whisper for Kinyarwanda ASR},
year = {2025},
publisher = {Hugging Face},
howpublished = {\url{https://huggingface.co/benax - rw/KinyaWhisper}},
note = {Version 1.0}
}
๐ License
This project is licensed under the MIT license.
๐ฌ Contact
Maintained by Gabriel Baziramwabo.
โ๏ธ gabriel@benax.rw
๐ https://benax.rw