W

Wav2vec2 Large Xlsr Kinyarwanda Apostrophied

Developed by lucio
A fine-tuned model based on facebook/wav2vec2-large-xlsr-53 for Kinyarwanda, capable of predicting apostrophes in marked pronouns and vowel-initial word contractions
Downloads 28
Release Time : 3/2/2022

Model Overview

This is an automatic speech recognition (ASR) model for Kinyarwanda, built on the Wav2Vec2 architecture and fine-tuned on the Common Voice dataset, with special optimization for apostrophe recognition.

Model Features

Apostrophe Recognition Optimization
Compared to similar models, this model specifically optimizes the recognition of apostrophes in marked pronouns and vowel-initial word contractions
Data Filtering for Training
Only high-quality speech segments from the Common Voice dataset without downvotes and with durations not exceeding 9.5 seconds were used for training
Efficient Training
Achieved efficient training through data chunking strategy (32k sample chunks), completing approximately 60 hours of training on a single V100 GPU

Model Capabilities

Kinyarwanda speech recognition
16kHz audio processing
Continuous speech-to-text conversion

Use Cases

Speech Transcription
Kinyarwanda Speech Transcription
Convert Kinyarwanda speech content into text format
Test word error rate 39.92%
Voice Assistants
Kinyarwanda Voice Command Recognition
Provide speech recognition capabilities for Kinyarwanda voice assistants
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase