WHISPER-SMALL-SWAHILI-ASR-CV-14 Open-source Speech Recognition Model

WHISPER SMALL SWAHILI ASR CV 14

Developed by dmusingu

A speech recognition model fine-tuned from OpenAI's Whisper large on the Common Voice 14.0 Swahili (sw) dataset, achieving a word error rate (WER) of 25.13%.

Speech Recognition

Transformers

Open Source License: Apache-2.0

Tags: #Swahili ASR #Low Word Error Rate #Speech-to-Text

Downloads 28

Release Time: 4/19/2024

Model Overview

An automatic speech recognition (ASR) model for Swahili, fine-tuned from a pretrained Whisper checkpoint and suited to speech-to-text tasks.
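A minimal sketch of how such a checkpoint is typically loaded for transcription with the Transformers `pipeline` API. The Hub id `dmusingu/WHISPER-SMALL-SWAHILI-ASR-CV-14` is an assumption inferred from the title and developer name, and the helper names `build_transcriber` and `transcribe` are hypothetical.

```python
# Assumed Hub id, inferred from the page title and developer name.
MODEL_ID = "dmusingu/WHISPER-SMALL-SWAHILI-ASR-CV-14"


def build_transcriber(model_id: str = MODEL_ID):
    """Create a Hugging Face ASR pipeline for the fine-tuned checkpoint."""
    # Imported lazily so the helpers can be imported without transformers installed.
    from transformers import pipeline

    return pipeline("automatic-speech-recognition", model=model_id)


def transcribe(audio_path: str, transcriber=None) -> str:
    """Transcribe one audio file and return the recognized text."""
    if transcriber is None:
        transcriber = build_transcriber()
    result = transcriber(audio_path)  # the pipeline returns a dict with a "text" key
    return result["text"]
```

Calling `transcribe("sample_sw.wav")` (a hypothetical file name) downloads the checkpoint on first use. Decoding a file path generally requires ffmpeg to be installed on the system.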

Model Features

Low Word Error Rate

Achieves a word error rate (WER) of 25.13% on the Common Voice 14.0 Swahili test set

Based on Whisper Architecture

Fine-tuned from OpenAI's Whisper-large model, inheriting its pretrained multilingual speech recognition capabilities

Optimized Specifically for Swahili

Trained on the Common Voice 14.0 Swahili dataset for better recognition performance in this language

Model Capabilities

Speech-to-Text

Swahili Speech Recognition

Long Audio Processing
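Whisper models operate on 30-second input windows, so long recordings are usually split into overlapping chunks that are transcribed separately and then merged. The page does not say how this model handles long audio; the sketch below only illustrates the windowing step, and the function name `chunk_bounds` and the 5-second overlap are assumptions.

```python
def chunk_bounds(total_s: float, window_s: float = 30.0, overlap_s: float = 5.0):
    """Return (start, end) times in seconds covering a recording of length total_s."""
    if window_s <= 0 or not 0 <= overlap_s < window_s:
        raise ValueError("need window_s > 0 and 0 <= overlap_s < window_s")
    step = window_s - overlap_s  # how far each new window advances
    bounds = []
    start = 0.0
    while start < total_s:
        end = min(start + window_s, total_s)
        bounds.append((start, end))
        if end >= total_s:  # the last window reached the end of the audio
            break
        start += step
    return bounds
```

In practice the Transformers ASR pipeline can do this internally when called with `chunk_length_s=30`, so a hand-written splitter is mostly useful for understanding or for custom merging logic.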

Use Cases

Speech Transcription

Swahili Speech Transcription

Convert Swahili speech content into text

Word error rate 25.13%, character error rate 9.83%
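For readers unfamiliar with the two metrics: WER is the word-level edit distance between the reference and the model output divided by the number of reference words, and CER is the same ratio computed over characters. The following is a plain-Python sketch of those definitions, not the evaluation script used for this model; the function names are illustrative.

```python
def edit_distance(ref, hyp) -> int:
    """Levenshtein distance between two sequences (substitute, insert, delete)."""
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        curr = [i]
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # match or substitution
        prev = curr
    return prev[-1]


def wer(reference: str, hypothesis: str) -> float:
    """Word error rate: word-level edits divided by reference word count."""
    ref_words = reference.split()
    if not ref_words:
        raise ValueError("reference must contain at least one word")
    return edit_distance(ref_words, hypothesis.split()) / len(ref_words)


def cer(reference: str, hypothesis: str) -> float:
    """Character error rate: character-level edits divided by reference length."""
    if not reference:
        raise ValueError("reference must not be empty")
    return edit_distance(reference, hypothesis) / len(reference)
```

For example, `wer("habari ya asubuhi", "habari za asubuhi")` is one substitution over three reference words. Corpus-level scores are normally computed by summing edits and reference lengths over all utterances rather than averaging per-utterance rates.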

Voice Assistants

Swahili Voice Assistant

Provide voice interaction capabilities for Swahili users

Property	Details
Model Type	Fine-tuned version of openai/whisper-large
Training Data	mozilla-foundation/common_voice_14_0
Metrics	WER

Training Loss	Epoch	Step	Cer	Validation Loss	Wer
0.9179	0.51	800	0.1412	0.5355	0.3693
0.3078	1.02	1600	0.1196	0.4343	0.3152
0.1959	1.53	2400	0.1172	0.4068	0.2822
0.1737	2.04	3200	0.1145	0.3922	0.2721
0.1046	2.55	4000	0.1084	0.3958	0.2634
0.1019	3.06	4800	0.1029	0.3957	0.2578
0.0588	3.57	5600	0.1132	0.4013	0.2666
0.0545	4.08	6400	0.1009	0.4112	0.2510
0.0305	4.59	7200	0.0941	0.4183	0.2442
0.0275	5.1	8000	0.1005	0.4303	0.2549
0.0153	5.61	8800	0.0908	0.4374	0.2407
0.014	6.12	9600	0.0983	0.4428	0.2513
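The training log can be scanned programmatically to see which checkpoint scores best on each metric. In the sketch below the values are transcribed in (step, epoch, CER, validation loss, WER) order, with the final row's WER and CER matching the 25.13% and 9.83% reported for the model; the names `Row`, `ROWS`, and `best_by` are illustrative.

```python
from collections import namedtuple

Row = namedtuple("Row", "step epoch cer val_loss wer")

# Evaluation results copied from the training table.
ROWS = [
    Row(800, 0.51, 0.1412, 0.5355, 0.3693),
    Row(1600, 1.02, 0.1196, 0.4343, 0.3152),
    Row(2400, 1.53, 0.1172, 0.4068, 0.2822),
    Row(3200, 2.04, 0.1145, 0.3922, 0.2721),
    Row(4000, 2.55, 0.1084, 0.3958, 0.2634),
    Row(4800, 3.06, 0.1029, 0.3957, 0.2578),
    Row(5600, 3.57, 0.1132, 0.4013, 0.2666),
    Row(6400, 4.08, 0.1009, 0.4112, 0.2510),
    Row(7200, 4.59, 0.0941, 0.4183, 0.2442),
    Row(8000, 5.10, 0.1005, 0.4303, 0.2549),
    Row(8800, 5.61, 0.0908, 0.4374, 0.2407),
    Row(9600, 6.12, 0.0983, 0.4428, 0.2513),
]


def best_by(metric: str) -> Row:
    """Return the logged row with the lowest value of the given metric."""
    return min(ROWS, key=lambda row: getattr(row, metric))


if __name__ == "__main__":
    for metric in ("val_loss", "wer", "cer"):
        row = best_by(metric)
        print(f"lowest {metric}: {getattr(row, metric):.4f} at step {row.step}")
```

Read this way, validation loss bottoms out at step 3200 while WER and CER keep improving until step 8800, so the final checkpoint (step 9600, WER 0.2513) is slightly behind the best logged WER of 0.2407.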
