W

Wav2vec2 Large Xlsr Mvc Swahili

Developed by eddiegulay
This model is a fine-tuned version of facebook/wav2vec2-large-xlsr-53, specifically designed for automatic speech recognition tasks in Swahili.
Downloads 9,413
Release Time : 11/6/2023

Model Overview

This is an optimized automatic speech recognition model for Swahili, based on the wav2vec2 architecture and fine-tuned on the Common Voice 13.0 dataset.

Model Features

Swahili optimization
Specially fine-tuned for Swahili to provide better speech recognition performance
Based on wav2vec2-large-xlsr-53
Built upon the powerful wav2vec2-large-xlsr-53 base model with excellent speech feature extraction capabilities
Low word error rate
Achieves a word error rate of 0.2 on the Common Voice test set

Model Capabilities

Swahili speech recognition
Audio transcription
Speech-to-text

Use Cases

Speech transcription
Swahili speech transcription
Convert Swahili speech content into text
Word error rate 0.2
Voice assistant
Swahili voice assistant
Build voice interaction systems supporting Swahili
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
Š 2025AIbase