D

Dvoice Darija

Developed by aioxlabs
This model is an automatic speech recognition system trained on the DVoice Dari dataset, using the wav2vec 2.0 architecture, supporting transcription of Moroccan Arabic dialect.
Downloads 22
Release Time : 5/25/2022

Model Overview

An end-to-end automatic speech recognition system specifically optimized for Darija (Moroccan Arabic dialect), capable of converting speech to text.

Model Features

Low-resource language optimization
Specially optimized for Darija as a low-resource language
Pre-trained model fine-tuning
Fine-tuned based on the facebook/wav2vec2-large-xlsr-53 pre-trained model
Community-driven data
Uses real community-recorded data collected via the DVoice platform
End-to-end solution
Provides a complete workflow from audio preprocessing to text output

Model Capabilities

Dari speech recognition
Audio transcription
Speech-to-text

Use Cases

Speech transcription
Darija speech transcription
Converts Moroccan Arabic dialect speech content into text
Validation set CER 5.51%, WER 18.46%
Language technology development
Darija voice application development
Serves as a foundational model for Darija voice assistants and customer service systems
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
Š 2025AIbase