W

Wav2vec2 Large Xls R 300m Urdu Cv8 200epochs

Developed by omar47
Urdu speech recognition model trained on Common Voice dataset, using wav2vec 2.0 architecture
Downloads 20
Release Time : 4/20/2022

Model Overview

This model is a large-scale speech recognition system based on Facebook's wav2vec 2.0 architecture, specifically optimized for Urdu. The model was trained for 200 epochs on the Common Voice dataset with 300 million parameters.

Model Features

Large-scale pretraining
Based on the 300M-parameter wav2vec 2.0 architecture with powerful speech feature extraction capabilities
Urdu optimization
Specially trained and optimized for Urdu language, suitable for Urdu speech recognition tasks
Extended training
Thoroughly trained for 200 epochs on the Common Voice dataset

Model Capabilities

Urdu speech recognition
Speech-to-text
Automatic speech transcription

Use Cases

Speech transcription
Urdu speech transcription
Convert Urdu speech content into text
Word Error Rate (WER) of 0.7723
Voice assistants
Urdu voice assistant
Provide voice interaction capabilities for Urdu users
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
Š 2025AIbase