W

Wav2vec2 Large Xls R 300m Urdu

Developed by kingabzpro
A speech recognition model fine-tuned on the Common Voice 8 Urdu dataset based on facebook/wav2vec2-xls-r-300m
Downloads 91.36k
Release Time : 3/2/2022

Model Overview

This model is an optimized automatic speech recognition (ASR) model for Urdu, based on the wav2vec2 architecture and fine-tuned on the Common Voice 8 dataset, supporting Urdu speech-to-text tasks.

Model Features

Urdu optimization
Specifically optimized for Urdu speech recognition tasks
Based on wav2vec2 architecture
Uses Facebook's wav2vec2-xls-r-300m pre-trained model as the base
Fine-tuned on Common Voice dataset
Fine-tuned on the Mozilla Common Voice 8 Urdu dataset

Model Capabilities

Urdu speech recognition
Speech-to-text
Long audio processing (supports chunk processing)

Use Cases

Speech transcription
Urdu speech transcription
Convert Urdu speech content into text
Test set WER 39.89, CER 16.7
Voice assistant
Urdu voice command recognition
Used for command recognition in Urdu voice assistant systems
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase