W

Wav2vec2 Large Xls R 300m Hi Cv8 B2

Developed by DrishtiSharma
This is an automatic speech recognition (ASR) model fine-tuned on the Hindi Common Voice 8.0 dataset, based on Facebook's wav2vec2-xls-r-300m model.
Downloads 22
Release Time : 3/2/2022

Model Overview

This model is specifically designed for Hindi automatic speech recognition tasks, trained on the Common Voice 8.0 dataset, achieving a low word error rate (WER).

Model Features

High-performance Hindi recognition
Achieved a word error rate (WER) of 38.9% and a character error rate (CER) of 13.0% on the Hindi test set of Common Voice 8.0
Based on XLS-R architecture
Uses Facebook's wav2vec2-XLS-R-300m as the base model, featuring powerful speech feature extraction capabilities
Fine-tuned
Optimized model performance through 35 training epochs using linear learning rate scheduling and warm-up strategies

Model Capabilities

Hindi speech recognition
Speech-to-text
Robust speech event detection

Use Cases

Speech transcription
Hindi speech-to-text
Convert Hindi speech content into text
Achieved 38.9% WER on the test set
Voice assistants
Hindi voice command recognition
Recognize and understand Hindi voice commands
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase