Wav2vec2hindia
W
Wav2vec2hindia
Developed by SAGAR4REAL
A speech recognition model fine-tuned on the common_voice dataset based on facebook/wav2vec2-xls-r-300m
Downloads 22
Release Time : 3/28/2022
Model Overview
This model is an optimized version for speech recognition tasks in Indian languages, fine-tuned based on the wav2vec2-xls-r-300m architecture
Model Features
Based on XLS-R Architecture
Uses facebook's wav2vec2-xls-r-300m as the base model, featuring powerful speech feature extraction capabilities
Optimized for Indian Languages
Specifically fine-tuned for Indian languages, potentially improving recognition accuracy for these languages
Efficient Training Configuration
Employs mixed-precision training and gradient accumulation techniques to optimize training efficiency
Model Capabilities
Speech Recognition
Audio to Text
Indian Language Processing
Use Cases
Speech Transcription
Indian Language Speech Transcription
Convert speech content in Indian languages to text
Voice Assistants
Indian Language Voice Interaction
Provide voice interaction capabilities for Indian language users
Featured Recommended AI Models