Wav2vec2 Large Xls R 300m Bn Colab
This model is a fine-tuned speech recognition model based on facebook/wav2vec2-xls-r-300m on the common_voice_9_0 dataset, supporting Bengali.
Downloads 18
Release Time : 6/23/2022
Model Overview
This is a speech recognition model optimized for Bengali, fine-tuned based on the wav2vec2-xls-r-300m architecture, suitable for speech-to-text tasks.
Model Features
Fine-tuned based on large-scale pre-trained model
Optimized for Bengali based on facebook/wav2vec2-xls-r-300m
Multilingual support
Focused on Bengali speech recognition while potentially retaining the original model's multilingual capabilities
Efficient training
Optimized training efficiency using techniques like mixed-precision training and gradient accumulation
Model Capabilities
Speech recognition
Audio-to-text conversion
Bengali language processing
Use Cases
Speech transcription
Bengali speech-to-text
Convert Bengali speech content into text
Word Error Rate (WER) 0.9861
Voice assistants
Bengali voice command recognition
Used for understanding Bengali voice commands
Featured Recommended AI Models
Š 2025AIbase