W

Wav2vec2 Large Xls R 300m Bn Colab

Developed by rhr99
This model is a fine-tuned speech recognition model based on facebook/wav2vec2-xls-r-300m on the common_voice_9_0 dataset, supporting Bengali.
Downloads 18
Release Time : 6/23/2022

Model Overview

This is a speech recognition model optimized for Bengali, fine-tuned based on the wav2vec2-xls-r-300m architecture, suitable for speech-to-text tasks.

Model Features

Fine-tuned based on large-scale pre-trained model
Optimized for Bengali based on facebook/wav2vec2-xls-r-300m
Multilingual support
Focused on Bengali speech recognition while potentially retaining the original model's multilingual capabilities
Efficient training
Optimized training efficiency using techniques like mixed-precision training and gradient accumulation

Model Capabilities

Speech recognition
Audio-to-text conversion
Bengali language processing

Use Cases

Speech transcription
Bengali speech-to-text
Convert Bengali speech content into text
Word Error Rate (WER) 0.9861
Voice assistants
Bengali voice command recognition
Used for understanding Bengali voice commands
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
Š 2025AIbase