Wav2vec2 Xls R 300m Bangla Command Generated Data Finetune
A Bengali speech recognition model based on the wav2vec2-xls-r-300m architecture, fine-tuned for command recognition tasks
Downloads 24
Release Time: 3/2/2022
Model Overview
This model is a fine-tuned version of hrdipto/wav2vec2-xls-r-300m-bangla-command-data, adapted specifically for Bengali command recognition tasks
Model Features
Efficient speech recognition
Optimized for Bengali command recognition, with a word error rate (WER) of 0.0208 (about 2.1%) in evaluation
Fast inference
Processed 75.217 samples per second during evaluation (about 13 ms per sample on average), which suggests it is suitable for real-time applications
Transfer learning
Builds on the pre-trained wav2vec2-xls-r-300m model, reusing its learned speech representations rather than training from scratch
Model Capabilities
Bengali speech recognition
Command word recognition
Real-time speech processing
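A minimal sketch of how a wav2vec2 checkpoint like this one might be run with the Hugging Face transformers pipeline. The helper names build_asr and transcribe are hypothetical. The default model ID is the base checkpoint named in the overview, used here only as a placeholder; the hub ID of this fine-tuned checkpoint is not given in the text and should be substituted. Passing a file path to the pipeline is assumed to require ffmpeg for audio decoding.

```python
def build_asr(model_id="hrdipto/wav2vec2-xls-r-300m-bangla-command-data"):
    """Create a speech recognition pipeline.

    Assumption: model_id is a placeholder; replace it with the hub ID
    of the checkpoint you actually want to use.
    """
    # Imported lazily so the rest of the module works without transformers.
    from transformers import pipeline
    return pipeline("automatic-speech-recognition", model=model_id)


def transcribe(asr, audio_path):
    """Run the recognizer on one audio file and return the cleaned transcript."""
    result = asr(audio_path)  # the pipeline returns a dict with a "text" key
    return result["text"].strip()


# Example usage (downloads the model on first run; the file name is hypothetical):
# asr = build_asr()
# print(transcribe(asr, "command.wav"))
```

Keeping the pipeline object separate from the transcription call lets the model load once and be reused across many short command clips.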
Use Cases
Smart home control
Voice-controlled devices
Control smart home devices using Bengali voice commands
High-accuracy command recognition
Voice assistants
Localized voice interaction
Provide voice interaction functionality for Bengali users
Low-latency speech recognition
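For use cases like these, the recognized transcript still has to be mapped to a device action. The following is a rough sketch of one way to do that, assuming a small fixed command table. The phrases, action names, and the match_command helper are all hypothetical, and the fuzzy fallback uses Python's standard difflib rather than anything provided by the model.

```python
import difflib

# Hypothetical command table: phrases and action names are illustrative only.
COMMANDS = {
    "আলো জ্বালাও": "light_on",
    "আলো নেভাও": "light_off",
    "পাখা চালু করো": "fan_on",
    "পাখা বন্ধ করো": "fan_off",
}


def match_command(transcript, commands=COMMANDS, cutoff=0.8):
    """Map a transcript to an action name, or return None if nothing matches."""
    phrase = " ".join(transcript.split())  # collapse stray whitespace
    if phrase in commands:
        return commands[phrase]
    # Fallback: tolerate small recognition errors via character-level similarity.
    close = difflib.get_close_matches(phrase, list(commands), n=1, cutoff=cutoff)
    return commands[close[0]] if close else None


print(match_command("  আলো   জ্বালাও "))
print(match_command("hello world"))
```

The cutoff value is a tuning choice: a higher value rejects more near misses, which may be preferable when a wrong action is costlier than asking the user to repeat the command.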