Wav2Vec2 XLS-R 300M Indonesian
An automatic speech recognition model for Indonesian, fine-tuned from Facebook's XLS-R-300M model on Indonesian speech data
Downloads 4,486
Release Time: 3/2/2022
Model Overview
This is an automatic speech recognition (ASR) model for Indonesian. It is fine-tuned from Facebook's wav2vec2-xls-r-300m checkpoint on Common Voice 8.0 and the MagicHub Indonesian conversational speech corpus.
Model Features
High-performance Indonesian recognition
Achieves a word error rate (WER) of 5.046% and a character error rate (CER) of 1.699% on the Common Voice 8.0 test set
Multi-dataset training
Trained on a combination of Common Voice 8.0 and the MagicHub Indonesian conversational speech corpus
Robustness evaluation
Also evaluated on the robust speech challenge datasets, which test recognition under varied recording conditions
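For readers unfamiliar with the WER and CER figures quoted above, the following is a minimal sketch of how such metrics are commonly computed: the edit distance between reference and hypothesis, divided by the reference length, over words for WER and over characters for CER. The function names and the sample sentences are illustrative assumptions, not the evaluation script used for this model. Text normalization (casing, punctuation) and whether spaces count toward CER vary between evaluations; this sketch counts spaces and applies no normalization.

```python
def edit_distance(ref, hyp):
    """Levenshtein distance between two sequences (words or characters)."""
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        curr = [i]
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(
                prev[j] + 1,         # deletion of a reference token
                curr[j - 1] + 1,     # insertion of a hypothesis token
                prev[j - 1] + cost,  # substitution, or match when cost is 0
            ))
        prev = curr
    return prev[-1]


def wer(reference: str, hypothesis: str) -> float:
    """Word error rate: word-level edit distance / number of reference words."""
    ref_words = reference.split()
    if not ref_words:
        raise ValueError("reference must contain at least one word")
    return edit_distance(ref_words, hypothesis.split()) / len(ref_words)


def cer(reference: str, hypothesis: str) -> float:
    """Character error rate: character-level edit distance / reference length.

    Assumption: spaces are counted as characters.
    """
    if not reference:
        raise ValueError("reference must not be empty")
    return edit_distance(reference, hypothesis) / len(reference)


# Hypothetical example: the hypothesis drops the final "r" of "pasar".
reference = "saya pergi ke pasar"
hypothesis = "saya pergi ke pasa"
print(f"WER: {wer(reference, hypothesis):.3f}")  # 1 wrong word out of 4
print(f"CER: {cer(reference, hypothesis):.3f}")  # 1 wrong character out of 19
```

One wrong word in a four-word sentence gives a WER of 25% but a CER of about 5%, which is why the CER reported for a model is typically much lower than its WER.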
Model Capabilities
Indonesian speech recognition
Speech-to-text
Automatic speech transcription
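As a rough illustration of the speech-to-text capability, the sketch below transcribes an audio file with the Hugging Face transformers pipeline API. It assumes the checkpoint is published on the Hugging Face Hub and that transformers, a PyTorch backend, and ffmpeg are installed. The function name transcribe and its parameters are illustrative, and the model's Hub ID is not given on this page, so it is left as an argument for the reader to supply.

```python
def transcribe(audio_path: str, model_id: str) -> str:
    """Transcribe one audio file with a speech recognition checkpoint from the Hub.

    model_id is an assumption: pass the Hub ID under which this model is published.
    """
    # Imported lazily so this module can be loaded without transformers installed.
    from transformers import pipeline

    asr = pipeline("automatic-speech-recognition", model=model_id)

    # Given a file path, the pipeline decodes the audio with ffmpeg and
    # resamples it to the sampling rate the model's feature extractor expects.
    result = asr(audio_path)
    return result["text"]
```

A call would look like transcribe("rekaman.wav", model_id), where "rekaman.wav" is a placeholder file name and model_id is the model's Hub identifier.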
Use Cases
Speech transcription
Voice assistants
Used as the speech recognition component for Indonesian voice assistants
Meeting minutes
Automatically transcribe Indonesian meeting content
Accessibility technology
Real-time caption generation
Generate real-time captions for Indonesian video content