Wav2vec2 Xls R 300m Indonesian

Developed by Wikidepia
An automatic speech recognition model for Indonesian, fine-tuned from Facebook's XLS-R-300M model on Indonesian speech data
Downloads 4,486
Release Time: 3/2/2022

Model Overview

This is an automatic speech recognition (ASR) model for Indonesian. It starts from Facebook's wav2vec2-xls-r-300m checkpoint and is fine-tuned on Common Voice 8.0 and the MagicHub Indonesian conversational speech corpus.
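A minimal sketch of how a fine-tuned wav2vec2 checkpoint like this one is typically loaded for transcription with the Hugging Face transformers library. The Hub identifier in MODEL_ID is an assumption inferred from the developer and model names on this page, not something the page states, so verify it before use. The helper name transcribe is hypothetical.

```python
# Assumed Hub identifier (not stated on this page); check it before relying on it.
MODEL_ID = "Wikidepia/wav2vec2-xls-r-300m-indonesian"


def transcribe(audio_path: str, model_id: str = MODEL_ID) -> str:
    """Return the transcript of a single audio file.

    Hypothetical helper: wraps the transformers ASR pipeline.
    """
    # Imported lazily so this module can be loaded without transformers installed.
    from transformers import pipeline

    asr = pipeline("automatic-speech-recognition", model=model_id)
    # Passing a file path makes the pipeline decode the audio itself,
    # which requires ffmpeg to be available on the system.
    result = asr(audio_path)
    return result["text"]


# Example call (downloads the model on first use):
#     print(transcribe("sample_indonesian.wav"))
```

The file name sample_indonesian.wav is a placeholder. wav2vec2 models expect 16 kHz mono audio, so recordings at other sample rates need resampling if you feed raw arrays instead of a file path.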

Model Features

High-performance Indonesian recognition
Achieves a word error rate (WER) of 5.046% and a character error rate (CER) of 1.699% on the Common Voice 8.0 test set
Multi-dataset training
Trained on a combination of Common Voice 8.0 and the MagicHub Indonesian conversational speech corpus
Robustness evaluation
Also evaluated on the Robust Speech Challenge datasets to gauge recognition quality under varied recording conditions
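For readers unfamiliar with the WER and CER figures quoted above, the following is a small sketch of how these metrics are commonly defined: the edit distance between reference and hypothesis, divided by the reference length, counted in words for WER and in characters for CER. This is an illustration only. The page does not say which tool or text normalization produced the reported scores, and the function names and example sentences are made up for the sketch.

```python
def edit_distance(ref, hyp):
    """Levenshtein distance (substitutions, insertions, deletions) between two sequences."""
    prev = list(range(len(hyp) + 1))  # distances from the empty reference prefix
    for i, r in enumerate(ref, start=1):
        curr = [i]  # distance from ref[:i] to the empty hypothesis
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # match or substitution
        prev = curr
    return prev[-1]


def wer(reference: str, hypothesis: str) -> float:
    """Word error rate for one utterance."""
    ref_words = reference.split()
    if not ref_words:
        raise ValueError("reference must contain at least one word")
    return edit_distance(ref_words, hypothesis.split()) / len(ref_words)


def cer(reference: str, hypothesis: str) -> float:
    """Character error rate for one utterance (spaces count as characters)."""
    if not reference:
        raise ValueError("reference must not be empty")
    return edit_distance(reference, hypothesis) / len(reference)


# Illustrative sentences, not taken from any evaluation set.
reference = "saya pergi ke pasar"
hypothesis = "saya pergi pasar"
print(f"WER = {wer(reference, hypothesis):.3f}")  # one missing word out of four
print(f"CER = {cer(reference, hypothesis):.3f}")  # three missing characters out of nineteen
```

Test-set scores are usually aggregated by summing errors over all utterances and dividing by the total reference length, rather than averaging per-utterance rates.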

Model Capabilities

Indonesian speech recognition
Speech-to-text
Automatic speech transcription

Use Cases

Speech transcription
Voice assistants
Used as the speech recognition component for Indonesian voice assistants
Meeting minutes
Automatically transcribe Indonesian meeting content
Accessibility technology
Real-time caption generation
Generate real-time captions for Indonesian video content
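Captioning and meeting transcription involve audio far longer than a single utterance, and a common approach is to split the signal into overlapping windows and transcribe each one in turn. The sketch below only computes the window boundaries. The 10 second window and 2 second overlap are illustrative assumptions, not settings from this model, and chunk_bounds is a hypothetical helper. The 16 kHz default matches the sample rate wav2vec2 models expect.

```python
def chunk_bounds(num_samples, sample_rate=16000, chunk_s=10.0, overlap_s=2.0):
    """Return (start, end) sample indices of overlapping windows covering the audio.

    chunk_s and overlap_s are illustrative defaults, not values from the model card.
    """
    chunk = int(chunk_s * sample_rate)
    step = chunk - int(overlap_s * sample_rate)
    if chunk <= 0 or step <= 0:
        raise ValueError("chunk_s must be positive and larger than overlap_s")

    bounds = []
    start = 0
    while start < num_samples:
        end = min(start + chunk, num_samples)
        bounds.append((start, end))
        if end == num_samples:
            break  # the last window reached the end of the audio
        start += step
    return bounds


# 25 seconds of 16 kHz audio split into 10 s windows with 2 s overlap.
for start, end in chunk_bounds(25 * 16000):
    print(f"{start / 16000:.1f}s to {end / 16000:.1f}s")
```

Each window would then be passed to the recognizer, with the overlapping regions used to stitch neighbouring transcripts together. The transformers ASR pipeline offers built-in chunking through its chunk_length_s and stride_length_s arguments, which may be preferable to a hand-rolled loop.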