Wav2vec2 From Scratch Finetune Dummy
W
Wav2vec2 From Scratch Finetune Dummy
Developed by inergi
This is an Indonesian automatic speech recognition model based on the XLSR Wav2Vec2 architecture, developed by cahya and fine-tuned on the Common Voice Indonesian dataset.
Downloads 15
Release Time : 3/2/2022
Model Overview
This model is specifically designed for automatic speech recognition tasks in Indonesian, capable of converting Indonesian speech into text.
Model Features
XLSR Fine-tuning
Fine-tuned based on the XLSR Wav2Vec2 architecture, optimizing recognition performance for Indonesian.
Low Word Error Rate
Achieves a word error rate (WER) of 25.86% on the Common Voice Indonesian test set.
Multilingual Foundation
Pre-trained model based on Cross-Lingual Speech Representation (XLSR), featuring excellent speech feature extraction capabilities.
Model Capabilities
Indonesian Speech Recognition
Speech-to-Text
Use Cases
Speech Transcription
Meeting Minutes
Automatically transcribe Indonesian meeting recordings into text records.
Accuracy approximately 74.14% (based on WER metric)
Voice Assistant
Provide speech recognition capabilities for Indonesian voice assistants.
Education
Language Learning Apps
Help learners practice Indonesian pronunciation and listening.
Featured Recommended AI Models