W

Wav2vec2 From Scratch Finetune Dummy

Developed by inergi
This is an Indonesian automatic speech recognition model based on the XLSR Wav2Vec2 architecture, developed by cahya and fine-tuned on the Common Voice Indonesian dataset.
Downloads 15
Release Time : 3/2/2022

Model Overview

This model is specifically designed for automatic speech recognition tasks in Indonesian, capable of converting Indonesian speech into text.

Model Features

XLSR Fine-tuning
Fine-tuned based on the XLSR Wav2Vec2 architecture, optimizing recognition performance for Indonesian.
Low Word Error Rate
Achieves a word error rate (WER) of 25.86% on the Common Voice Indonesian test set.
Multilingual Foundation
Pre-trained model based on Cross-Lingual Speech Representation (XLSR), featuring excellent speech feature extraction capabilities.

Model Capabilities

Indonesian Speech Recognition
Speech-to-Text

Use Cases

Speech Transcription
Meeting Minutes
Automatically transcribe Indonesian meeting recordings into text records.
Accuracy approximately 74.14% (based on WER metric)
Voice Assistant
Provide speech recognition capabilities for Indonesian voice assistants.
Education
Language Learning Apps
Help learners practice Indonesian pronunciation and listening.
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase