Wav2vec2 Large Xls R 300m Kyrgyz
This is an automatic speech recognition (ASR) model fine-tuned on Kyrgyz speech datasets based on the facebook/wav2vec2-xls-r-300m model
Downloads 17
Release Time : 3/2/2022
Model Overview
This model is specifically optimized for the Kyrgyz language, capable of converting Kyrgyz audio into text, suitable for applications such as speech transcription
Model Features
Multilingual support
Based on the XLS-R architecture, capable of handling multiple languages
Efficient speech recognition
Performs well on Kyrgyz speech recognition tasks
Pre-training + fine-tuning architecture
Utilizes large-scale pre-trained models and achieves better performance through fine-tuning with specific language data
Model Capabilities
Kyrgyz speech recognition
Audio to text
Speech transcription
Use Cases
Speech transcription
Kyrgyz speech to text
Convert Kyrgyz speech content into editable text
Word Error Rate (WER) 40.9%, Character Error Rate (CER) 11.0%
Voice assistant
Kyrgyz voice command recognition
Used for building voice assistant systems that support the Kyrgyz language
Featured Recommended AI Models
Š 2025AIbase