W

Wav2vec2 Large Xls R 300m Kyrgyz

Developed by infinitejoy
This is an automatic speech recognition (ASR) model fine-tuned on Kyrgyz speech datasets based on the facebook/wav2vec2-xls-r-300m model
Downloads 17
Release Time : 3/2/2022

Model Overview

This model is specifically optimized for the Kyrgyz language, capable of converting Kyrgyz audio into text, suitable for applications such as speech transcription

Model Features

Multilingual support
Based on the XLS-R architecture, capable of handling multiple languages
Efficient speech recognition
Performs well on Kyrgyz speech recognition tasks
Pre-training + fine-tuning architecture
Utilizes large-scale pre-trained models and achieves better performance through fine-tuning with specific language data

Model Capabilities

Kyrgyz speech recognition
Audio to text
Speech transcription

Use Cases

Speech transcription
Kyrgyz speech to text
Convert Kyrgyz speech content into editable text
Word Error Rate (WER) 40.9%, Character Error Rate (CER) 11.0%
Voice assistant
Kyrgyz voice command recognition
Used for building voice assistant systems that support the Kyrgyz language
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
Š 2025AIbase