W

Wav2vec2 Large Xlsr Kyrgyz

Developed by aismlv
A Kyrgyz speech recognition model fine-tuned from facebook/wav2vec2-large-xlsr-53, trained on Common Voice dataset with a word error rate of 34.08%.
Downloads 571
Release Time : 3/2/2022

Model Overview

This is a specialized model for Kyrgyz speech recognition, based on the Wav2Vec2-XLSR architecture, suitable for converting Kyrgyz audio into text.

Model Features

High Accuracy Kyrgyz Recognition
A speech recognition model specifically optimized for Kyrgyz language, achieving 34.08% word error rate on Common Voice test set
Based on XLSR Architecture
Utilizes large-scale cross-lingual representation learning pre-trained model with powerful speech feature extraction capabilities
16kHz Sampling Rate Support
Optimized for 16kHz sampled audio input, ensure matching audio sampling rate when using

Model Capabilities

Kyrgyz speech recognition
Audio to text
Automatic speech transcription

Use Cases

Speech Transcription
Kyrgyz Speech Transcription
Convert Kyrgyz speech content into editable text format
Word error rate 34.08%
Voice Assistants
Kyrgyz Voice Command Recognition
Provide speech recognition capability for Kyrgyz voice assistants
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
Š 2025AIbase