W

Wav2vec2 Large Xlsr 53 Kyrgyz

Developed by anton-l
This is a Kyrgyz automatic speech recognition model fine-tuned from the facebook/wav2vec2-large-xlsr-53 model, trained using public speech datasets.
Downloads 32
Release Time : 3/2/2022

Model Overview

This model is specifically designed for automatic speech recognition tasks in Kyrgyz, capable of converting Kyrgyz speech into text.

Model Features

High-accuracy speech recognition
Achieves a word error rate of 31.88% on Kyrgyz test sets
Based on XLSR architecture
Utilizes large-scale pre-trained models for cross-lingual speech representation learning
No language model required
Can be used directly without additional language model support

Model Capabilities

Kyrgyz speech recognition
16kHz audio processing

Use Cases

Speech-to-text
Kyrgyz speech transcription
Convert Kyrgyz speech content into editable text
Word error rate 31.88%
Voice assistants
Kyrgyz voice command recognition
Used to understand Kyrgyz voice commands
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
Š 2025AIbase