X

Xls R Kyrgiz Cv8

Developed by lucio
This model is a fine-tuned automatic speech recognition model based on facebook/wav2vec2-xls-r-300m on the Common Voice 8.0 Kyrgyz dataset
Downloads 16
Release Time : 3/2/2022

Model Overview

Speech recognition model optimized for Kyrgyz language, suitable for speech-to-text conversion tasks

Model Features

Low Word Error Rate
Achieves 19.01% WER on test set (with language model)
Multi-scenario applicability
Optimized for low-fidelity speech scenarios, suitable for various practical applications
Progressive learning
Adopts progressive learning rate scheduling strategy to optimize training results

Model Capabilities

Kyrgyz speech recognition
Speech-to-text
Audio content indexing

Use Cases

Media processing
Video caption generation
Automatically generates draft subtitles for Kyrgyz video content
WER 19.01% (with language model)
Broadcast content indexing
Indexes content of recorded Kyrgyz radio programs
CER 5.38% (with language model)
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase