Xls R Uyghur Cv8

Developed by lucio
An automatic speech recognition model for Uyghur, fine-tuned from facebook/wav2vec2-xls-r-300m on the Uyghur portion of the Common Voice 8 dataset
Downloads 24
Release Time: 3/2/2022

Model Overview

This is an automatic speech recognition (ASR) model optimized for Uyghur. It suits speech-to-text scenarios that can tolerate some transcription errors, such as producing drafts for later human correction.
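
As a rough illustration of how a model like this is typically used, the sketch below wraps the Hugging Face transformers speech recognition pipeline in a small helper. The Hub identifier "lucio/xls-r-uyghur-cv8" is an assumption inferred from the model name and developer shown on this page, and transcribe and audio_path are hypothetical names introduced for the example. Running it requires the transformers package, a backend such as PyTorch, and ffmpeg for decoding audio files.

```python
# Minimal sketch, not an official usage recipe.
# ASSUMPTION: the Hub id below is inferred from this page, not confirmed by it.
MODEL_ID = "lucio/xls-r-uyghur-cv8"


def transcribe(audio_path, model_id=MODEL_ID):
    """Return the transcript of one audio file as a plain string."""
    # Imported lazily so the module loads even without transformers installed.
    from transformers import pipeline

    # Build a speech recognition pipeline around the fine-tuned checkpoint.
    asr = pipeline("automatic-speech-recognition", model=model_id)

    # For a file path, the pipeline decodes and resamples the audio itself
    # and returns a dict whose "text" entry holds the transcript.
    return asr(audio_path)["text"]
```

Calling transcribe("recording.wav") with a hypothetical file name would return unpunctuated Uyghur text. For many files, building the pipeline once and reusing it avoids reloading the model on every call.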

Model Features

Uyghur optimization
Tuned for Uyghur written in the Perso-Arabic script, with punctuation removed from the transcripts
Progressive learning strategy
Uses a learning rate schedule with 2000 warmup steps and 9400 cooldown steps
Low-resource adaptation
Achieves usable recognition accuracy despite limited Uyghur training data
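
The warmup and cooldown step counts listed above can be pictured with a small schedule function. This is a sketch under stated assumptions: the page gives only the two step counts, so the linear shape of both ramps, the peak learning rate of 1e-4, and the optional constant phase (hold_steps) are all hypothetical choices made for illustration.

```python
def lr_at_step(step, peak_lr=1e-4, warmup_steps=2000,
               hold_steps=0, cooldown_steps=9400):
    """Learning rate at a given optimizer step.

    ASSUMPTIONS: linear warmup, linear cooldown, peak_lr and hold_steps
    are illustrative; only the two step counts come from the model page.
    """
    if step < 0:
        raise ValueError("step must be non-negative")

    # Phase 1: ramp linearly from 0 up to peak_lr.
    if step < warmup_steps:
        return peak_lr * step / warmup_steps

    # Phase 2 (optional): stay at peak_lr for hold_steps steps.
    steps_after_warmup = step - warmup_steps
    if steps_after_warmup < hold_steps:
        return peak_lr

    # Phase 3: decay linearly from peak_lr down to 0.
    steps_in_cooldown = steps_after_warmup - hold_steps
    if steps_in_cooldown >= cooldown_steps:
        return 0.0
    return peak_lr * (1 - steps_in_cooldown / cooldown_steps)
```

With the defaults, the rate rises for the first 2000 steps, then falls to zero over the following 9400, so lr_at_step(1000) is half the peak and any step from 11400 onward returns zero.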

Model Capabilities

Uyghur speech recognition
Broadcast recording transcription
Video subtitle generation

Use Cases

Media processing
Video subtitle draft generation
Automatically generates preliminary subtitles for Uyghur video content
Reported word error rate (WER): 30.5%; character error rate (CER): 5.8%
Broadcast recording indexing
Converts Uyghur broadcast content into searchable text
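
The word error rate and character error rate quoted above are both edit-distance ratios, which the following sketch makes concrete. It is a minimal illustration, not the evaluation script behind the reported figures: the function names are hypothetical, and details such as whether spaces count as characters vary between evaluation tools (here they do).

```python
def edit_distance(ref, hyp):
    """Levenshtein distance between two sequences (strings or lists)."""
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, 1):
        cur = [i]
        for j, h in enumerate(hyp, 1):
            cost = 0 if r == h else 1
            cur.append(min(prev[j] + 1,          # deletion
                           cur[j - 1] + 1,       # insertion
                           prev[j - 1] + cost))  # substitution or match
        prev = cur
    return prev[-1]


def wer(reference, hypothesis):
    """Word error rate: word-level edits divided by reference word count."""
    ref_words = reference.split()
    if not ref_words:
        raise ValueError("reference must contain at least one word")
    return edit_distance(ref_words, hypothesis.split()) / len(ref_words)


def cer(reference, hypothesis):
    """Character error rate: character-level edits over reference length."""
    if not reference:
        raise ValueError("reference must not be empty")
    return edit_distance(reference, hypothesis) / len(reference)
```

A single wrong character makes its whole word count as an error, so WER is normally well above CER. For the placeholder pair "alpha beta gamma delta" and "alpha beta gamna delta", one of four words is wrong but only one of 22 characters, which is consistent with the gap between the two reported figures.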