W

Wav2vec2 Large Xlsr Kazakh

Developed by aismlv
This is a Kazakh automatic speech recognition (ASR) model fine-tuned from facebook/wav2vec2-large-xlsr-53, trained on the Kazakh speech corpus v1.1 with a test WER of 19.65%.
Downloads 12.08k
Release Time : 3/2/2022

Model Overview

This model is specifically designed for automatic speech recognition tasks in Kazakh, supporting voice input with a 16kHz sampling rate.

Model Features

High-accuracy Kazakh recognition
Achieves a word error rate (WER) of 19.65% on the Kazakh speech corpus v1.1
Based on XLSR-53 architecture
Utilizes a large-scale cross-lingual speech representation learning model for fine-tuning
No language model required
Can be used directly without additional language model support

Model Capabilities

Kazakh speech recognition
16kHz audio processing

Use Cases

Speech-to-text
Kazakh speech transcription
Convert Kazakh speech content into text
Word error rate 19.65%
Voice assistant
Kazakh voice command recognition
Used for command recognition in Kazakh voice assistant systems
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
Š 2025AIbase