Wav2vec2 Large Xls R 300m Kk With LM

Developed by DrishtiSharma
This automatic speech recognition (ASR) model is facebook/wav2vec2-xls-r-300m fine-tuned on Kazakh (kk) data, with support for language model (LM) enhancement
Downloads 22
Release Time: 3/2/2022

Model Overview

An automatic speech recognition model for Kazakh, fine-tuned on the Common Voice 8.0 dataset, that converts Kazakh speech to text

Model Features

Language model enhancement
The model incorporates a language model (LM) for post-processing, improving recognition accuracy
Multi-dataset evaluation
Evaluated on multiple datasets including Common Voice and Robust Speech Event
Large-scale pre-training
Fine-tuned from the 300M-parameter wav2vec2-XLS-R model, which provides the pre-trained speech feature extractor

Model Capabilities

Kazakh speech recognition
Speech-to-text
Supports language model post-processing
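
The capabilities above can be tried with a minimal sketch like the following. It assumes the Hugging Face transformers library and its generic automatic-speech-recognition pipeline. The Hub identifier MODEL_ID is an assumption inferred from the developer name and model title on this page, and transcribe is a hypothetical helper name. LM-based decoding generally also requires the pyctcdecode and kenlm packages to be installed; without them the pipeline is expected to fall back to plain CTC decoding.

```python
# Assumed Hub id, inferred from the developer and model title; verify before use.
MODEL_ID = "DrishtiSharma/wav2vec2-large-xls-r-300m-kk-with-LM"


def transcribe(audio_path: str, model_id: str = MODEL_ID) -> str:
    """Transcribe one Kazakh audio file to text (hypothetical helper)."""
    # Imported lazily so that defining this helper does not require transformers.
    from transformers import pipeline

    # Build a generic ASR pipeline around the fine-tuned checkpoint.
    asr = pipeline("automatic-speech-recognition", model=model_id)

    # The pipeline returns a dict; the transcription is under the "text" key.
    result = asr(audio_path)
    return result["text"]
```

Calling transcribe("sample_kk.wav") (a hypothetical file name) downloads the checkpoint on first use and returns the recognized text as a string.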

Use Cases

Speech transcription
Kazakh speech transcription
Convert Kazakh speech content to text
Achieves a 41.7% word error rate (WER) on the Common Voice 8.0 test set
Voice assistants
Kazakh voice command recognition
Used for voice command recognition in Kazakh voice assistants or control systems
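
To make the WER figure quoted under speech transcription concrete, here is a minimal sketch of how word error rate is commonly computed: the word-level edit distance (substitutions, deletions, insertions) between a reference transcript and the model output, divided by the number of reference words. The function name word_error_rate is illustrative, and this is not necessarily the exact evaluation script used for this model, which may apply additional text normalization.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level Levenshtein distance divided by the reference length."""
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference words
    # processed so far and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]  # deleting all i reference words matches an empty hypothesis
        for j, hyp_word in enumerate(hyp, start=1):
            cost = 0 if ref_word == hyp_word else 1
            curr.append(min(
                prev[j] + 1,         # deletion of a reference word
                curr[j - 1] + 1,     # insertion of a hypothesis word
                prev[j - 1] + cost,  # match or substitution
            ))
        prev = curr
    return prev[-1] / len(ref)


# One substitution plus one deletion against a four-word reference.
print(word_error_rate("a b c d", "a x c"))  # 0.5
```

Under this definition, a WER of 41.7% means roughly 42 word-level errors per 100 reference words.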