W

Wav2vec2 Xls R 300m Kk N2

Developed by DrishtiSharma
This is an automatic speech recognition (ASR) model fine-tuned on Kazakh (KK) speech datasets based on the facebook/wav2vec2-xls-r-300m model.
Downloads 15
Release Time : 3/2/2022

Model Overview

This model is specifically designed for Kazakh speech recognition tasks, fine-tuned on the Common Voice 8 dataset, capable of converting Kazakh speech into text.

Model Features

Kazakh Language Optimization
Specially fine-tuned and optimized for Kazakh speech recognition
Based on Large-scale Pre-trained Model
Fine-tuned based on Facebook's wav2vec2-xls-r-300m model, inheriting its powerful speech feature extraction capabilities
Medium-sized Model
The 300M parameter size achieves a good balance between accuracy and computational efficiency

Model Capabilities

Kazakh speech recognition
Speech-to-text
Automatic speech recognition

Use Cases

Speech Transcription
Kazakh Speech Transcription
Convert Kazakh speech content into text format
WER of 0.4355 on Common Voice 8 test set
Voice Assistants
Kazakh Voice Command Recognition
Used for voice command recognition in Kazakh voice assistants
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
Š 2025AIbase