Wav2vec2 Large Xls R 300m Mongolian
An automatic speech recognition model for Mongolian, fine-tuned from facebook/wav2vec2-xls-r-300m
Release Time: 3/2/2022
Model Overview
This is an automatic speech recognition (ASR) model for Mongolian. It is based on the XLS-R architecture and fine-tuned on the Mongolian subset of Common Voice 7.0.
Model Features
Mongolian optimization
Specifically optimized and fine-tuned for Mongolian speech recognition
Based on XLS-R architecture
Built on the 300M-parameter XLS-R checkpoint (facebook/wav2vec2-xls-r-300m), a multilingual wav2vec 2.0 model pretrained on speech from many languages
Multi-dataset evaluation
Evaluated on multiple datasets, including Common Voice and Robust Speech Event data
Model Capabilities
Mongolian speech recognition
Speech-to-text
Conversational speech processing
Use Cases
Speech transcription
Mongolian speech-to-text
Convert Mongolian speech content into text
Reported word error rate (WER) of 44.7% on the Common Voice test set
Voice assistants
Mongolian voice command recognition
Speech recognition component for Mongolian voice assistants or voice control systems
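The WER figure quoted for the Common Voice test set is a word-level edit distance divided by the number of reference words. The sketch below shows one common way to compute it for a single utterance. It is an illustration only, not the evaluation script used for this model: the function name `word_error_rate` and the sample sentences are hypothetical, and published scores are usually aggregated over the whole test set after text normalization.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level Levenshtein distance divided by the reference length.

    Hypothetical helper for illustration; not taken from the model card.
    """
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference words
    # processed so far and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i] + [0] * len(hyp)
        for j, hyp_word in enumerate(hyp, start=1):
            substitution = prev[j - 1] + (0 if ref_word == hyp_word else 1)
            deletion = prev[j] + 1      # reference word missing from hypothesis
            insertion = curr[j - 1] + 1  # extra word in hypothesis
            curr[j] = min(substitution, deletion, insertion)
        prev = curr

    return prev[-1] / len(ref)


# Made-up example: one of four reference words is dropped by the recognizer.
print(word_error_rate("сайн байна уу та", "сайн байна та"))  # 0.25
```

A WER of 44.7% therefore means roughly 45 word-level errors (substitutions, deletions, and insertions) per 100 reference words. Because insertions count as errors, WER can exceed 100% on poor transcriptions.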