W

Wav2vec2 Large Xls R 300m Mongolian

Developed by infinitejoy
An automatic speech recognition model fine-tuned on Mongolian datasets based on facebook/wav2vec2-xls-r-300m
Downloads 33
Release Time : 3/2/2022

Model Overview

This is an optimized automatic speech recognition (ASR) model for Mongolian, based on the XLS-R architecture and fine-tuned on the Common Voice 7.0 Mongolian dataset.

Model Features

Mongolian optimization
Specifically optimized and fine-tuned for Mongolian speech recognition
Based on XLS-R architecture
Utilizes the powerful XLS-R 300M parameter architecture with excellent speech recognition capabilities
Multi-dataset evaluation
Evaluated on multiple datasets including Common Voice and robust speech events

Model Capabilities

Mongolian speech recognition
Speech-to-text
Conversational speech processing

Use Cases

Speech transcription
Mongolian speech-to-text
Convert Mongolian speech content into text
WER of 44.7% on the Common Voice test set
Voice assistants
Mongolian voice command recognition
Speech recognition component for Mongolian voice assistants or voice control systems
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
Š 2025AIbase