W

Wav2vec2 Large Xlsr Mongolian

Developed by manandey
An automatic speech recognition model fine-tuned on the Mongolian Common Voice dataset based on facebook/wav2vec2-large-xlsr-53
Downloads 4,719
Release Time : 3/2/2022

Model Overview

This model is an automatic speech recognition (ASR) model optimized for Mongolian, based on the Wav2Vec2 architecture, suitable for converting Mongolian speech to text.

Model Features

Mongolian optimization
Specifically fine-tuned for Mongolian speech recognition, enhancing comprehension of Mongolian speech.
XLSR pre-training based
Fine-tuned from the facebook/wav2vec2-large-xlsr-53 model, leveraging cross-lingual speech representation learning.
16kHz sampling rate support
Supports speech input at 16kHz sampling rate, suitable for most speech application scenarios.

Model Capabilities

Mongolian speech recognition
Speech-to-text

Use Cases

Speech transcription
Mongolian speech transcription
Convert Mongolian speech content into editable text format
Achieved a WER of 43.08% on the Common Voice Mongolian test set
Voice assistants
Mongolian voice command recognition
Used for developing Mongolian-language voice assistants and voice control applications
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
Š 2025AIbase