W

Wav2vec2 Large Xlsr 53 Mongolian

Developed by tugstugi
An automatic speech recognition model fine-tuned on the Common Voice Mongolian dataset based on facebook/wav2vec2-large-xlsr-53
Downloads 251
Release Time : 3/2/2022

Model Overview

This is an automatic speech recognition model optimized for Mongolian, fine-tuned based on the XLSR-53 architecture, suitable for Mongolian speech-to-text tasks.

Model Features

Mongolian Optimization
Specifically fine-tuned for Mongolian speech characteristics to improve recognition accuracy
Based on XLSR-53 Architecture
Utilizes the powerful wav2vec2-large-xlsr-53 as the base model
16kHz Sampling Rate Support
Supports standard 16kHz sampling rate audio input

Model Capabilities

Mongolian speech recognition
Speech-to-text
Automatic speech recognition

Use Cases

Speech Transcription
Mongolian Speech Transcription
Convert Mongolian speech content into text
Achieved a WER of 42.80% on the Common Voice Mongolian test set
Voice Assistants
Mongolian Voice Command Recognition
Used for voice command recognition in Mongolian voice assistants or control systems
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
Š 2025AIbase