W

Wav2vec2 Xls R Myv A1

Developed by DrishtiSharma
This model is an automatic speech recognition (ASR) model fine-tuned on the Erzya language (MYV) dataset based on facebook/wav2vec2-xls-r-300m, achieving a 65.15% word error rate (WER) on the Common Voice 8 test set.
Downloads 24
Release Time : 3/2/2022

Model Overview

This is an automatic speech recognition model for the Erzya language, fine-tuned based on the wav2vec2 XLS-R architecture, suitable for converting Erzya speech to text.

Model Features

Multilingual support
Optimized specifically for the Erzya language, suitable for speech recognition tasks in low-resource languages.
Based on XLS-R architecture
Utilizes Facebook's wav2vec2 XLS-R 300M parameter model as the base, featuring powerful speech feature extraction capabilities.
Fine-tuned on Common Voice dataset
Fine-tuned using Mozilla Common Voice 8.0's Erzya language data to adapt to specific linguistic characteristics.

Model Capabilities

Speech-to-text
Erzya language recognition
Automatic speech recognition

Use Cases

Speech transcription
Erzya speech transcription
Convert Erzya language speech content into text
Achieved a 65.15% word error rate (WER) on the test set.
Language preservation
Digitization of minority languages
Assist in recording and digitizing speech materials of minority languages like Erzya
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
Š 2025AIbase