W

Wav2vec2 Large Xls R 300m Or D5

Developed by DrishtiSharma
This is an automatic speech recognition (ASR) model fine-tuned on the Odia dataset based on facebook/wav2vec2-xls-r-300m, specifically designed for Odia speech-to-text tasks.
Downloads 24
Release Time : 3/2/2022

Model Overview

This model is a speech recognition model fine-tuned on the Mozilla Common Voice 8.0 Odia dataset, capable of converting Odia speech into text.

Model Features

Specialized for Odia
A speech recognition model specifically optimized for Odia
Based on large-scale pre-trained model
Fine-tuned on the facebook/wav2vec2-xls-r-300m model, inheriting its powerful speech feature extraction capabilities
Relatively low CER
Achieved a character error rate (CER) of 15.72% on the test set

Model Capabilities

Odia speech recognition
Speech-to-text
Long audio processing (supports chunk processing)

Use Cases

Speech transcription
Odia speech transcription
Convert Odia speech content into text
Test set WER 57.91%, CER 15.72%
Voice assistant
Odia voice command recognition
Used as a front-end recognition module for Odia voice assistants or voice control systems
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase