XLSR WithLM Malayalam

Developed by kavyamanohar
This model is a fine-tuned version of facebook/wav2vec2-xls-r-300m on the IMaSC, Indic TTS Malayalam, and OpenSLR Malayalam training datasets, supporting automatic speech recognition for Malayalam.
Downloads 19
Release date: 7/22/2024

Model Overview

This is an automatic speech recognition model for Malayalam, enhanced with a trigram language model trained using the KenLM library. Reported results include a WER of 27.3 on the OpenSLR Malayalam test set.
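To illustrate how a trigram language model can help pick between competing transcriptions, here is a minimal sketch in plain Python. It is not the model's actual decoding pipeline: it uses add-one smoothing instead of the modified Kneser-Ney smoothing that KenLM estimates, it rescores a short list of finished hypotheses rather than guiding a beam search, and the corpus, the hypotheses, their acoustic scores, and the `lm_weight` parameter are all made up for the example.

```python
import math
from collections import Counter


def train_trigram(sentences):
    """Count trigrams and their two-word contexts over padded sentences."""
    tri, ctx, vocab = Counter(), Counter(), set()
    for sentence in sentences:
        words = ["<s>", "<s>"] + sentence.split() + ["</s>"]
        vocab.update(words)
        for a, b, c in zip(words, words[1:], words[2:]):
            tri[(a, b, c)] += 1
            ctx[(a, b)] += 1
    return tri, ctx, len(vocab)


def log_prob(sentence, model):
    """Sentence log-probability under the trigram model (add-one smoothing)."""
    tri, ctx, vocab_size = model
    words = ["<s>", "<s>"] + sentence.split() + ["</s>"]
    total = 0.0
    for a, b, c in zip(words, words[1:], words[2:]):
        total += math.log((tri[(a, b, c)] + 1) / (ctx[(a, b)] + vocab_size))
    return total


def rescore(hypotheses, model, lm_weight=0.5):
    """Return the text whose acoustic score plus weighted LM score is highest.

    hypotheses: list of (text, acoustic_log_score) pairs (hypothetical values).
    """
    return max(hypotheses, key=lambda h: h[1] + lm_weight * log_prob(h[0], model))[0]


# Toy corpus and hypotheses, invented for illustration only.
model = train_trigram(["the cat sat", "the cat ran", "a dog sat"])
hypotheses = [("the cat sat", -4.0), ("the cat sad", -3.9)]
print(rescore(hypotheses, model, lm_weight=0.0))  # acoustic score alone
print(rescore(hypotheses, model, lm_weight=0.5))  # acoustic score plus LM
```

With the language model weight set to zero the slightly higher acoustic score wins; with the weight applied, the hypothesis made of word sequences seen in the text corpus is preferred.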

Model Features

Multi-dataset fine-tuning
Fine-tuned on multiple Malayalam datasets including IMaSC, Indic TTS Malayalam, and OpenSLR Malayalam, improving recognition accuracy.
Language model enhancement
Post-processed with a trigram language model trained on the ml-sentences dataset using the KenLM library, significantly improving recognition performance.
Efficient training
Trained with techniques such as gradient accumulation and mixed-precision training to make efficient use of limited resources.
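Gradient accumulation, mentioned above, can be sketched without any deep-learning framework. The example below is an illustration under stated assumptions, not the author's training script: a one-parameter linear model stands in for the network, the data and learning rate are invented, and mixed-precision training is not shown. It demonstrates that summing suitably scaled micro-batch gradients gives the same update as one large batch, which is what lets a large effective batch size fit in limited memory.

```python
def grad_mse(w, batch):
    """Gradient of the mean squared error of y ~ w * x over one batch."""
    return sum(2 * (w * x - y) * x for x, y in batch) / len(batch)


def full_batch_step(w, data, lr):
    """One gradient-descent step computed on the whole batch at once."""
    return w - lr * grad_mse(w, data)


def accumulated_step(w, data, micro_batch_size, lr):
    """One step whose gradient is accumulated over smaller micro-batches."""
    accumulated = 0.0
    for start in range(0, len(data), micro_batch_size):
        micro_batch = data[start:start + micro_batch_size]
        # Weight each micro-batch by its share of the data so that the
        # accumulated sum equals the full-batch mean gradient.
        accumulated += grad_mse(w, micro_batch) * len(micro_batch) / len(data)
    # The parameter is updated only once, after all micro-batches are seen.
    return w - lr * accumulated


# Toy data following y = 2x, invented for illustration.
data = [(1.0, 2.0), (2.0, 4.0), (3.0, 6.0), (4.0, 8.0)]
w_full = full_batch_step(0.0, data, lr=0.01)
w_accum = accumulated_step(0.0, data, micro_batch_size=2, lr=0.01)
print(w_full, w_accum)
```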

Model Capabilities

Malayalam speech recognition
Speech-to-text

Use Cases

Speech transcription
Malayalam speech transcription
Convert Malayalam speech content into text
Achieved a WER of 27.3 on the OpenSLR Malayalam test set
Voice assistants
Malayalam voice assistant
Used to build voice assistant applications supporting Malayalam
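The WER figure quoted above is the word error rate: the number of word substitutions, deletions, and insertions needed to turn the model output into the reference transcript, divided by the number of reference words. A minimal sketch of the standard computation follows; the function name and the sample sentences are illustrative, and the evaluation script behind the reported 27.3 may normalize text differently.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level edit distance divided by the number of reference words."""
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")
    # prev[j] holds the edit distance between the reference words processed
    # so far and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]
        for j, hyp_word in enumerate(hyp, start=1):
            cost = 0 if ref_word == hyp_word else 1
            curr.append(min(
                prev[j] + 1,         # deletion of a reference word
                curr[j - 1] + 1,     # insertion of a hypothesis word
                prev[j - 1] + cost,  # substitution or exact match
            ))
        prev = curr
    return prev[-1] / len(ref)


# One substitution and one deletion against a five-word reference.
reference = "the model transcribes malayalam speech"
hypothesis = "the model transcribe speech"
print(word_error_rate(reference, hypothesis))  # 0.4
```

WER is usually quoted as a percentage, so the value above would be reported as 40.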