XLSR WithLM Malayalam
This model is a fine-tuned version of facebook/wav2vec2-xls-r-300m on the IMaSC, Indic TTS Malayalam, and OpenSLR Malayalam training datasets, supporting automatic speech recognition for Malayalam.
Downloads 19
Release Time : 7/22/2024
Model Overview
This is an optimized automatic speech recognition model for Malayalam, enhanced with a trigram language model trained using the KENLM library, demonstrating excellent performance across multiple Malayalam datasets.
Model Features
Multi-dataset fine-tuning
Fine-tuned on multiple Malayalam datasets including IMaSC, Indic TTS Malayalam, and OpenSLR Malayalam, improving recognition accuracy.
Language model enhancement
Post-processed with a trigram language model trained on the ml-sentences dataset using the KENLM library, significantly improving recognition performance.
Efficient training
Utilized techniques such as gradient accumulation and mixed-precision training to achieve efficient training with limited resources.
Model Capabilities
Malayalam speech recognition
Speech-to-text
Use Cases
Speech transcription
Malayalam speech transcription
Convert Malayalam speech content into text
Achieved a WER of 27.3 on the OpenSLR Malayalam test set
Voice assistants
Malayalam voice assistant
Used to build voice assistant applications supporting Malayalam
Featured Recommended AI Models
Š 2025AIbase