W

W2V2 BERT Withlm Malayalam

Developed by vrclc
A Malayalam automatic speech recognition model fine-tuned based on facebook/w2v-bert-2.0, trained on multiple Malayalam datasets and using a trigram language model trained with the KENLM library.
Downloads 65
Release Time : 7/11/2024

Model Overview

This is an automatic speech recognition model for Malayalam, fine-tuned based on the facebook/w2v-bert-2.0 architecture, suitable for converting Malayalam speech to text.

Model Features

Multi-dataset Training
Fine-tuned on multiple datasets including IMaSC, MSC, OpenSLR Malayalam training set, Festvox Malayalam, CV16, and more.
Language Model Enhancement
Uses a trigram language model trained on the ml-sentences dataset with the KENLM library.
Low Word Error Rate
Achieves a 12.99% word error rate on the OpenSLR test set.

Model Capabilities

Malayalam Speech Recognition
Speech-to-Text

Use Cases

Speech Transcription
Malayalam Speech Transcription
Convert Malayalam speech to text
18.23% word error rate on the OpenSLR test set
Voice Assistant
Malayalam Voice Assistant
Supports Malayalam voice interaction
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
Š 2025AIbase