W

Whisper Medium Ml

Developed by thennal
Malayalam automatic speech recognition model fine-tuned based on OpenAI Whisper-medium, trained on datasets including Common Voice 11.0
Downloads 127
Release Time : 12/12/2022

Model Overview

This model is an optimized automatic speech recognition (ASR) system for Malayalam, fine-tuned on the Whisper-medium architecture, supporting high-accuracy speech-to-text functionality

Model Features

Multi-dataset Training
Incorporates training from Common Voice 11.0, Fleurs, and multiple Malayalam-specific datasets
Optimized Error Rate
Achieves a word error rate (WER) of 11.49 on the Common Voice test set
Standardization Processing
Optimized text standardization processing pipeline for Malayalam characteristics

Model Capabilities

Malayalam speech recognition
Long audio processing (supports 30-second chunks)
Timestamped transcription (optional)

Use Cases

Speech Transcription
Speech Content Transcription
Convert Malayalam speech content into text
Achieves 88.51% word recognition accuracy on test sets
Assistive Tools
Accessibility Applications
Provides real-time caption generation for the hearing impaired
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
Š 2025AIbase