W

Whisper Tamil Large V2

Developed by vasista22
Tamil speech recognition model fine-tuned based on OpenAI Whisper-large-v2, trained on multiple public Tamil ASR corpora
Downloads 325
Release Time : 1/1/2023

Model Overview

An automatic speech recognition model optimized for Tamil, suitable for transcription tasks across various accents and dialects

Model Features

Multi-dataset fine-tuning
Trained on 6 different sources of Tamil ASR datasets, covering a wide range of speech characteristics
Low word error rate
Achieves WER of only 6.61% on Common Voice 11.0 test set and 7.5% WER on Fleurs test set
Efficient inference support
Provides two inference solutions: standard transformers and whisper-jax, supporting batch processing and GPU acceleration

Model Capabilities

Tamil speech transcription
Long audio processing (supports chunking)
Accent adaptation

Use Cases

Speech transcription services
Tamil media content subtitle generation
Automatically generates subtitles for video/podcast media content
Achieves 93.39% accuracy on Common Voice test set
Voice assistant development
Tamil voice command recognition
Used to develop smart voice assistants supporting Tamil
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
Š 2025AIbase