Whisper Tamil Small

Developed by vasista22
A Tamil automatic speech recognition model fine-tuned from OpenAI's Whisper-small on multiple public datasets, with a reported word error rate below 10% on the Common Voice 11.0 and Fleurs test sets.
Downloads 10.78k
Release date: 1/1/2023

Model Overview

This is an automatic speech recognition (ASR) model for Tamil, fine-tuned from the Whisper-small checkpoint and intended for Tamil speech-to-text tasks.

Model Features

Low word error rate
WER of 7.95% on the Common Voice 11.0 Tamil test set and 9.11% on the Fleurs test set.
Multi-dataset training
Trained on data drawn from 6 public Tamil ASR datasets.
Accelerated inference support
Supports JAX-accelerated inference through whisper-jax, including batched processing.
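
For readers unfamiliar with the metric behind the figures above: word error rate is the word-level edit distance (substitutions, deletions, and insertions) between a reference transcript and the model output, divided by the number of reference words. The sketch below is a minimal illustration of that definition. The function name `word_error_rate` and the placeholder tokens are hypothetical, and it applies no text normalization, so it will not necessarily reproduce the reported scores.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level edit distance divided by the number of reference words.

    Illustrative helper (not taken from the model card); no text
    normalization is applied before comparison.
    """
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference words seen
    # so far and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]
        for j, hyp_word in enumerate(hyp, start=1):
            substitution_cost = 0 if ref_word == hyp_word else 1
            curr.append(min(
                prev[j] + 1,                      # deletion
                curr[j - 1] + 1,                  # insertion
                prev[j - 1] + substitution_cost,  # match or substitution
            ))
        prev = curr
    return prev[-1] / len(ref)


if __name__ == "__main__":
    # Placeholder tokens: one substitution and one deletion over 4 words.
    print(word_error_rate("a b c d", "a x c"))
```

A WER of 7.95% therefore corresponds to roughly 8 word-level errors per 100 reference words.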

Model Capabilities

Tamil speech recognition
Long audio processing (supports chunking)
Real-time transcription
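
Long-audio chunking can be sketched as follows. The first helper, `chunk_bounds`, is a hypothetical and simplified illustration of splitting a recording into overlapping windows; it is not the library's internal algorithm. The second function shows one plausible way to run chunked transcription with the Hugging Face `transformers` ASR pipeline, assuming the checkpoint is published under the hub ID `vasista22/whisper-tamil-small` (an assumption not confirmed by this page) and that `transformers`, a backend such as PyTorch, and ffmpeg are installed.

```python
# Assumed hub ID; verify against the model's actual repository.
MODEL_ID = "vasista22/whisper-tamil-small"


def chunk_bounds(total_s: float, chunk_s: float = 30.0, overlap_s: float = 5.0):
    """Return (start, end) windows in seconds that cover a recording.

    Simplified illustration: consecutive windows overlap by `overlap_s`
    so that words cut at a boundary appear whole in a neighbouring window.
    """
    if not 0 <= overlap_s < chunk_s:
        raise ValueError("overlap_s must satisfy 0 <= overlap_s < chunk_s")
    step = chunk_s - overlap_s
    bounds = []
    start = 0.0
    while start < total_s:
        end = min(start + chunk_s, total_s)
        bounds.append((start, end))
        if end >= total_s:
            break
        start += step
    return bounds


def transcribe_long(audio_path: str, model_id: str = MODEL_ID) -> str:
    """Transcribe a long audio file using the pipeline's built-in chunking."""
    # Imported lazily so the chunking helper works without transformers.
    from transformers import pipeline

    asr = pipeline(
        task="automatic-speech-recognition",
        model=model_id,
        chunk_length_s=30,  # Whisper operates on 30-second windows
    )
    # Depending on the library version, the output language may need to be
    # forced explicitly; that step is omitted from this sketch.
    return asr(audio_path)["text"]


if __name__ == "__main__":
    print(chunk_bounds(70.0))
```

Calling `transcribe_long("meeting.wav")` (a hypothetical file name) would download the checkpoint on first use and return the transcript as a single string.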

Use Cases

Speech transcription
Meeting minutes
Convert Tamil meeting recordings into text transcripts.
Highly accurate transcriptions.
Media subtitle generation
Automatically generate subtitles for Tamil video content.
Accurate subtitles with WER below 10%.
Voice assistants
Tamil voice command recognition
Used for localized voice assistant development.