W

Whisper Large V3 Vaani Hindi

Developed by ARTPARK-IISc
A Hindi speech recognition model fine-tuned based on OpenAI's Whisper-Large-V3, trained on approximately 718 hours of transcribed Hindi speech data
Downloads 15.55k
Release Time : 3/14/2025

Model Overview

This is an automatic speech recognition (ASR) model specifically optimized for Hindi, fine-tuned on the Whisper-large-v3 architecture, suitable for Hindi speech transcription tasks.

Model Features

Hindi optimization
Specifically fine-tuned for Hindi speech, providing more accurate transcription results
Multi-dataset training
Incorporates multiple Hindi speech datasets to enhance model generalization
Long audio processing
Supports 30-second audio chunk processing, suitable for long speech transcription

Model Capabilities

Hindi speech recognition
Long audio transcription
Multi-scenario speech processing

Use Cases

Speech transcription
Meeting minutes
Convert Hindi meeting recordings into text transcripts
Achieves a WER of 27.50 on the Gramvaani dataset
Media subtitle generation
Generate subtitles for Hindi video content
Achieves a WER of 4.38 on the IndicTTS dataset
Speech analysis
Voice assistant
Build Hindi voice interaction systems
Achieves a WER of 16.86 on the Commonvoice dataset
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
Š 2025AIbase