Whisper Large V3 Ft Cv16 Mn
A speech recognition model fine-tuned on the Common Voice 16.0 dataset based on OpenAI Whisper Large V3
Downloads 34
Release Time : 1/22/2024
Model Overview
This model is a fine-tuned version of OpenAI Whisper Large V3, focusing on automatic speech recognition (ASR) tasks, achieving a 35.22% word error rate on the Common Voice dataset.
Model Features
High-precision speech recognition
Achieves a 35.22% word error rate on the Common Voice test set, demonstrating excellent performance.
Multilingual support
Based on the Whisper architecture, capable of processing multiple languages.
Efficient fine-tuning
Targeted training on the base model improves recognition accuracy in specific domains.
Model Capabilities
Speech-to-text
Multilingual speech recognition
Long audio processing
Use Cases
Speech transcription
Automatic meeting minutes generation
Automatically convert meeting recordings into text transcripts
Approximately 65% accuracy (inferred based on WER metric)
Podcast subtitle generation
Automatically generate subtitles for podcast content
Assistive technology
Hearing impairment assistance
Real-time speech-to-text assistance for the hearing impaired
Featured Recommended AI Models
Š 2025AIbase