Whisper Large V2 Pl V2
An automatic speech recognition model for Polish speech-to-text, fine-tuned from Whisper Large v2 on Polish datasets.
Downloads 217
Release Time: 12/14/2022
Model Overview
This is an automatic speech recognition (ASR) model optimized for Polish. It was fine-tuned on the Common Voice 11.0 and FLEURS datasets and converts Polish speech into text.
Model Features
High-precision Polish recognition
Achieves a 7.28% word error rate (WER) on the Common Voice 11.0 test set
Multi-dataset training
Fine-tuned on two Polish speech datasets: Common Voice 11.0 and FLEURS
Optimized training process
Uses tuned training hyperparameters together with gradient accumulation, which builds a larger effective batch from several smaller ones
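The WER figure quoted in the feature list is the word-level edit distance between a reference transcript and the model output, divided by the number of reference words. The following is a minimal sketch of that calculation in plain Python. The function name `word_error_rate` is illustrative, and the sketch assumes whitespace tokenization with no text normalization; the source does not state which normalization was applied when the 7.28% figure was measured, so scores from this sketch are not directly comparable to it.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Return (substitutions + deletions + insertions) / reference word count."""
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference words seen so far
    # and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i] + [0] * len(hyp)
        for j, hyp_word in enumerate(hyp, start=1):
            substitution = prev[j - 1] + (0 if ref_word == hyp_word else 1)
            deletion = prev[j] + 1      # reference word missing from hypothesis
            insertion = curr[j - 1] + 1  # extra word in hypothesis
            curr[j] = min(substitution, deletion, insertion)
        prev = curr
    return prev[-1] / len(ref)


if __name__ == "__main__":
    # One substituted word out of three reference words.
    print(word_error_rate("ala ma kota", "ala ma psa"))
```

Because the denominator is the reference length, WER can exceed 1.0 when the hypothesis contains many inserted words.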
Model Capabilities
Polish speech recognition
Speech-to-text
Automatic speech transcription
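As a rough illustration of these capabilities, the sketch below transcribes an audio file with the Hugging Face `transformers` ASR pipeline. It assumes the model is published as a `transformers`-compatible Whisper checkpoint. `MODEL_ID` is a hypothetical placeholder rather than a confirmed repository id, and `build_transcriber` and `transcribe` are illustrative helper names that do not come from the model card.

```python
# Hypothetical placeholder: replace with the repository id under which
# this model is actually published.
MODEL_ID = "your-namespace/whisper-large-v2-pl-v2"


def build_transcriber(model_id: str = MODEL_ID):
    """Create an ASR pipeline; requires the `transformers` package and a backend such as PyTorch."""
    from transformers import pipeline  # imported lazily so the module loads without it

    # chunk_length_s lets the pipeline split recordings longer than
    # Whisper's 30-second input window into chunks.
    return pipeline(
        "automatic-speech-recognition",
        model=model_id,
        chunk_length_s=30,
    )


def transcribe(audio_path: str, transcriber=None) -> str:
    """Return the transcript of one audio file as a stripped string."""
    asr = transcriber if transcriber is not None else build_transcriber()
    result = asr(audio_path)
    return result["text"].strip()
```

Calling `transcribe("nagranie.wav")` would download the checkpoint on first use. Passing a prebuilt `transcriber` avoids reloading the model for every file.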
Use Cases
Speech transcription
Automated meeting minutes
Automatically converts Polish meeting recordings into text transcripts
Highly accurate transcripts
Media subtitle generation
Automatically generates subtitles for Polish video content
Low-error-rate subtitle output
Voice assistants
Polish voice command recognition
Used for command understanding in Polish voice assistant systems
High-accuracy command recognition
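For the subtitle use case listed above, timestamped transcript segments have to be turned into a subtitle file. The sketch below converts a list of segments into SubRip (SRT) text. It assumes each segment is a dictionary with a `timestamp` pair of start and end seconds plus a `text` string, which is the shape the `transformers` ASR pipeline returns under `chunks` when called with `return_timestamps=True`. It also assumes both timestamps are present in every segment. The helper names are illustrative.

```python
def format_timestamp(seconds: float) -> str:
    """Format seconds as an SRT timestamp, HH:MM:SS,mmm."""
    millis = round(seconds * 1000)
    hours, millis = divmod(millis, 3_600_000)
    minutes, millis = divmod(millis, 60_000)
    secs, millis = divmod(millis, 1000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d},{millis:03d}"


def chunks_to_srt(chunks) -> str:
    """Build SRT text from segments shaped like {"timestamp": (start, end), "text": str}."""
    blocks = []
    for index, chunk in enumerate(chunks, start=1):
        start, end = chunk["timestamp"]
        blocks.append(
            f"{index}\n"
            f"{format_timestamp(start)} --> {format_timestamp(end)}\n"
            f"{chunk['text'].strip()}\n"
        )
    # A blank line separates consecutive subtitle entries.
    return "\n".join(blocks)


if __name__ == "__main__":
    example = [
        {"timestamp": (0.0, 2.5), "text": " Dzień dobry."},
        {"timestamp": (2.5, 4.0), "text": " Witamy."},
    ]
    print(chunks_to_srt(example))
```

Segments produced by the model may be longer than is comfortable to read on screen, so a real subtitle workflow would likely split or merge them before export.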