Whisper Large V3 Turbo STT Zeroth KO V2
A Korean automatic speech recognition model fine-tuned from Whisper Large v3 Turbo, producing high-accuracy transcriptions with timestamps
Downloads 662
Release Time: 2/3/2025
Model Overview
This model is a version of openai/whisper-large-v3-turbo fine-tuned specifically for Korean automatic speech recognition (ASR), with the goal of high-accuracy speech transcription.
Model Features
Korean Optimization
Fine-tuned specifically for Korean speech recognition to improve transcription accuracy
Timestamp Support
Transcription results include timestamps, making it easy to locate passages in the audio
Incremental Fine-tuning
Uses a phased, incremental fine-tuning strategy to improve model performance step by step
Data Augmentation
Applies 20% random data augmentation during training to improve model robustness
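The card does not say which augmentations are used or how the 20% is applied. As a minimal sketch, the example below reads "20%" as a per-sample probability and uses additive Gaussian noise as a stand-in augmentation. Both choices, along with the names `AUGMENT_PROBABILITY`, `add_noise`, and `maybe_augment`, are assumptions for illustration and not the author's training code.

```python
import random

# Assumption: "20%" is read as the chance that any one training sample is augmented.
AUGMENT_PROBABILITY = 0.2


def add_noise(samples, rng, scale=0.005):
    """Hypothetical augmentation: add small Gaussian noise to each audio sample."""
    return [s + rng.gauss(0.0, scale) for s in samples]


def maybe_augment(samples, rng, probability=AUGMENT_PROBABILITY):
    """Return (samples, was_augmented); augment with the given probability."""
    if rng.random() < probability:
        return add_noise(samples, rng), True
    return list(samples), False


if __name__ == "__main__":
    rng = random.Random(0)
    clip = [0.0, 0.1, -0.1, 0.05]  # toy waveform, not real audio
    flags = [maybe_augment(clip, rng)[1] for _ in range(10_000)]
    print(f"augmented fraction: {sum(flags) / len(flags):.3f}")
```

Applying augmentation to only a fraction of samples keeps most of the training data clean while still exposing the model to perturbed input.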
Model Capabilities
Korean speech recognition
Timestamped transcription
High-accuracy speech-to-text
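Timestamped transcription with a Whisper-family checkpoint might look like the sketch below, which uses the Hugging Face Transformers speech recognition pipeline. It assumes `transformers` and `torch` are installed. The default model ID is the base model named in this card and should be replaced with the fine-tuned checkpoint's repository ID, which this page does not give. The helper names `format_chunks` and `transcribe` are illustrative.

```python
# Base model named in this card; replace with the fine-tuned checkpoint's ID.
BASE_MODEL_ID = "openai/whisper-large-v3-turbo"


def format_chunks(chunks):
    """Render pipeline-style chunks as '[start - end] text' lines."""
    lines = []
    for chunk in chunks:
        start, end = chunk["timestamp"]
        # The final chunk may have no end time, so fall back to a placeholder.
        end_label = f"{end:.2f}s" if end is not None else "?"
        lines.append(f"[{start:.2f}s - {end_label}] {chunk['text'].strip()}")
    return lines


def transcribe(audio_path, model_id=BASE_MODEL_ID):
    """Transcribe an audio file and return timestamped lines."""
    # Imported lazily so format_chunks can be used without transformers installed.
    from transformers import pipeline

    asr = pipeline("automatic-speech-recognition", model=model_id)
    result = asr(
        audio_path,
        return_timestamps=True,  # segment-level timestamps
        generate_kwargs={"language": "korean", "task": "transcribe"},
    )
    return format_chunks(result["chunks"])
```

Calling `transcribe("meeting.wav")` (a hypothetical file name) would download the model on first use and return one line per transcribed segment.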
Use Cases
Speech Transcription
Korean Meeting Minutes
Automatically transcribe Korean meeting recordings into timestamped text
Reported word error rate (WER): 19.9134%; character error rate (CER): 0.0660%
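For readers unfamiliar with these metrics, both are edit distances normalized by the reference length: WER counts word-level substitutions, insertions, and deletions, and CER does the same over characters. The sketch below is a plain-Python illustration, not the evaluation script behind the figures above. Stripping spaces before computing CER is one common convention and is an assumption here.

```python
def edit_distance(ref, hyp):
    """Levenshtein distance between two sequences (each edit costs 1)."""
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        curr = [i]
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # substitution or match
        prev = curr
    return prev[-1]


def wer(reference, hypothesis):
    """Word error rate; assumes a non-empty reference."""
    ref_words = reference.split()
    return edit_distance(ref_words, hypothesis.split()) / len(ref_words)


def cer(reference, hypothesis):
    """Character error rate with spaces removed; assumes a non-empty reference."""
    ref_chars = list(reference.replace(" ", ""))
    hyp_chars = list(hypothesis.replace(" ", ""))
    return edit_distance(ref_chars, hyp_chars) / len(ref_chars)


if __name__ == "__main__":
    reference = "오늘 회의는 세 시에 시작합니다"
    hypothesis = "오늘 회의는 세시에 시작합니다"  # differs only in word spacing
    print(wer(reference, hypothesis), cer(reference, hypothesis))
```

In this toy pair the hypothesis differs from the reference only in spacing, so the word-level score is penalized while the character-level score is not. This is one reason the two metrics can diverge widely for Korean.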
Korean Media Subtitle Generation
Automatically generate subtitles for Korean video content
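One way to turn timestamped segments into subtitles is to write them in the SubRip (SRT) format. The sketch below assumes segments shaped like `{"timestamp": (start, end), "text": ...}` with times in seconds, which is the shape the Hugging Face speech recognition pipeline returns when timestamps are requested. The function names are illustrative, and segments without an end time would need handling before this step.

```python
def srt_timestamp(seconds):
    """Format seconds as the SRT timestamp HH:MM:SS,mmm."""
    total_ms = int(round(seconds * 1000))
    hours, rem = divmod(total_ms, 3_600_000)
    minutes, rem = divmod(rem, 60_000)
    secs, ms = divmod(rem, 1000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d},{ms:03d}"


def chunks_to_srt(chunks):
    """Build SRT text from chunks shaped like {'timestamp': (start, end), 'text': str}."""
    blocks = []
    for index, chunk in enumerate(chunks, start=1):
        start, end = chunk["timestamp"]
        blocks.append(
            f"{index}\n"
            f"{srt_timestamp(start)} --> {srt_timestamp(end)}\n"
            f"{chunk['text'].strip()}\n"
        )
    # Each block ends with a newline, so joining with "\n" leaves one blank line between cues.
    return "\n".join(blocks)


if __name__ == "__main__":
    demo = [
        {"timestamp": (0.0, 2.5), "text": " 안녕하세요"},
        {"timestamp": (2.5, 4.0), "text": " 감사합니다"},
    ]
    print(chunks_to_srt(demo))
```

The resulting string can be saved with a `.srt` extension and loaded by most video players alongside the source video.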
Speech Analysis
Korean Speech Content Analysis
Analyze Korean speech content to extract key information