
Whisper Large V3 Turbo STT Zeroth KO V2

Developed by o0dimplz0o
A Korean automatic speech recognition model fine-tuned from Whisper Large v3 Turbo that provides high-accuracy transcription with timestamps
Downloads 662
Release Time: 2/3/2025

Model Overview

This model is a version of openai/whisper-large-v3-turbo fine-tuned specifically for Korean automatic speech recognition (ASR), with the aim of providing high-accuracy speech transcription.

Model Features

Korean Optimization
Fine-tuned specifically for Korean speech recognition, with the aim of higher transcription accuracy on Korean audio
Timestamp Support
Transcription results include timestamps, making it easy to locate passages in the audio
Incremental Fine-tuning
Fine-tuned incrementally in phases, so that model performance improves step by step
Data Augmentation
Applies 20% random data augmentation during training to improve model robustness
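
The card does not say which augmentations were used or how the 20% figure was applied. The sketch below is one possible reading only: each training waveform is augmented with probability 0.2, and additive Gaussian noise stands in as a hypothetical transform. The names `maybe_augment` and `add_gaussian_noise` and the `noise_std` value are illustrative assumptions, not part of the model's training code.

```python
import random

# "20% random data augmentation", read here as a per-sample probability (assumption).
AUGMENT_PROB = 0.2


def add_gaussian_noise(samples, noise_std=0.005, rng=random):
    """Hypothetical transform: add zero-mean Gaussian noise to every sample."""
    return [s + rng.gauss(0.0, noise_std) for s in samples]


def maybe_augment(samples, prob=AUGMENT_PROB, rng=random):
    """Return a noisy copy with probability `prob`, otherwise the input unchanged."""
    if rng.random() < prob:
        return add_gaussian_noise(samples, rng=rng)
    return samples


if __name__ == "__main__":
    rng = random.Random(0)
    waveform = [0.0] * 1600  # 0.1 s of silence at 16 kHz
    augmented = sum(maybe_augment(waveform, rng=rng) is not waveform for _ in range(1000))
    print(f"augmented {augmented} of 1000 samples")
```

Passing a seeded `random.Random` instance keeps the selection reproducible between runs, which helps when comparing fine-tuning phases.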

Model Capabilities

Korean speech recognition
Timestamped transcription
High-accuracy speech-to-text
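
As a rough illustration of these capabilities, the following sketch loads the model through the Hugging Face `transformers` ASR pipeline and requests timestamps. The repository id in `MODEL_ID` is an assumption inferred from the developer and model names on this page, and the function name `transcribe_korean` is hypothetical; check the model's own page for the exact id and recommended settings.

```python
# Assumed Hugging Face repository id; verify before use.
MODEL_ID = "o0dimplz0o/Whisper-Large-v3-turbo-STT-Zeroth-KO-v2"


def transcribe_korean(audio_path, model_id=MODEL_ID):
    """Transcribe a Korean audio file and return (full_text, timestamped_chunks)."""
    # Imported lazily so the module can be loaded without transformers installed.
    from transformers import pipeline

    asr = pipeline(
        "automatic-speech-recognition",
        model=model_id,
        chunk_length_s=30,  # split long recordings into 30 s windows
    )
    result = asr(
        audio_path,
        return_timestamps=True,  # segment-level (start, end) times in seconds
        generate_kwargs={"language": "korean", "task": "transcribe"},
    )
    return result["text"], result["chunks"]


if __name__ == "__main__":
    import sys

    text, chunks = transcribe_korean(sys.argv[1])
    print(text)
    for chunk in chunks:
        print(chunk["timestamp"], chunk["text"])
```

Each entry in `chunks` is a dictionary with a `timestamp` pair and the `text` spoken in that interval.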

Use Cases

Speech Transcription
Korean Meeting Minutes
Automatically transcribe Korean meeting recordings into timestamped text
Word error rate 19.9134%, character error rate 0.0660%
Korean Media Subtitle Generation
Automatically generate subtitles for Korean video content
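
For subtitle generation, timestamped segments have to be rendered in a subtitle format. The sketch below converts segments shaped like `{"timestamp": (start, end), "text": ...}` (the shape returned by the `transformers` ASR pipeline with timestamps enabled) into SRT text. The helper names are hypothetical, and reusing the start time when an end time is missing is a simplifying assumption.

```python
def format_srt_time(seconds):
    """Convert seconds to the SRT timestamp form HH:MM:SS,mmm."""
    total_ms = int(round(seconds * 1000))
    hours, rem = divmod(total_ms, 3_600_000)
    minutes, rem = divmod(rem, 60_000)
    secs, ms = divmod(rem, 1000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d},{ms:03d}"


def chunks_to_srt(chunks):
    """Render timestamped chunks as numbered SRT cues separated by blank lines."""
    blocks = []
    for index, chunk in enumerate(chunks, start=1):
        start, end = chunk["timestamp"]
        if end is None:
            # The final segment may lack an end time; fall back to its start (assumption).
            end = start
        cue = f"{format_srt_time(start)} --> {format_srt_time(end)}"
        blocks.append(f"{index}\n{cue}\n{chunk['text'].strip()}")
    return "\n\n".join(blocks) + "\n"


if __name__ == "__main__":
    sample = [
        {"timestamp": (0.0, 2.5), "text": " 안녕하세요"},
        {"timestamp": (2.5, 5.0), "text": " 회의를 시작하겠습니다"},
    ]
    print(chunks_to_srt(sample))
```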
Speech Analysis
Korean Speech Content Analysis
Analyze Korean speech content to extract key information
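
The word and character error rates quoted for the meeting-minutes use case are edit-distance metrics. The card does not describe its evaluation setup, so the following is only a generic sketch of how such rates are commonly computed: Levenshtein distance between reference and hypothesis, divided by the reference length. Splitting words on whitespace and ignoring spaces for the character rate are assumptions; toolkits differ on text normalization.

```python
def edit_distance(ref, hyp):
    """Levenshtein distance between two sequences (substitute, insert, delete)."""
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        curr = [i]
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(prev[j] + 1, curr[j - 1] + 1, prev[j - 1] + cost))
        prev = curr
    return prev[-1]


def wer(reference, hypothesis):
    """Word error rate as a ratio: word-level edits / reference word count."""
    ref_words = reference.split()
    if not ref_words:
        raise ValueError("reference must contain at least one word")
    return edit_distance(ref_words, hypothesis.split()) / len(ref_words)


def cer(reference, hypothesis):
    """Character error rate as a ratio, with spaces removed (assumption)."""
    ref_chars = list(reference.replace(" ", ""))
    if not ref_chars:
        raise ValueError("reference must contain at least one character")
    hyp_chars = list(hypothesis.replace(" ", ""))
    return edit_distance(ref_chars, hyp_chars) / len(ref_chars)


if __name__ == "__main__":
    reference = "오늘 회의를 시작하겠습니다"
    hypothesis = "오늘 회의 시작하겠습니다"
    print(f"WER: {wer(reference, hypothesis):.2%}")
    print(f"CER: {cer(reference, hypothesis):.2%}")
```

Both functions return ratios; multiply by 100 to express them as percentages. Because Korean words often carry attached particles, a single dropped syllable can count as a whole word error while barely moving the character rate, which is one reason the two metrics can differ widely.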