Whisper-large-v2-Ko Open-Source Korean Speech Recognition Model - Free Deployment for Precise Korean Speech Recognition

Whisper Large V2 Ko

Developed by byoussef

Korean automatic speech recognition (ASR) model fine-tuned based on OpenAI Whisper-large-v2, excelling on Korean datasets

Speech Recognition

Transformers

KoreanOpen Source License:Apache-2.0 #Korean speech recognition #Low word error rate #Multi-GPU training

Downloads 94

Release Time : 3/10/2023

Model Overview

This model is a Korean fine-tuned version of OpenAI Whisper-large-v2, specifically optimized for Korean speech recognition tasks, achieving a 2.9% word error rate (WER) on the Zeroth Korean dataset

Model Features

Low word error rate

Only 2.9% word error rate on Korean test sets, demonstrating excellent performance

Multi-GPU training

Efficient training using 7 GPUs with a total training batch size of 224

Optimized training process

Adopted linear learning rate scheduling with 500 warm-up steps, achieving optimal results after 50 epochs

Model Capabilities

Korean speech recognition

Speech-to-text

High-accuracy transcription

Use Cases

Speech transcription

Korean meeting minutes

Automatically transcribe Korean meeting recordings into text

Highly accurate written records

Korean voice assistant

Provide speech recognition capabilities for Korean voice assistants

Accurate voice command recognition

Education

Korean learning applications

Help Korean learners check pronunciation accuracy

Provide accurate pronunciation feedback

Training Loss	Epoch	Step	Validation Loss	Wer
0.0299	10.0	1000	0.0745	0.0447
0.0085	20.0	2000	0.0608	0.0353
0.0036	30.0	3000	0.0593	0.0302
0.0013	40.0	4000	0.0609	0.0282
0.0008	50.0	5000	0.0617	0.0290

Property	Details
Model Type	Fine - tuned version of openai/whisper - large - v2
Training Data	None
Metrics	Wer (Bingsu/zeroth - korean: 2.9; google/fleurs (ko_kr, test split): 20.66)
Base Model	openai/whisper - large - v2

Featured Recommended AI Models

Empowering the Future, Your AI Solution Knowledge Base

English 简体中文繁體中文にほんご

Whisper Large V2 Ko

Model Overview

Model Features

Model Capabilities

Use Cases

🚀 whisper-large-v2-Ko

🚀 Quick Start

📚 Documentation

Model description

Intended uses & limitations

Training and evaluation data

Training procedure

Training hyperparameters

Training results

Framework versions

📄 License

📦 Information Table