Wav2vec2 Ksponspeech
Korean speech recognition model fine-tuned on the Ksponspeech dataset, optimized based on Wav2vec2-large-xlsr-53
Downloads 111
Release Time : 6/11/2022
Model Overview
This model is an automatic speech recognition (ASR) model optimized for Korean, specifically designed for Korean speech-to-text tasks, achieving a word error rate (WER) of 0.373 on third-party test sets
Model Features
Korean optimization
Fine-tuned specifically for Korean characteristics on the Ksponspeech dataset
High performance
Achieves a word error rate (WER) of 0.373 on third-party test sets
Clear improvement areas
Identified specific optimization directions such as digit/character normalization and pronunciation correction
Model Capabilities
Korean speech recognition
High-accuracy speech-to-text conversion
Handling non-standard Korean pronunciation
Use Cases
Speech transcription
Korean meeting minutes
Automatically convert Korean meeting recordings into text transcripts
Word error rate 0.373
Media subtitle generation
Automatically generate subtitles for Korean video content
Featured Recommended AI Models
Š 2025AIbase