Wav2vec2 Large Xlsr Korean

Developed by kresnik
Korean automatic speech recognition (ASR) model based on the Wav2Vec2 XLSR architecture, evaluated on the Zeroth Korean dataset
Downloads 1.7M
Release Time: 3/2/2022

Model Overview

This model is designed for Korean speech recognition. It converts Korean speech to text, with a reported word error rate (WER) of 4.74% and character error rate (CER) of 1.78% on the Zeroth Korean test set.

Model Features

High Accuracy
Achieves a word error rate (WER) of 4.74% and a character error rate (CER) of 1.78% on the Zeroth Korean test set
Large Model Architecture
Based on the large-scale Wav2Vec2 XLSR architecture, suitable for Korean speech recognition tasks
Pre-trained Model
Provides pre-trained model weights ready for inference or fine-tuning
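
The WER and CER figures quoted above are edit-distance ratios between a reference transcript and the model output. The following is a minimal sketch of how such metrics are commonly computed, not the evaluation script behind the reported numbers. The function names and the sample sentences are hypothetical, and stripping spaces before computing CER is an assumption (conventions differ between toolkits).

```python
def edit_distance(ref, hyp):
    """Levenshtein distance between two sequences (words or characters)."""
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        curr = [i]
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # substitution or match
        prev = curr
    return prev[-1]


def wer(reference, hypothesis):
    """Word error rate: word-level edits divided by reference word count."""
    ref_words = reference.split()
    if not ref_words:
        raise ValueError("reference must contain at least one word")
    return edit_distance(ref_words, hypothesis.split()) / len(ref_words)


def cer(reference, hypothesis):
    """Character error rate, ignoring spaces (an assumed convention)."""
    ref_chars = list(reference.replace(" ", ""))
    if not ref_chars:
        raise ValueError("reference must contain at least one character")
    hyp_chars = list(hypothesis.replace(" ", ""))
    return edit_distance(ref_chars, hyp_chars) / len(ref_chars)


# Hypothetical example: the hypothesis drops one particle from the second word.
reference = "오늘 날씨가 정말 좋습니다"
hypothesis = "오늘 날씨 정말 좋습니다"
print(f"WER: {wer(reference, hypothesis):.2%}")
print(f"CER: {cer(reference, hypothesis):.2%}")
```

Because a single wrong character makes a whole word count as an error, WER is normally higher than CER on the same output, which is consistent with the 4.74% and 1.78% figures listed here.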

Model Capabilities

Korean Speech Recognition
Audio to Text
Automatic Speech Transcription
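
The audio-to-text capability can be sketched with the Hugging Face `transformers` library. This is an illustrative example under stated assumptions rather than official usage: the Hub identifier `kresnik/wav2vec2-large-xlsr-korean` is inferred from the developer and model name on this page, the checkpoint is assumed to be a CTC fine-tune loadable with `Wav2Vec2ForCTC`, and the input is assumed to be 16 kHz mono audio. The `transcribe` helper is a hypothetical name.

```python
# Assumed Hub identifier, inferred from the developer and model name.
MODEL_ID = "kresnik/wav2vec2-large-xlsr-korean"
# Wav2Vec2 XLSR checkpoints are pre-trained on 16 kHz audio.
TARGET_SR = 16_000


def transcribe(waveform, sampling_rate=TARGET_SR, model_id=MODEL_ID):
    """Greedy CTC transcription of one mono waveform (sequence of floats)."""
    if sampling_rate != TARGET_SR:
        raise ValueError(
            f"expected {TARGET_SR} Hz audio, got {sampling_rate} Hz; resample first"
        )

    # Imported here so the input check above works without the heavy dependencies.
    import torch
    from transformers import Wav2Vec2ForCTC, Wav2Vec2Processor

    processor = Wav2Vec2Processor.from_pretrained(model_id)
    model = Wav2Vec2ForCTC.from_pretrained(model_id)
    model.eval()

    inputs = processor(
        waveform, sampling_rate=sampling_rate, return_tensors="pt", padding=True
    )
    with torch.no_grad():
        logits = model(**inputs).logits

    # Pick the most likely token per frame; batch_decode collapses CTC repeats.
    predicted_ids = torch.argmax(logits, dim=-1)
    return processor.batch_decode(predicted_ids)[0]
```

A call such as `transcribe(samples)` would download the weights on first use, where `samples` is a list or array of floats already resampled to 16 kHz. Loading the processor and model once and reusing them is preferable when transcribing many files.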

Use Cases

Speech Transcription
Korean Meeting Minutes
Automatically converts Korean meeting recordings into text transcripts
Word accuracy of 95.26% (100% minus the 4.74% WER), as measured on the Zeroth Korean test set
Voice Assistant
Speech recognition module for Korean voice assistant applications
Education
Korean Learning App
Helps Korean learners check pronunciation accuracy