W

Wav2vec2 Xls R 300m Korean Lm

Developed by w11wo
Korean automatic speech recognition model based on XLS-R architecture, fine-tuned on the Zeroth Korean dataset with an added 5-gram language model
Downloads 23
Release Time : 3/2/2022

Model Overview

This model is a deep learning model for Korean automatic speech recognition (ASR), fine-tuned based on Facebook's Wav2Vec2-XLS-R-300M architecture, suitable for Korean speech-to-text tasks.

Model Features

Korean Optimization
Specially fine-tuned for Korean speech recognition, performing well on the Zeroth Korean dataset
5-gram Language Model Enhancement
Incorporated a 5-gram language model trained on the Open Subtitles Korean subset to improve recognition accuracy
Robustness Testing
Participated in HuggingFace's Robust Speech Challenge, testing performance under various conditions

Model Capabilities

Korean Speech Recognition
Speech-to-Text
Supports 5-gram Language Model Decoding

Use Cases

Speech Transcription
Korean Speech Transcription
Convert Korean speech content into text
Achieved 30.94% WER and 7.97% CER on the Zeroth Korean dataset
Voice Assistants
Korean Voice Command Recognition
Recognize and understand Korean voice commands
Achieved 66.47% WER on robust speech event test data
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
Š 2025AIbase