H

Hubert Large Korean

Developed by team-lucid
Hubert-large-korean is a Korean automatic speech recognition model based on the Hubert architecture. It extracts features directly from speech waveforms through self-supervised learning and performs excellently in Korean speech processing.
Downloads 131
Release Time : 6/4/2023

Model Overview

This model adopts the Hidden-Unit BERT architecture and is specifically designed for Korean speech recognition tasks. It can learn directly from the original speech signal without relying on traditional feature extraction methods.

Model Features

Self-supervised learning
Learn directly from the raw waveform of the speech signal without manually labeled data
Korean optimization
Specifically trained and optimized for the characteristics of Korean speech
Large-scale training
Trained with approximately 4000 hours of Korean speech data
High-performance architecture
Adopts a 24-layer Transformer encoder, a 1024-dimensional embedding space, and 16 attention heads

Model Capabilities

Korean speech recognition
Speech feature extraction
Speech waveform processing

Use Cases

Speech to text
Korean speech transcription
Convert Korean speech content into text
Speech analysis
Speech feature analysis
Extract high-level feature representations of the speech signal
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
Š 2025AIbase