Stt Kr Conformer Ctc Medium
S
Stt Kr Conformer Ctc Medium
Developed by SungBeom
Korean automatic speech recognition model based on Conformer architecture, optimized for stream processing with excellent performance in specific domains like customer service voice
Downloads 176
Release Time : 6/4/2023
Model Overview
This model is a Korean automatic speech recognition model based on the Conformer-CTC architecture, fine-tuned for the AI Hub dataset. Compared to attention-based models, it shows less performance degradation during stream processing and operates faster, making it particularly suitable for real-time speech recognition applications.
Model Features
Stream Processing Optimization
Compared to attention-based models like Whisper, it shows less performance degradation during stream processing (about 20%) and faster processing speed
Efficient Inference
Real-time factor (RTF) is 0.05 on V100 GPU and 0.35 on CPU (7 cores), suitable for real-time applications
Strong Domain Adaptability
In specific domains like customer service voice, when combined with KenLM, the word error rate can be significantly reduced from 13.45 to 5.27
Model Capabilities
Korean speech recognition
Real-time stream speech processing
Optimized for specific domain speech recognition
Use Cases
Customer Service Domain
Customer Service Voice Transcription
Used for real-time voice transcription of customer service calls
Word error rate reduced from 13.45 to 5.27 when combined with KenLM
In-car Systems
In-car Voice Command Recognition
Used to recognize in-car conversations and voice commands
Featured Recommended AI Models
Š 2025AIbase