S

Stt Kr Conformer Ctc Medium

Developed by SungBeom
Korean automatic speech recognition model based on Conformer architecture, optimized for stream processing with excellent performance in specific domains like customer service voice
Downloads 176
Release Time : 6/4/2023

Model Overview

This model is a Korean automatic speech recognition model based on the Conformer-CTC architecture, fine-tuned for the AI Hub dataset. Compared to attention-based models, it shows less performance degradation during stream processing and operates faster, making it particularly suitable for real-time speech recognition applications.

Model Features

Stream Processing Optimization
Compared to attention-based models like Whisper, it shows less performance degradation during stream processing (about 20%) and faster processing speed
Efficient Inference
Real-time factor (RTF) is 0.05 on V100 GPU and 0.35 on CPU (7 cores), suitable for real-time applications
Strong Domain Adaptability
In specific domains like customer service voice, when combined with KenLM, the word error rate can be significantly reduced from 13.45 to 5.27

Model Capabilities

Korean speech recognition
Real-time stream speech processing
Optimized for specific domain speech recognition

Use Cases

Customer Service Domain
Customer Service Voice Transcription
Used for real-time voice transcription of customer service calls
Word error rate reduced from 13.45 to 5.27 when combined with KenLM
In-car Systems
In-car Voice Command Recognition
Used to recognize in-car conversations and voice commands
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
Š 2025AIbase