Whisper Small Ko Low Qual Voice
A Korean automatic speech recognition (ASR) model fine-tuned from Whisper-small and intended for transcribing Korean speech.
Downloads 211
Release Time: 7/2/2025
Model Overview
This model is fine-tuned from Whisper-small for Korean automatic speech recognition. It is suited to a range of Korean speech processing scenarios, such as conversations, broadcasts, and news.
Model Features
Accurate recognition
Fine-tuned to transcribe Korean speech content accurately.
Suitable for multiple scenarios
Can be used for offline or batch transcription of Korean speech data, or integrated into Korean speech assistant systems.
Extensible
Supports further fine-tuning on domain-specific datasets, such as law, medicine, and education.
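To check recognition accuracy on your own data, for example before and after domain-specific fine-tuning, one common choice for Korean is the character error rate (CER). The following is a minimal sketch, not part of the model's tooling: the function names `edit_distance` and `cer` are illustrative, and ignoring whitespace is an assumption made here because Korean spacing conventions vary.

```python
import unicodedata
from typing import Sequence


def edit_distance(ref: str, hyp: str) -> int:
    """Levenshtein distance between two strings, counted in characters."""
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        curr = [i]
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # substitution or match
        prev = curr
    return prev[-1]


def _normalize(text: str, ignore_spaces: bool) -> str:
    # NFC keeps each Hangul syllable as a single code point.
    text = unicodedata.normalize("NFC", text)
    return "".join(text.split()) if ignore_spaces else text


def cer(references: Sequence[str], hypotheses: Sequence[str],
        ignore_spaces: bool = True) -> float:
    """Corpus-level CER: total character edits / total reference characters."""
    if len(references) != len(hypotheses):
        raise ValueError("references and hypotheses must have the same length")
    total_edits = 0
    total_chars = 0
    for ref, hyp in zip(references, hypotheses):
        ref = _normalize(ref, ignore_spaces)
        hyp = _normalize(hyp, ignore_spaces)
        total_edits += edit_distance(ref, hyp)
        total_chars += len(ref)
    if total_chars == 0:
        raise ValueError("references contain no characters")
    return total_edits / total_chars
```

Running `cer` over the same held-out recordings with the base checkpoint and with a domain fine-tuned checkpoint gives a like-for-like comparison.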
Model Capabilities
Korean speech recognition
Speech transcription
Speech assistant integration
Use Cases
Speech transcription
Offline speech transcription
Used for batch transcription of Korean speech data.
Speech assistant integration
Integrated into Korean speech assistant systems.
Domain-specific applications
Legal domain
Can be further fine-tuned on legal-domain datasets for legal speech transcription.
Medical domain
Can be further fine-tuned on medical-domain datasets for medical speech transcription.
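The offline batch transcription use case can be sketched as follows. This assumes the checkpoint loads through the Hugging Face `transformers` automatic-speech-recognition pipeline, as Whisper fine-tunes typically do. The repository ID is not given here, so `model_id` is a placeholder, and the helper names (`find_audio_files`, `chunked`, `transcribe_batch`, `load_asr`) are hypothetical.

```python
from pathlib import Path
from typing import Callable, Dict, Iterator, List

AUDIO_SUFFIXES = {".wav", ".flac", ".mp3"}


def find_audio_files(root: str) -> List[str]:
    """Collect audio file paths under root, sorted for a stable order."""
    return sorted(str(p) for p in Path(root).rglob("*")
                  if p.is_file() and p.suffix.lower() in AUDIO_SUFFIXES)


def chunked(items: List[str], size: int) -> Iterator[List[str]]:
    """Yield consecutive groups of at most `size` items."""
    if size < 1:
        raise ValueError("size must be at least 1")
    for start in range(0, len(items), size):
        yield items[start:start + size]


def transcribe_batch(paths: List[str],
                     asr: Callable[[List[str]], List[dict]],
                     batch_size: int = 8) -> Dict[str, str]:
    """Map each audio path to its transcript, calling `asr` on small groups."""
    results: Dict[str, str] = {}
    for group in chunked(paths, batch_size):
        outputs = asr(group)  # expected: one {"text": ...} dict per input
        for path, out in zip(group, outputs):
            results[path] = out["text"].strip()
    return results


def load_asr(model_id: str) -> Callable[[List[str]], List[dict]]:
    """Wrap a transformers ASR pipeline (assumption: the checkpoint supports it).

    `model_id` is a placeholder for the model's repository ID or local path.
    Decoding audio files by path requires ffmpeg to be installed.
    """
    from transformers import pipeline  # imported lazily; heavy dependency

    pipe = pipeline("automatic-speech-recognition", model=model_id)

    def asr(paths: List[str]) -> List[dict]:
        # Pin the language and task so Whisper does not auto-detect them.
        return pipe(paths, generate_kwargs={"language": "korean",
                                            "task": "transcribe"})

    return asr
```

A typical call would be `transcribe_batch(find_audio_files("recordings/"), load_asr(model_id))`, where `recordings/` is an example directory. Passing the recognizer in as a callable keeps the batching logic testable without downloading the model.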