D

Distill Whisper Th Medium

Developed by biodatlab
A distilled automatic speech recognition model based on the Whisper architecture, optimized for Thai language with balanced performance and efficiency
Downloads 303
Release Time : 1/16/2024

Model Overview

This is a distilled Whisper model specifically designed for Thai speech recognition. It is distilled from a large teacher model, improving efficiency while maintaining high recognition accuracy.

Model Features

Efficient Distillation Architecture
Adopts a 4-layer decoder structure (original teacher model has 24 layers), significantly improving efficiency while maintaining performance
Thai Language Optimization
Specially optimized and trained for the characteristics of Thai speech
Multi-source Training Data
Trained using multi-source data including Common Voice, Gowajee, and Thai elderly speech corpus
Dialect Support
Includes dialect data such as Central Thai, enhancing recognition capability for dialects

Model Capabilities

Thai speech recognition
Dialect recognition
Efficient speech-to-text

Use Cases

Speech Transcription
Thai Meeting Minutes
Real-time transcription of Thai meeting content into text
Voice Notes
Convert Thai voice notes into searchable text
Accessibility Applications
Hearing Assistance
Provide real-time captions for the hearing impaired
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase