W

Whisper Th Large V3 Combined

Developed by biodatlab
This is a Thai automatic speech recognition model fine-tuned based on OpenAI's Whisper Large V3 model, achieving a 6.59% word error rate on the Common Voice 13 Thai test set.
Downloads 1,354
Release Time : 2/20/2024

Model Overview

This model is an automatic speech recognition (ASR) model optimized for Thai, fine-tuned on enhanced versions of the Common Voice 13 and FLEURS datasets, specifically designed for Thai speech transcription tasks.

Model Features

Low Word Error Rate
Only 6.59% word error rate (WER) on the Common Voice 13 Thai test set
Thai Optimization
Specially fine-tuned for Thai speech characteristics
Mixed Dataset Training
Enhanced training using multiple datasets including Common Voice 13 and FLEURS

Model Capabilities

Thai Speech Recognition
Audio Transcription
Long Audio Processing (supports 30-second chunks)

Use Cases

Speech Transcription
Thai Meeting Minutes
Automatically transcribe Thai meeting recordings into text
Highly accurate transcription text
Thai Media Subtitle Generation
Automatically generate subtitles for Thai video content
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
Š 2025AIbase