W

Whisper Th Medium Combined

Developed by biodatlab
Fine-tuned on an enhanced Thai dataset based on openai/whisper-medium for Thai automatic speech recognition
Downloads 4,167
Release Time : 12/14/2022

Model Overview

This model is a Thai automatic speech recognition model fine-tuned on the enhanced Thai dataset of mozilla-foundation/common_voice_13_0, the google/fleurs dataset, and a selected dataset based on openai/whisper-medium.

Model Features

High-precision Thai recognition
Achieved a word error rate (WER) of 7.42 on the common-voice-13 test set
Multi-dataset fine-tuning
Fine-tuned based on mozilla-foundation/common_voice_13_0, google/fleurs, and a selected dataset
Support for long audio processing
Supports long audio segmentation processing with chunk_length_s=30

Model Capabilities

Thai speech recognition
Long audio transcription

Use Cases

Speech transcription
Thai speech to text
Convert Thai speech files to text
Word error rate 7.42
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
Š 2025AIbase