
Whisper Small Cantonese

Developed by alvanlii
A Cantonese speech recognition model fine-tuned from OpenAI Whisper-small, achieving a character error rate (CER) of 7.93% on the Common Voice 16.0 test set
Downloads 2,413
Release Time: 12/8/2022

Model Overview

An automatic speech recognition (ASR) model optimized for Cantonese that converts Cantonese speech to text efficiently and accurately

Model Features

Optimized Cantonese Recognition
Fine-tuned specifically on Cantonese speech, reaching a character error rate (CER) of 7.93% on the Common Voice 16.0 test set
Efficient Inference
Supports Flash Attention acceleration, with a reported processing time of about 0.055 seconds per sample
Multi-format Support
Also available in GGML and CTranslate2 (CT2) formats for use with tools such as whisper.cpp and WhisperX
Speculative Decoding Support
Can serve as the assistant (draft) model in speculative decoding to speed up inference with a larger model
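
The features above can be exercised with the Hugging Face transformers library. The following is a minimal sketch, not the author's published recipe. It assumes the model is hosted on the Hugging Face Hub under the id `alvanlii/whisper-small-cantonese` (inferred from the developer and model names; verify before use), that `torch` and `transformers` are installed, and that the optional `flash-attn` package is available if Flash Attention is requested. The helper names `load_kwargs` and `build_asr_pipeline` are made up for this illustration.

```python
from typing import Optional

# Assumed Hugging Face Hub id for this model; verify it before use.
CANTONESE_SMALL_ID = "alvanlii/whisper-small-cantonese"


def load_kwargs(cuda_available: bool, use_flash_attention: bool = False) -> dict:
    """Choose model-loading options; kept pure so they can be inspected."""
    kwargs = {"torch_dtype": "float16" if cuda_available else "float32"}
    if use_flash_attention and cuda_available:
        # Needs the separate flash-attn package and a supported GPU.
        kwargs["attn_implementation"] = "flash_attention_2"
    return kwargs


def build_asr_pipeline(
    model_id: str = CANTONESE_SMALL_ID,
    assistant_id: Optional[str] = None,
    use_flash_attention: bool = False,
):
    """Build a speech recognition pipeline, optionally with an assistant model."""
    # Heavy imports stay inside the function so the module imports cheaply.
    import torch
    from transformers import AutoModelForSpeechSeq2Seq, AutoProcessor, pipeline

    cuda = torch.cuda.is_available()
    kwargs = load_kwargs(cuda, use_flash_attention)
    dtype = getattr(torch, kwargs.pop("torch_dtype"))
    device = "cuda:0" if cuda else "cpu"

    model = AutoModelForSpeechSeq2Seq.from_pretrained(
        model_id, torch_dtype=dtype, **kwargs
    ).to(device)
    processor = AutoProcessor.from_pretrained(model_id)

    generate_kwargs = {}
    if assistant_id is not None:
        # Speculative decoding: the smaller model drafts tokens and the
        # larger model verifies them.
        assistant = AutoModelForSpeechSeq2Seq.from_pretrained(
            assistant_id, torch_dtype=dtype, **kwargs
        ).to(device)
        generate_kwargs["assistant_model"] = assistant

    return pipeline(
        "automatic-speech-recognition",
        model=model,
        tokenizer=processor.tokenizer,
        feature_extractor=processor.feature_extractor,
        torch_dtype=dtype,
        device=device,
        generate_kwargs=generate_kwargs,
    )
```

For plain transcription, `build_asr_pipeline()("clip.wav")["text"]` would return the recognized text, where `clip.wav` is a placeholder file name. For speculative decoding, pass the larger checkpoint as `model_id` and `CANTONESE_SMALL_ID` as `assistant_id`; the two models need to share the same tokenizer vocabulary for this to work.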

Model Capabilities

Cantonese Speech Recognition
Chinese Speech Recognition
Fast Speech-to-Text Conversion
Long Audio Processing (supports chunking)
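
Chunking for long audio can be pictured as cutting the waveform into overlapping windows, transcribing each one, and merging the results. The sketch below only illustrates how window boundaries might be computed. It assumes 16 kHz audio and 30-second windows (the input size Whisper models work with) plus a 5-second overlap chosen for illustration. The function name `chunk_bounds` is hypothetical, and this is not the exact algorithm used inside any particular library.

```python
def chunk_bounds(num_samples, sample_rate=16000, chunk_s=30.0, overlap_s=5.0):
    """Return (start, end) sample indices of overlapping chunks."""
    chunk = int(chunk_s * sample_rate)
    step = chunk - int(overlap_s * sample_rate)
    if chunk <= 0 or step <= 0:
        raise ValueError("chunk length must be positive and exceed the overlap")

    bounds = []
    start = 0
    while start < num_samples:
        end = min(start + chunk, num_samples)
        bounds.append((start, end))
        if end == num_samples:
            break  # the last chunk reached the end of the audio
        start += step  # consecutive chunks share overlap_s seconds
    return bounds


# A 70-second clip yields three chunks: 0-30 s, 25-55 s, and 50-70 s.
for start, end in chunk_bounds(70 * 16000):
    print(start / 16000, end / 16000)
```

In practice, the transformers speech recognition pipeline handles this internally when called with the `chunk_length_s` argument (for example `chunk_length_s=30`), including merging the text from the overlapping regions.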

Use Cases

Speech Transcription
Cantonese Video Subtitle Generation
Automatically generates subtitles for Cantonese video content
Reported recognition accuracy: CER of 7.93% on the Common Voice 16.0 test set
Voice Assistant
Build voice interaction applications that support Cantonese
Fast response (reported at about 0.055 seconds per sample)
Speech Analysis
Cantonese Speech Data Analysis
Transcribes and analyzes Cantonese speech content
Supports multiple Cantonese dataset formats
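
The CER figure quoted on this page is a character error rate: the character-level edit distance between the model output and the reference transcript, divided by the reference length. The sketch below shows one way to compute it as a percentage. It makes assumptions that may differ from the evaluation behind the reported number: spaces are ignored, and no punctuation or text normalization is applied. The function names are illustrative.

```python
def edit_distance(ref: str, hyp: str) -> int:
    """Levenshtein distance between two strings, computed row by row."""
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, 1):
        curr = [i]
        for j, h in enumerate(hyp, 1):
            cost = 0 if r == h else 1
            curr.append(min(
                prev[j] + 1,         # deletion
                curr[j - 1] + 1,     # insertion
                prev[j - 1] + cost,  # substitution or match
            ))
        prev = curr
    return prev[-1]


def cer(reference: str, hypothesis: str) -> float:
    """Character error rate in percent, ignoring spaces."""
    ref = reference.replace(" ", "")
    hyp = hypothesis.replace(" ", "")
    if not ref:
        raise ValueError("reference must contain at least one character")
    return 100.0 * edit_distance(ref, hyp) / len(ref)


# One wrong character out of seven gives a CER of about 14.29%.
print(round(cer("我哋今日去飲茶", "我地今日去飲茶"), 2))
```

To score a whole test set, the usual approach is to sum the edit distances and reference lengths over all utterances before dividing, rather than averaging per-utterance CER values.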