Whisper-large-v2-cantonese Open-source Cantonese Speech Recognition Model - Precise Recognition, Low Character Error Rate!

Whisper Large V2 Cantonese

Developed by simonl0909

An automatic speech recognition model fine-tuned on Cantonese dataset based on OpenAI Whisper Large V2, achieving a character error rate of 6.7274% on the test set

Speech Recognition

Transformers

OtherOpen Source License:Apache-2.0 #Cantonese speech recognition #Low character error rate #Common Voice fine-tuning

Downloads 131

Release Time : 12/11/2022

Model Overview

Speech recognition model specifically optimized for Cantonese, suitable for Cantonese speech-to-text tasks

Model Features

Cantonese optimization

Fine-tuned on Common Voice Cantonese dataset, specifically optimized for Cantonese speech recognition

Low error rate

Achieves a character error rate (CER) of 6.7274% on the test set, demonstrating excellent performance

Based on Whisper architecture

Built upon the powerful Whisper Large V2 base model, inheriting its outstanding speech recognition capabilities

Model Capabilities

Cantonese speech recognition

Speech-to-text

Automatic speech transcription

Use Cases

Speech transcription

Cantonese meeting minutes

Automatically transcribe Cantonese meeting content into written records

Character error rate 6.7274%

Cantonese media subtitle generation

Automatically generate subtitles for Cantonese video content

Voice assistant

Cantonese voice interaction

Supports Cantonese voice command recognition

Property	Details
Language	yue
License	apache - 2.0
Tags	whisper - event, hf - asr - leaderboard, generated_from_trainer
Datasets	mozilla - foundation/common_voice_11_0
Metrics	cer
Base Model	openai/whisper-large-v2

Training Loss	Epoch	Step	Validation Loss	Cer
0.0032	13.01	1000	0.2318	6.8569
0.002	26.01	2000	0.2404	7.1524
0.0001	39.02	3000	0.2807	6.7274
0.0001	53.01	4000	0.2912	6.7517
0.0	66.01	5000	0.2957	6.7638

Featured Recommended AI Models

Empowering the Future, Your AI Solution Knowledge Base

English 简体中文繁體中文にほんご

Whisper Large V2 Cantonese

Model Overview

Model Features

Model Capabilities

Use Cases

🚀 Whisper Large V2 Cantonese

🚀 Quick Start

📚 Documentation

Model Index

Training Procedure

Training Hyperparameters

Training Results

Framework Versions

📄 License