Whisper Large Zh Cv11
A Mandarin Chinese speech recognition model, fine-tuned from openai/whisper-large-v2 on the Common Voice 11 dataset.
Downloads 145
Release Date: 12/18/2022
Model Overview
This is an automatic speech recognition (ASR) model for Mandarin Chinese. It was fine-tuned on the Common Voice 11 dataset to improve Chinese recognition accuracy over the base model.
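A minimal sketch of how a model like this might be loaded for transcription with the Hugging Face `transformers` pipeline. The Hub identifier `jonatasgrosman/whisper-large-zh-cv11` is an assumption inferred from the model name and is not stated on this page, and `meeting.wav` is a hypothetical file; substitute the real identifier and path for your setup.

```python
# Assumed Hub id: inferred from the model name, NOT confirmed by this page.
MODEL_ID = "jonatasgrosman/whisper-large-zh-cv11"


def transcribe(audio_path, model_id=MODEL_ID):
    """Return the transcript of one audio file as a string.

    transformers is imported lazily so that this module can be imported
    without the library (or the model weights) being present.
    """
    from transformers import pipeline

    # chunk_length_s=30 splits long recordings into 30-second windows,
    # which matches Whisper's fixed input length.
    asr = pipeline(
        "automatic-speech-recognition",
        model=model_id,
        chunk_length_s=30,
    )
    result = asr(audio_path)
    return result["text"]


if __name__ == "__main__":
    # Hypothetical input file; downloads the model on first run.
    print(transcribe("meeting.wav"))
```

Reading audio from a file path this way typically requires `ffmpeg` to be installed, and a model of this size generally runs much faster on a GPU.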
Model Features
Chinese Optimization
Fine-tuned specifically for Mandarin Chinese to improve recognition accuracy
Multi-scenario Evaluation
Evaluated on both the Common Voice and Fleurs datasets, against both raw and normalized reference text
Punctuation Support
Produces transcripts that include punctuation marks
Model Capabilities
Mandarin Speech Recognition
Punctuation Recognition
Case Conversion
Use Cases
Speech Transcription
Meeting Minutes
Automatically transcribe Chinese meeting recordings into text records
On the Common Voice test set, the model reaches a character error rate (CER) of 9.55% and a word error rate (WER) of 55.02%
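To make the two metrics concrete, here is a small sketch of how CER and WER can be computed from edit distance. It is an illustration only: the page does not say which tool or tokenization produced the reported figures, and the whitespace-based word splitting below is an assumption. The example sentences are made up.

```python
def edit_distance(ref, hyp):
    """Levenshtein distance between two sequences (substitute/insert/delete cost 1)."""
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        curr = [i]
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # substitution or match
        prev = curr
    return prev[-1]


def cer(reference, hypothesis):
    """Character error rate: character-level edits divided by reference length."""
    ref = list(reference.replace(" ", ""))
    hyp = list(hypothesis.replace(" ", ""))
    if not ref:
        raise ValueError("reference must not be empty")
    return edit_distance(ref, hyp) / len(ref)


def wer(reference, hypothesis):
    """Word error rate, with words assumed to be whitespace-separated."""
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must not be empty")
    return edit_distance(ref, hyp) / len(ref)


if __name__ == "__main__":
    reference = "今天天气很好"
    hypothesis = "今天天汽很好"  # one wrong character out of six
    print(f"CER = {cer(reference, hypothesis):.4f}")
    print(f"WER = {wer(reference, hypothesis):.4f}")
```

Because written Chinese has no spaces between words, a whitespace split treats the whole sentence as a single word, so one wrong character makes the entire "word" wrong. Under that assumption WER can sit far above CER, which is why CER is usually the more informative metric for Chinese.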
Voice Notes
Convert personal voice memos into text
Voice Assistant
Chinese Voice Command Recognition
Used for Chinese voice command recognition in smart homes or mobile devices