W

Whisper Large Zh Cv11

Developed by jonatasgrosman
A speech recognition model fine-tuned on Chinese (Mandarin) using the Common Voice 11 dataset, based on openai/whisper-large-v2
Downloads 145
Release Time : 12/18/2022

Model Overview

This model is an automatic speech recognition (ASR) model optimized for Chinese (Mandarin), fine-tuned on the Common Voice 11 dataset, significantly improving Chinese speech recognition accuracy.

Model Features

Chinese Optimization
Specially fine-tuned for Mandarin Chinese, significantly improving Chinese speech recognition accuracy
Multi-scenario Evaluation
Comprehensively evaluated on both Common Voice and Fleurs datasets, covering original text and standardized text scenarios
Punctuation Support
Capable of recognizing and transcribing punctuation marks in speech

Model Capabilities

Mandarin Speech Recognition
Punctuation Recognition
Case Conversion

Use Cases

Speech Transcription
Meeting Minutes
Automatically transcribe Chinese meeting recordings into text records
On the Common Voice test set, CER is 9.55%, WER is 55.02%
Voice Notes
Convert personal voice memos into text
Voice Assistant
Chinese Voice Command Recognition
Used for Chinese voice command recognition in smart homes or mobile devices
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
Š 2025AIbase