W

Wav2vec2 Large Chinese Zh Cn

Developed by wbbbbb
Chinese speech recognition model fine-tuned based on XLSR-53 large model, supporting 16kHz sampled audio input
Downloads 585
Release Time : 7/18/2022

Model Overview

This model is a fine-tuned XLSR-53 large model for Chinese speech recognition tasks, trained on Chinese speech datasets such as Common Voice, and can be directly used for speech-to-text tasks

Model Features

Chinese Speech Recognition Optimization
Specially fine-tuned for Chinese speech characteristics, outperforming general models in Chinese speech recognition tasks
Multi-dataset Training
Trained using multiple Chinese speech datasets including Common Voice 6.1, CSS10, and ST-CMDS
No Language Model Required
Can be used directly without additional language model support

Model Capabilities

Chinese Speech Recognition
Speech-to-Text
16kHz Audio Processing

Use Cases

Speech Transcription
Automatic Meeting Minutes Transcription
Automatically convert Chinese meeting recordings into text records
Voice Note Conversion
Convert personal voice memos into searchable text
Accessibility Applications
Real-time Caption Generation
Provide real-time speech-to-text services for hearing-impaired users
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
Š 2025AIbase