W

Wav2vec2 Large Xlsr 53 Chinese Zh Cn Gpt

Developed by ydshieh
A Chinese (zh-CN) speech recognition model fine-tuned on the Common Voice dataset based on facebook/wav2vec2-large-xlsr-53
Downloads 127
Release Time : 3/2/2022

Model Overview

This model is an automatic speech recognition (ASR) model for Mandarin Chinese, based on the Wav2Vec2 architecture, fine-tuned on the Chinese dataset of Common Voice, supporting Simplified Chinese speech-to-text tasks.

Model Features

Multi-dataset Fine-tuning
Trained using both zh-CN and zh-TW datasets from Common Voice, with Traditional Chinese converted to Simplified Chinese
No Language Model Required
Can be used directly without additional language model support
Standard Sampling Rate Support
Supports standard 16kHz speech input sampling rate

Model Capabilities

Chinese Speech Recognition
Speech-to-Text
Mandarin Recognition

Use Cases

Speech Transcription
Speech Transcription
Convert Chinese speech content into text format
CER 20.90%
Voice Assistants
Voice Command Recognition
Recognize user's Chinese voice commands
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase