W

Wav2vec2 Large Xlsr 53 Chinese Zn Cn Aishell1

Developed by qinyue
A Chinese speech recognition model fine-tuned on the AISHELL-1 dataset based on facebook/wav2vec2-large-xlsr-53, supporting Chinese speech recognition tasks.
Downloads 22
Release Time : 6/16/2022

Model Overview

This model is an automatic speech recognition (ASR) model specifically optimized for Chinese speech, capable of converting Chinese speech into text.

Model Features

Chinese Speech Recognition
A recognition model specifically optimized for Chinese speech, performing excellently on the AISHELL-1 dataset.
No Language Model Required
Can be used directly without additional language model support.
High Accuracy
Achieves a word error rate (WER) of 7.04% on the AISHELL-1 test set, which can be reduced to 3.96% with a language model.

Model Capabilities

Chinese Speech Recognition
16kHz Sampling Rate Audio Processing

Use Cases

Speech Transcription
Meeting Minutes
Automatically convert meeting recordings into text transcripts
Accuracy up to 92.96% (WER 7.04%)
Voice Assistant
Used for human-computer interaction in Chinese voice assistants
Speech Analysis
Speech Content Analysis
Analyze keywords and topics in speech content
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase