Wav2vec2 Xls R 300m Zh CN

Developed by anantoj
This is an automatic speech recognition (ASR) model for Mandarin Chinese, fine-tuned from facebook/wav2vec2-xls-r-300m on the zh-CN portion of the Common Voice dataset.
Downloads 37
Release Time: 3/2/2022

Model Overview

This automatic speech recognition model is optimized for Mandarin Chinese. It was fine-tuned on the Common Voice dataset and converts spoken audio into text.

Model Features

Chinese optimization
Fine-tuned specifically for Mandarin Chinese speech recognition
Based on a large pretrained model
Built on the 300M-parameter wav2vec2-xls-r model, which provides strong speech feature extraction
Common Voice dataset
Trained on the Common Voice dataset, which helps the model generalize across speakers

Model Capabilities

Chinese speech recognition
Speech-to-text
Automatic speech transcription
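
As a rough illustration of the speech-to-text capability, the sketch below transcribes one audio file with the Hugging Face transformers ASR pipeline. The repository ID anantoj/wav2vec2-xls-r-300m-zh-CN is an assumption inferred from the developer and model name on this page, and the helper name transcribe is hypothetical. wav2vec2 models expect 16 kHz audio, and decoding from a file path requires ffmpeg to be installed.

```python
# Assumed Hub repository ID, inferred from the developer and model name.
# Verify it before use; it is not stated on this page.
MODEL_ID = "anantoj/wav2vec2-xls-r-300m-zh-CN"


def transcribe(audio_path: str, model_id: str = MODEL_ID) -> str:
    """Return the transcript of a single audio file (hypothetical helper)."""
    # Imported lazily so this module loads even without transformers installed.
    from transformers import pipeline

    # Build an ASR pipeline; the model is downloaded on first use.
    asr = pipeline("automatic-speech-recognition", model=model_id)

    # The pipeline decodes and resamples the file, then returns a dict
    # whose "text" entry holds the transcript.
    result = asr(audio_path)
    return result["text"]
```

Calling transcribe("meeting.wav"), where meeting.wav is a placeholder file name, would return the recognized text as a string. For repeated calls it would be cheaper to build the pipeline once and reuse it.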

Use Cases

Speech transcription
Meeting minutes
Automatically convert meeting recordings into text records
CER (Character Error Rate) approximately 20.59%
Voice input
Provide voice input functionality for applications
Accessibility technology
Real-time captions
Provide real-time speech-to-text services for hearing-impaired individuals
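
The CER figure quoted above is a character error rate. This page does not describe how it was measured, so the sketch below only shows the standard definition: the character-level edit distance between the reference and the model output, divided by the reference length. The function names and the space-stripping step are illustrative assumptions, not the evaluation code behind the 20.59% figure.

```python
def edit_distance(ref: str, hyp: str) -> int:
    """Levenshtein distance between two strings, counted in characters."""
    # prev[j] holds the distance between the processed prefix of ref
    # and the first j characters of hyp.
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        curr = [i]
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(
                prev[j] + 1,         # deletion
                curr[j - 1] + 1,     # insertion
                prev[j - 1] + cost,  # substitution or match
            ))
        prev = curr
    return prev[-1]


def cer(reference: str, hypothesis: str) -> float:
    """Character error rate: edit distance divided by reference length."""
    # Assumption: spaces are ignored, since written Mandarin has no word spacing.
    ref = reference.replace(" ", "")
    hyp = hypothesis.replace(" ", "")
    if not ref:
        raise ValueError("reference must not be empty")
    return edit_distance(ref, hyp) / len(ref)


# One wrong character out of six gives a CER of 1/6.
score = cer("今天天气很好", "今天天汽很好")
```

A CER near 20% means roughly one character in five needs correcting, which is worth keeping in mind when using transcripts as meeting minutes.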