A

Asr Wav2vec2 Commonvoice 14 Zh CN

Developed by speechbrain
This is an end-to-end automatic speech recognition system trained on the CommonVoice Chinese dataset, using wav2vec2.0 and CTC architecture, supporting Chinese speech recognition.
Downloads 36
Release Time : 8/9/2023

Model Overview

This model is an automatic speech recognition system specifically designed for Chinese speech, capable of converting Chinese speech to text. It combines a pre-trained wav2vec2.0 model with a CTC decoder, fine-tuned on the CommonVoice Chinese dataset.

Model Features

End-to-end speech recognition
Provides complete speech-to-text conversion without requiring additional language models
Pre-trained on wav2vec2.0
Uses facebook/wav2vec2-large-xlsr-53 as the base model, with powerful acoustic feature extraction capabilities
Optimized for Chinese
Specifically optimized for Chinese speech characteristics, fine-tuned on the CommonVoice Chinese dataset
Lightweight inference
Supports both CPU and GPU inference, suitable for various deployment scenarios

Model Capabilities

Chinese speech recognition
Audio transcription
Speech-to-text

Use Cases

Speech transcription
Automatic meeting transcription
Automatically convert Chinese meeting recordings into text transcripts
Voice note conversion
Convert users' Chinese voice notes into editable text
Assistive technology
Voice input system
Add Chinese voice input functionality to applications
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase