ASR Transformer AISHELL

Developed by speechbrain
A pre-trained end-to-end automatic speech recognition (ASR) system for Mandarin, built with the SpeechBrain framework. It pairs a Transformer encoder with joint CTC and Transformer decoding.
Downloads 76
Release Time: 3/2/2022

Model Overview

This is a Transformer model for Mandarin automatic speech recognition, trained on the AISHELL dataset, capable of converting Chinese speech into text.

Model Features

Joint Decoding Mechanism
Combines CTC and Transformer decoders, integrating CTC probability scores during decoding to improve recognition accuracy
Subword Unit Tokenization
Uses a tokenizer based on the unigram algorithm to split text into subword units, which helps the model generalize across vocabulary
Automatic Audio Processing
Built-in audio normalization, including automatic resampling and mono-channel selection, which simplifies usage
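
The joint decoding idea can be illustrated with a minimal sketch that interpolates two log-probabilities per hypothesis. This is not the model's actual decoder: a real joint decoder integrates CTC scores step by step inside beam search, whereas this sketch only rescores complete hypotheses. The function names, the example scores, and the weight of 0.4 are all assumptions chosen for illustration.

```python
import math


def joint_score(att_logprob, ctc_logprob, ctc_weight=0.4):
    """Interpolate attention-decoder and CTC log-probabilities.

    ctc_weight is a hypothetical value; the weight used by the
    released model is not stated in this document.
    """
    if not 0.0 <= ctc_weight <= 1.0:
        raise ValueError("ctc_weight must be in [0, 1]")
    return (1.0 - ctc_weight) * att_logprob + ctc_weight * ctc_logprob


def rerank(hypotheses, ctc_weight=0.4):
    """Sort (text, att_logprob, ctc_logprob) tuples, best joint score first."""
    return sorted(
        hypotheses,
        key=lambda h: joint_score(h[1], h[2], ctc_weight),
        reverse=True,
    )


# Made-up scores: the second hypothesis looks slightly better to the
# attention decoder alone, but CTC assigns it a much lower probability.
hyps = [
    ("今天天气很好", math.log(0.50), math.log(0.20)),
    ("今天天汽很好", math.log(0.55), math.log(0.02)),
]
best = rerank(hyps)[0][0]
print(best)
```

With the CTC term switched off (`ctc_weight=0.0`) the ranking of these two made-up hypotheses flips, which is the kind of correction the CTC scores are meant to provide.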

Model Capabilities

Mandarin speech recognition
Audio transcription
Batch speech processing
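
Batch processing of recordings with different durations usually means padding them to a common length and tracking how much of each row is real audio. The sketch below shows that step in plain Python. The helper name `pad_batch` is hypothetical, and representing lengths as fractions of the longest item is an assumption about the downstream interface, not something stated in this document.

```python
def pad_batch(waveforms, pad_value=0.0):
    """Right-pad mono waveforms to a common length.

    Returns (batch, relative_lengths), where each relative length is
    the original sample count divided by the longest sample count.
    """
    if not waveforms:
        raise ValueError("need at least one waveform")
    max_len = max(len(w) for w in waveforms)
    if max_len == 0:
        raise ValueError("all waveforms are empty")
    batch = [list(w) + [pad_value] * (max_len - len(w)) for w in waveforms]
    relative_lengths = [len(w) / max_len for w in waveforms]
    return batch, relative_lengths


# Two toy "recordings" of 4 and 2 samples.
batch, rel_lens = pad_batch([[0.1, 0.2, 0.3, 0.4], [0.5, 0.6]])
print(batch)
print(rel_lens)
```

Keeping the lengths alongside the padded batch lets a model ignore the padded tail of shorter recordings.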

Use Cases

Speech Transcription
Chinese Meeting Minutes
Automatically convert Chinese meeting recordings into text transcripts
Character error rate (CER) of 6.04% on the test set
Voice Input System
Provide voice input functionality for Chinese applications
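
For readers unfamiliar with the CER figure quoted above, the following is a minimal sketch of how a character error rate is commonly computed: the character-level edit distance between reference and hypothesis, divided by the reference length. It is a generic illustration, not the evaluation script used for the reported result, and the example sentences are made up.

```python
def edit_distance(ref, hyp):
    """Levenshtein distance between two character sequences."""
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        curr = [i]
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(
                prev[j] + 1,         # deletion
                curr[j - 1] + 1,     # insertion
                prev[j - 1] + cost,  # substitution or match
            ))
        prev = curr
    return prev[-1]


def cer(ref, hyp):
    """Character error rate: edit distance divided by reference length."""
    if not ref:
        raise ValueError("reference must not be empty")
    return edit_distance(ref, hyp) / len(ref)


# One substituted character in a six-character reference.
print(cer("今天天气很好", "今天天汽很好"))
```

Character-level scoring is a natural fit for Chinese, where text is not separated into words by spaces.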