S

Stt Zh Conformer Transducer Large

Developed by nvidia
This is a large Conformer-Transducer model for transcribing Mandarin speech, with approximately 120 million parameters, trained on the AISHELL-2 dataset.
Downloads 72
Release Time : 6/29/2022

Model Overview

This model is an automatic speech recognition model based on the Conformer-Transducer architecture, specifically designed for Mandarin speech transcription tasks.

Model Features

High-performance Transcription
Achieves a character error rate (CER) of 5.3-5.7% on the AISHELL-2 test set
Large-scale Training
Utilizes a large model architecture with approximately 120 million parameters for more accurate transcription
Mandarin Optimization
Specially trained and optimized for Mandarin speech

Model Capabilities

Mandarin speech recognition
Audio transcription
Speech-to-text

Use Cases

Speech Transcription
Meeting Minutes
Automatically transcribe Mandarin meeting recordings into text records
Approximately 94.3-94.7% accuracy
Voice Assistant
Provide speech recognition capabilities for Mandarin voice assistants
Featured Recommended AI Models
ยฉ 2025AIbase