A

Asr Conformer Largescaleasr

Developed by speechbrain
This is an end-to-end automatic speech recognition system trained using the SpeechBrain framework, employing the Conformer architecture on 25,000 hours of English speech data.
Downloads 92
Release Time : 2/6/2025

Model Overview

This model is a high-performance automatic speech recognition system that combines a Conformer encoder with a CTC+Transformer joint decoder, supporting English speech transcription.

Model Features

Large-scale training data
Trained on the 25,000-hour LargeScaleASR dataset, covering various speech scenarios
Efficient architecture
Utilizes the Conformer architecture, combining the strengths of CNN and Transformer, ideal for speech recognition tasks
Flexible decoding
Supports multiple decoding methods, including large beam width full decoding, greedy decoding, and attention-only decoding

Model Capabilities

English speech recognition
Audio transcription
Speech-to-text

Use Cases

Speech transcription
Meeting minutes
Automatically transcribe meeting recordings into text records
Validation set WER 6.8, test set WER 7.5
Voice notes
Convert voice notes into searchable text
Assistive technology
Real-time caption generation
Generate real-time captions for video or live content
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase