ASR Conformer TransformerLM LibriSpeech

Developed by speechbrain
An automatic speech recognition model built with the SpeechBrain framework. It uses a Conformer encoder and a Transformer decoder, is trained on the LibriSpeech dataset, and recognizes English speech.
Downloads 984
Release Time: 6/21/2023

Model Overview

This model is an end-to-end automatic speech recognition system comprising a tokenizer, a neural language model, and an acoustic model. It converts English speech into text.

Model Features

Joint Decoding
Combines CTC and Transformer decoder scores during decoding to improve recognition accuracy
High Performance
Achieves a word error rate (WER) of 2.0% on test-clean and 4.5% on test-other, the two LibriSpeech test sets
Complete Toolchain
Provides full tool support from training to inference
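
Two of the features above can be illustrated with a short sketch. The first part shows one common way joint decoding is framed: a hypothesis is scored by a weighted sum of its attention-decoder and CTC log-probabilities. The second part shows how a word error rate is computed from a reference and a hypothesis transcript. This is an illustration only, not the model's actual decoder. The function names, the example log-probabilities, and the `ctc_weight` value of 0.4 are all assumptions made for this example.

```python
def joint_score(att_logprob, ctc_logprob, ctc_weight=0.4):
    """Interpolate attention-decoder and CTC log-probabilities.

    ctc_weight=0.4 is a hypothetical value, not taken from the model.
    """
    return (1.0 - ctc_weight) * att_logprob + ctc_weight * ctc_logprob


def rerank(hypotheses, ctc_weight=0.4):
    """Sort (text, att_logprob, ctc_logprob) tuples best-first by joint score."""
    return sorted(
        hypotheses,
        key=lambda h: joint_score(h[1], h[2], ctc_weight),
        reverse=True,
    )


def word_error_rate(reference, hypothesis):
    """WER = word-level edit distance / number of reference words."""
    ref, hyp = reference.split(), hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")
    # Row-by-row Levenshtein distance over words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        cur = [i]
        for j, hyp_word in enumerate(hyp, start=1):
            cur.append(min(
                prev[j] + 1,                           # deletion
                cur[j - 1] + 1,                        # insertion
                prev[j - 1] + (ref_word != hyp_word),  # substitution or match
            ))
        prev = cur
    return prev[-1] / len(ref)


# Made-up scores: the attention decoder alone prefers "the cat sad",
# but the CTC score pulls the joint ranking toward "the cat sat".
candidates = [("the cat sad", -3.8, -12.0), ("the cat sat", -4.0, -9.0)]
best_text = rerank(candidates)[0][0]

# One substitution and one deletion against six reference words.
wer = word_error_rate("the cat sat on the mat", "the cat sad on mat")
```

In this toy example the joint score changes which hypothesis wins, which is the intuition behind combining the two scores. A 2.0% WER corresponds to roughly two word errors per hundred reference words.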

Model Capabilities

English speech recognition
Audio file transcription
Batch speech processing
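
A minimal sketch of file transcription with SpeechBrain's `EncoderDecoderASR` interface follows. The Hub identifier `speechbrain/asr-conformer-transformerlm-librispeech` is inferred from the model name and developer listed above, and the `savedir` path and the helper names `load_asr` and `transcribe_files` are hypothetical, so check them against the published model card before relying on them.

```python
def load_asr(savedir="pretrained_models/asr-conformer-transformerlm-librispeech"):
    """Load the pretrained model (savedir is a hypothetical local cache path)."""
    # Imported lazily so this module loads even without SpeechBrain installed.
    try:
        from speechbrain.inference.ASR import EncoderDecoderASR  # SpeechBrain 1.0+
    except ImportError:
        from speechbrain.pretrained import EncoderDecoderASR  # older releases
    return EncoderDecoderASR.from_hparams(
        # Assumed Hub identifier, inferred from the model name and developer.
        source="speechbrain/asr-conformer-transformerlm-librispeech",
        savedir=savedir,
    )


def transcribe_files(asr_model, paths):
    """Transcribe each audio file in turn and map its path to the text."""
    return {path: asr_model.transcribe_file(path) for path in paths}
```

Typical use would be `asr = load_asr()` followed by `transcribe_files(asr, ["example.wav"])`, where `example.wav` stands in for your own English audio file. The first call to `load_asr` downloads the model files, so it needs network access. For true batched inference, the same interface also offers `transcribe_batch`, which expects a padded waveform tensor together with relative lengths.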

Use Cases

Speech Transcription
Audio File Transcription
Convert English speech files into text
Highly accurate transcription results
Speech Processing Systems
Voice Assistant
Serves as the recognition backend for voice assistants