A

Asr Crdnn Rnnlm Librispeech

Developed by speechbrain
This is an end-to-end automatic speech recognition system pre-trained on the LibriSpeech dataset, employing a CRDNN architecture combined with CTC/attention mechanism and RNN language model, delivering excellent performance on English speech recognition tasks.
Downloads 1,354
Release Time : 3/2/2022

Model Overview

This model is a complete automatic speech recognition system, including a tokenizer, neural language model, and acoustic model, capable of converting English speech into text.

Model Features

Multi-Module Integration
Integrates a tokenizer, RNN language model, and CRDNN acoustic model to provide a complete speech recognition solution.
Dual Decoding Mechanism
Utilizes both CTC and attention mechanisms for decoding to improve recognition accuracy.
Efficient Training
Trained on the LibriSpeech dataset, employing convolutional neural network blocks and bidirectional LSTM for acoustic feature extraction.

Model Capabilities

English Speech Recognition
Audio Transcription
Speech-to-Text

Use Cases

Speech Transcription
Audio File Transcription
Convert English speech files into text
Achieves a word error rate of 3.09% on the LibriSpeech test set.
Featured Recommended AI Models
ยฉ 2025AIbase