S

Stt En Fastconformer Transducer Large

Developed by nvidia
This is a large automatic speech recognition (ASR) model based on the FastConformer architecture, specifically designed for transcribing English speech into text.
Downloads 1,398
Release Time : 6/8/2023

Model Overview

The model employs an optimized FastConformer architecture and Transducer decoder to efficiently and accurately convert English speech into text.

Model Features

Optimized FastConformer Architecture
Utilizes 8x depthwise separable convolution downsampling, making it more efficient than standard Conformer models
Multi-dataset Training
Trained on a comprehensive dataset containing thousands of hours of English speech, covering various speech scenarios
High Performance
Outperforms on multiple standard test sets, such as achieving a WER as low as 1.8% on the LibriSpeech test set
Easy to Use
Provides a simple Python API for speech transcription, supporting batch processing

Model Capabilities

English speech recognition
Audio transcription
Batch speech processing

Use Cases

Speech Transcription
Meeting Minutes
Automatically transcribe meeting recordings into text records
Media Caption Generation
Automatically generate captions for video and podcast content
Speech Analysis
Customer Service Call Analysis
Transcribe and analyze customer service call content
Featured Recommended AI Models
ยฉ 2025AIbase