P

Parakeet Rnnt 1.1b

Developed by nvidia
Parakeet RNNT 1.1B is an automatic speech recognition model jointly developed by NVIDIA NeMo and Suno.ai, based on the FastConformer Transducer architecture with approximately 1.1 billion parameters, supporting English speech transcription.
Downloads 13.18k
Release Time : 12/27/2023

Model Overview

This model is used to transcribe English speech into lowercase English text, demonstrating excellent performance on multiple public datasets.

Model Features

High-performance speech recognition
Achieves leading word error rate (WER) performance on multiple public test sets
Large-scale training data
Trained on a total of 64K hours of English speech data, including multiple public datasets
Optimized model architecture
Utilizes FastConformer architecture with 8x depthwise separable convolution downsampling
Multi-task training
Trained in a multi-task setting using transducer decoder (RNNT) loss

Model Capabilities

English speech recognition
Audio transcription
Automatic speech-to-text

Use Cases

Speech transcription
Meeting minutes
Automatically transcribe meeting recordings
Achieves 17.10% WER on AMI test set
Speech-to-text services
Generate transcripts for audio content
Achieves as low as 1.46% WER on LibriSpeech test set
Voice assistants
Provide speech recognition capabilities for voice assistants
Achieves 5.79% WER on Common Voice test set
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase