
Stt Es Conformer Transducer Large

Developed by nvidia
This is a large Conformer-Transducer model for Spanish automatic speech recognition, with approximately 120 million parameters, trained on 1340 hours of Spanish speech data.
Downloads 708
Release Time: 7/8/2022

Model Overview

This model transcribes Spanish speech into lowercase text made up of Spanish letters and spaces. It is based on the Conformer-Transducer architecture and offers high accuracy and streaming capabilities.
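Because the output contains only lowercase letters and spaces, reference transcripts usually need the same normalization before they can be compared with the model's output. The following is a minimal sketch of such a step. The function name `normalize_reference` is hypothetical, and the exact character set (a-z plus á, é, í, ó, ú, ü, ñ) is an assumption, not something stated on this page.

```python
import re
import unicodedata

# Assumed output alphabet: basic Latin letters plus Spanish accented
# vowels, u-diaeresis and n-tilde. Everything else is treated as a separator.
_NOT_ALLOWED = re.compile(r"[^a-záéíóúüñ ]+")


def normalize_reference(text: str) -> str:
    """Lowercase a transcript and keep only Spanish letters and single spaces."""
    # Compose characters so that "n" + combining tilde becomes a single "ñ".
    text = unicodedata.normalize("NFC", text).lower()
    # Replace punctuation, digits and other symbols with a space.
    text = _NOT_ALLOWED.sub(" ", text)
    # Collapse runs of whitespace and trim the ends.
    return " ".join(text.split())


print(normalize_reference("¡Hola, Señor Pérez!  ¿Qué tal?"))
```

Note that this sketch drops digits instead of spelling them out, so numbers in a reference transcript would need to be written as words beforehand.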

Model Features

High Accuracy Spanish Recognition
Achieves low word error rates (WER) on multiple test sets, for example a WER of 5.2% on the Common Voice 7.0 test set.
Large-scale Training Data
Trained on a composite dataset containing 1340 hours of Spanish speech.
Streaming Capability
Based on the Transducer architecture, supporting streaming speech recognition.
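For readers unfamiliar with the WER metric quoted above, it is the word-level edit distance (substitutions, deletions and insertions) between a reference transcript and the model's hypothesis, divided by the number of reference words. The sketch below computes it for a single sentence pair. The function name `word_error_rate` is illustrative, and this is not the evaluation script used to produce the reported figure.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level Levenshtein distance divided by the reference length."""
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference words seen
    # so far and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]  # deleting all i reference words so far
        for j, hyp_word in enumerate(hyp, start=1):
            substitution = prev[j - 1] + (0 if ref_word == hyp_word else 1)
            deletion = prev[j] + 1
            insertion = curr[j - 1] + 1
            curr.append(min(substitution, deletion, insertion))
        prev = curr
    return prev[-1] / len(ref)


# One substituted word out of four reference words.
print(word_error_rate("el gato come pescado", "el gato como pescado"))
```

A WER over a whole test set is normally the total number of word errors divided by the total number of reference words, not the average of per-sentence values.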

Model Capabilities

Spanish Speech Recognition
Audio Transcription
Streaming Speech Processing

Use Cases

Speech-to-Text
Speech transcription service: converts Spanish speech content into text, with highly accurate transcription results.
Voice Assistants
Spanish voice interaction: used for developing Spanish voice assistants.
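A transcription service of this kind could be sketched with the NVIDIA NeMo toolkit as follows. This assumes NeMo's ASR collection is installed and that the checkpoint is published under the identifier `nvidia/stt_es_conformer_transducer_large`, which is inferred from the model name and developer on this page, not stated in it. The wrapper `transcribe_spanish` is a hypothetical helper.

```python
from typing import Iterable, List


def transcribe_spanish(
    audio_paths: Iterable[str],
    # Assumed identifier, derived from the model name on this page.
    model_name: str = "nvidia/stt_es_conformer_transducer_large",
) -> List:
    """Transcribe audio files with a pretrained NeMo transducer model."""
    paths = list(audio_paths)
    if not paths:
        return []

    # Imported lazily so the module can be loaded without NeMo installed.
    import nemo.collections.asr as nemo_asr

    # Downloads the checkpoint on first use, then loads it from the cache.
    model = nemo_asr.models.EncDecRNNTBPEModel.from_pretrained(model_name)
    return model.transcribe(paths)
```

Depending on the NeMo version, the returned entries may be plain strings or hypothesis objects that carry the text, so a caller should check which form it receives. The file paths passed in are the caller's own. A long-running service would load the model once and reuse it instead of reloading it on every call.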