S

Stt Es Conformer Ctc Large

Developed by nvidia
This is a large Conformer-CTC model for Spanish automatic speech recognition (ASR), trained and released by NVIDIA.
Downloads 59
Release Time : 7/8/2022

Model Overview

This model is used to transcribe speech containing lowercase Spanish letters with spaces, based on the Conformer architecture, using CTC loss/decoding method.

Model Features

High-performance Recognition
Excellent performance on multiple test sets, such as a WER of 5.5% on the Common Voice 7.0 test set
Large Training Dataset
Trained with 1,340 hours of Spanish speech data
Riva Deployment Compatible
Can be used with NVIDIA Riva for production-grade server deployment
Non-autoregressive Architecture
Adopts a non-autoregressive Conformer-CTC architecture with approximately 120 million parameters

Model Capabilities

Spanish Speech Recognition
Audio Transcription
Supports 16kHz Mono Audio Input

Use Cases

Speech-to-Text
Speech Transcription Service
Convert Spanish speech content into text
Highly accurate transcription results
Voice Assistant
Spanish Voice Assistant
Used for developing Spanish voice assistants
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
Š 2025AIbase