STT De Conformer CTC Large

Developed by NVIDIA
This is a large-scale Conformer-CTC model for German automatic speech recognition, trained and optimized by NVIDIA on thousands of hours of German speech data.
Downloads 132
Release date: 6/28/2022

Model Overview

The model transcribes German speech into lowercase text (including spaces) and uses a non-autoregressive variant of the Conformer architecture with approximately 120 million parameters.
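Non-autoregressive here means the network emits one label per audio frame in a single pass, and CTC decoding then merges repeated labels and drops a special blank symbol. The following is a minimal sketch of that greedy collapse step only. The character-level vocabulary, the blank index, and the frame labels are hypothetical toy values chosen for illustration; they are not the model's actual tokenizer or output.

```python
from itertools import groupby

# Hypothetical toy vocabulary; index 0 plays the role of the CTC blank.
VOCAB = ["<blank>", " ", "a", "g", "n", "t", "u", "e"]
BLANK_ID = 0


def ctc_greedy_decode(frame_ids, vocab=VOCAB, blank_id=BLANK_ID):
    """Merge consecutive duplicate labels, then remove blanks."""
    collapsed = [label for label, _ in groupby(frame_ids)]
    return "".join(vocab[i] for i in collapsed if i != blank_id)


# Made-up per-frame argmax labels for a short utterance.
frames = [3, 3, 0, 6, 6, 5, 0, 7, 7, 4, 0, 1, 1, 5, 2, 0, 3]
print(ctc_greedy_decode(frames))  # guten tag
```

Because duplicates are merged before blanks are removed, a genuinely doubled character needs a blank between its two occurrences: `[2, 0, 2]` decodes to `aa`, while `[2, 2]` decodes to `a`.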

Model Features

Large-scale Training Data
Trained on thousands of hours of German speech data, including datasets such as VoxPopuli, Multilingual LibriSpeech, and Mozilla Common Voice.
High Performance
Evaluated on multiple test sets; achieves a word error rate (WER) of 6.68% on the Common Voice 7 test set.
Riva Compatible
Compatible with NVIDIA Riva for production-grade server deployment.
Non-autoregressive Architecture
Uses a non-autoregressive variant of Conformer with CTC loss/decoding, optimized for efficient speech recognition.
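The WER figure listed under High Performance is the word-level edit distance between the reference and the recognized text, divided by the number of reference words. A minimal sketch of that calculation for a single utterance is shown here; the function name and example sentences are illustrative assumptions, and this is not the evaluation script behind the reported number.

```python
def word_error_rate(reference, hypothesis):
    """Word-level Levenshtein distance divided by the reference length."""
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference words seen
    # so far and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]
        for j, hyp_word in enumerate(hyp, start=1):
            substitution = prev[j - 1] + (0 if ref_word == hyp_word else 1)
            deletion = prev[j] + 1
            insertion = curr[j - 1] + 1
            curr.append(min(substitution, deletion, insertion))
        prev = curr
    return prev[-1] / len(ref)


# One deleted word out of five reference words.
print(word_error_rate("das ist ein kleiner test", "das ist ein test"))  # 0.2
```

For a whole test set, the usual convention is to sum the edit distances over all utterances and divide by the total number of reference words, rather than averaging per-utterance rates.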

Model Capabilities

German Speech Recognition
Audio Transcription
Supports 16 kHz Mono Audio Input
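Since the model expects 16 kHz mono input, it can help to check a file's format before transcription. The sketch below uses only Python's standard `wave` module and assumes uncompressed PCM WAV files; the function names are hypothetical, and resampling or channel mixing is left to whatever audio tool is already in use.

```python
import wave


def wav_format(path):
    """Return (sample_rate_hz, channel_count) for a PCM WAV file."""
    with wave.open(path, "rb") as wav:
        return wav.getframerate(), wav.getnchannels()


def is_16k_mono(path):
    """True if the file is already 16 kHz, single-channel audio."""
    rate, channels = wav_format(path)
    return rate == 16000 and channels == 1
```

A file that fails this check would need to be resampled or downmixed before being passed to the model.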

Use Cases

Speech Transcription
Meeting Minutes
Automatically transcribe German meeting recordings into text records
Highly accurate transcribed text
Voice Assistants
Provide speech recognition capabilities for German voice assistants
Real-time and accurate speech-to-text
Media Processing
Subtitle Generation
Automatically generate subtitles for German video content
Efficient and accurate synchronized subtitles
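For the subtitle use case, transcribed segments have to be written in a subtitle format such as SubRip (SRT). The sketch below assumes that segment start and end times in seconds are already available from some separate segmentation or alignment step, which this page does not describe; the function names and the example segments are hypothetical.

```python
def srt_timestamp(seconds):
    """Format a time in seconds as HH:MM:SS,mmm for an SRT cue."""
    millis = round(seconds * 1000)
    hours, millis = divmod(millis, 3_600_000)
    minutes, millis = divmod(millis, 60_000)
    secs, millis = divmod(millis, 1000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d},{millis:03d}"


def to_srt(segments):
    """Build SRT text from (start_s, end_s, text) tuples."""
    cues = []
    for index, (start, end, text) in enumerate(segments, start=1):
        cues.append(
            f"{index}\n{srt_timestamp(start)} --> {srt_timestamp(end)}\n{text}\n"
        )
    # Cues are separated by a blank line.
    return "\n".join(cues)


# Made-up segments for illustration.
print(to_srt([(0.0, 1.5, "guten tag"), (1.5, 3.0, "wie geht es ihnen")]))
```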