S

Stt De Fastconformer Hybrid Large Pc

Developed by nvidia
This is a German automatic speech recognition model based on the FastConformer architecture, employing a hybrid training approach with Transformer and CTC, with a parameter size of approximately 115M.
Downloads 1,017
Release Time : 5/4/2023

Model Overview

This model is used for German speech transcription, supporting recognition of uppercase and lowercase letters, spaces, and basic punctuation marks. It is a pre-trained model from the NVIDIA NeMo toolkit.

Model Features

Hybrid Training Architecture
Utilizes both Transformer and CTC decoders for training, combining the advantages of both loss functions
Optimized FastConformer
Employs 8x depthwise separable convolution downsampling, making it more efficient than standard Conformer models
Multi-dataset Training
Trained on a composite German dataset of 2500 hours, including MCV12, MLS, and Voxpopuli
Punctuation Support
Capable of recognizing basic punctuation marks such as periods, commas, and question marks

Model Capabilities

German Speech Recognition
Punctuation Recognition
Case Letter Recognition

Use Cases

Speech Transcription
Speech-to-Text
Convert German speech content into text
Achieves a WER of 5.1% on the MCV12 test set
Meeting Minutes
Automatically generate text transcripts of meeting speeches
Speech Analysis
Speech Content Analysis
Provides a textual foundation for subsequent speech content analysis
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase