S

Stt Fr Fastconformer Hybrid Large Pc

Developed by nvidia
This is a French automatic speech recognition model based on the FastConformer architecture, combining Transducer and CTC decoders for high accuracy and multi-domain adaptability.
Downloads 1,331
Release Time : 5/23/2023

Model Overview

The model can transcribe speech containing uppercase and lowercase French letters, spaces, periods, commas, and question marks. It is the 'large' version of the FastConformer Transducer-CTC model with approximately 115 million parameters.

Model Features

Hybrid Training
Combines both Transducer and CTC loss functions during training to improve model robustness.
Optimized Architecture
Utilizes the FastConformer architecture with 8x depthwise separable convolution downsampling for higher efficiency.
Multi-Dataset Training
Trained on 1800 hours of French speech data, including MCV12, MLS, and Voxpopuli datasets.
Punctuation Support
Supports transcription of text containing periods, commas, and question marks.

Model Capabilities

French Speech Recognition
Punctuation Recognition
Case Recognition
Long Audio Processing

Use Cases

Speech Transcription
Meeting Minutes
Convert French meeting recordings into text transcripts
WER 7.92 (MCV12 test set)
Video Captioning
Generate subtitles for French video content
WER 5.21 (MLS test set)
Speech Analysis
Speech Data Analysis
Analyze keywords and content in French speech data
WER 6.49 (VoxPopuli test set)
Featured Recommended AI Models
ยฉ 2025AIbase