S

Stt Fa Fastconformer Hybrid Large

Developed by nvidia
This is a hybrid model for Persian Automatic Speech Recognition (ASR), combining transducer and CTC decoder losses, optimized based on the FastConformer architecture.
Downloads 2,398
Release Time : 11/21/2023

Model Overview

This model is used to transcribe Persian speech into text, being the 'large' version of the FastConformer transducer-CTC model with 115M parameters.

Model Features

Hybrid Training
Trained with both transducer and CTC decoder losses to enhance model robustness
Optimized Architecture
Based on the FastConformer architecture with 8x depthwise separable convolution downsampling
High Accuracy
Achieves excellent performance with 13.16% WER and 3.85% CER on Persian test sets

Model Capabilities

Persian Speech Recognition
Audio Transcription
Real-time Speech Processing

Use Cases

Speech-to-Text
Persian Speech Transcription
Convert Persian speech into text
Achieves 13.16% WER on the CommonVoice test set
Voice Assistants
Persian Voice Command Recognition
Used for developing Persian voice assistants
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase