Stt Uz Fastconformer Hybrid Large Pc

Developed by NVIDIA
This is a large Uzbek speech recognition model based on the FastConformer architecture. It supports both Transducer and CTC decoding and performs well across multiple test sets.
Downloads 96
Release Time: 10/31/2024

Model Overview

This model is designed for Uzbek speech recognition. It transcribes speech into text with uppercase and lowercase letters, spaces, and common punctuation marks, and is suitable for general speech recognition scenarios.

Model Features

Dual Decoding Mechanism
Supports both Transducer and CTC decoding methods, providing more flexible inference options
Efficient Architecture
Uses the optimized FastConformer architecture, which is more computationally efficient than the standard Conformer
Multi-dataset Training
Trained on 1000 hours of Uzbek speech data, covering various speech scenarios
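
The dual decoding mechanism described above can be sketched with the NVIDIA NeMo toolkit. This is a minimal sketch under stated assumptions, not a definitive recipe: the checkpoint identifier `nvidia/stt_uz_fastconformer_hybrid_large_pc` is inferred from the model title, the helper names `check_decoder` and `transcribe_uzbek` are hypothetical, and the return type of `transcribe` differs between NeMo versions (plain strings in some, hypothesis objects in others).

```python
from typing import Sequence

# Assumed checkpoint identifier, inferred from the model title on this page.
MODEL_NAME = "nvidia/stt_uz_fastconformer_hybrid_large_pc"

# The two decoders a hybrid Transducer/CTC model exposes.
DECODERS = ("rnnt", "ctc")


def check_decoder(decoder: str) -> str:
    """Normalize and validate the decoder name (hypothetical helper)."""
    name = decoder.lower()
    if name not in DECODERS:
        raise ValueError(f"decoder must be one of {DECODERS}, got {decoder!r}")
    return name


def transcribe_uzbek(paths: Sequence[str], decoder: str = "rnnt", batch_size: int = 4):
    """Transcribe audio files with the chosen decoder (hypothetical helper).

    Requires the NeMo toolkit with ASR support to be installed; the import
    is done lazily so the validation above works without it.
    """
    decoder = check_decoder(decoder)

    import nemo.collections.asr as nemo_asr

    # Load the hybrid Transducer/CTC checkpoint.
    model = nemo_asr.models.EncDecHybridRNNTCTCBPEModel.from_pretrained(
        model_name=MODEL_NAME
    )
    # Select the Transducer ("rnnt") or CTC ("ctc") decoder.
    model.change_decoding_strategy(decoder_type=decoder)
    # Files are processed in batches of `batch_size`.
    return model.transcribe(list(paths), batch_size=batch_size)
```

Calling `transcribe_uzbek(["meeting.wav"], decoder="ctc")` would switch to the CTC branch; `meeting.wav` is a placeholder file name. Both decoders share the same encoder, so switching between them does not require loading a second model.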

Model Capabilities

Uzbek speech recognition
Audio to text conversion
Batch speech processing

Use Cases

Speech Transcription
Meeting Minutes
Automatically transcribe Uzbek meeting recordings into written records
Word error rate (WER) of around 16-17% in general scenarios
Voice Assistant
Provide speech recognition capabilities for Uzbek voice assistants
Education
Language Learning
Assist Uzbek language learners in checking pronunciation accuracy
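
The WER figure quoted for meeting transcription is the word-level edit distance between the reference transcript and the model output, divided by the number of reference words. The sketch below shows that calculation. The function name `word_error_rate` is illustrative, and the code applies no text normalization, so differences in case or punctuation count as errors; how the reported figure treats them is not stated here.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Return (substitutions + deletions + insertions) / reference word count."""
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference words seen so far
    # and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        cur = [i]  # deleting all i reference words
        for j, hyp_word in enumerate(hyp, start=1):
            substitution = prev[j - 1] + (0 if ref_word == hyp_word else 1)
            deletion = prev[j] + 1
            insertion = cur[j - 1] + 1
            cur.append(min(substitution, deletion, insertion))
        prev = cur
    return prev[-1] / len(ref)


# One substituted word and one dropped word out of four reference words.
print(word_error_rate("a b c d", "a x c"))  # 0.5
```

A WER of 16-17% therefore corresponds to roughly one word error for every six reference words.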