I

Indicconformer Stt Gu Hybrid Ctc Rnnt Large

Developed by ai4bharat
IndicConformer is a Conformer-based automatic speech recognition (ASR) model with a hybrid CTC-RNNT architecture, specifically designed for Gujarati speech transcription.
Downloads 340
Release Time : 9/5/2024

Model Overview

This model adopts the Conformer-Large architecture, capable of transcribing Gujarati speech from 16kHz mono audio into text.

Model Features

Hybrid Decoding Architecture
Supports both CTC and RNNT decoding methods, providing more flexible inference options
Large Model Capacity
Encoder structure with 120 million parameters, equipped with powerful speech feature extraction capabilities
Specialized Optimization
Specially trained and optimized for Gujarati language

Model Capabilities

Gujarati Speech Recognition
16kHz Audio Processing
Mono Audio Transcription

Use Cases

Speech-to-Text
Gujarati Meeting Minutes
Automatically transcribe Gujarati meeting recordings into text records
Generate accurate meeting transcripts
Voice Assistant
Provide voice input support for Gujarati-speaking users
Enable Gujarati voice interaction
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
Š 2025AIbase