IndicConformer STT As Hybrid CTC RNNT Large
Developed by ai4bharat
IndicConformer is a Conformer-based automatic speech recognition (ASR) model with a hybrid CTC-RNNT architecture, supporting Assamese speech transcription.
Downloads 101
Release Time: 9/5/2024
Model Overview
The model uses a Conformer-Large encoder with a hybrid CTC-RNNT decoder and transcribes Assamese speech into text.
Model Features
Hybrid Decoder Architecture
Pairs a CTC decoder with an RNNT decoder on the same encoder, aiming to improve recognition accuracy and robustness.
Large Model Capacity
Includes 17 Conformer blocks with a model dimension of 512, for a total of 120 million parameters, giving it the capacity to model complex speech patterns.
Assamese Language Support
Specifically optimized for Assamese speech recognition, suitable for transcription tasks in this language.
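To illustrate the CTC half of the hybrid decoder described above, here is a minimal sketch of greedy CTC decoding: merge consecutive repeated frame labels, then drop the blank symbol. This is a generic textbook illustration, not the model's actual decoding code. The function names, the blank index of 0, and the toy vocabulary are all assumptions made for the example.

```python
from itertools import groupby


def ctc_greedy_decode(frame_ids, blank_id=0):
    """Collapse a per-frame label sequence the way greedy CTC decoding does.

    frame_ids: the most likely label index for each audio frame.
    blank_id:  index of the CTC blank symbol (0 here is an assumption).
    """
    # Step 1: merge runs of identical consecutive labels into one label.
    collapsed = [label for label, _ in groupby(frame_ids)]
    # Step 2: remove blanks, which only separate genuine repeats.
    return [label for label in collapsed if label != blank_id]


def ids_to_text(ids, vocab):
    """Map label indices to text using a (hypothetical) vocabulary dict."""
    return "".join(vocab[i] for i in ids)


if __name__ == "__main__":
    # Toy vocabulary for illustration only; the real model has its own tokenizer.
    toy_vocab = {1: "a", 2: "b", 3: "c"}
    frames = [0, 1, 1, 0, 1, 2, 2, 0, 0, 3]
    print(ids_to_text(ctc_greedy_decode(frames), toy_vocab))
```

The blank between the two runs of label 1 is what lets the decoder keep both occurrences instead of merging them into one.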
Model Capabilities
Assamese Speech Recognition
Hybrid CTC-RNNT Decoding
16kHz Mono Audio Processing
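Since the model processes 16 kHz mono audio, it can help to check a file's format before transcription. The following is a minimal sketch using only the Python standard library; the helper names are hypothetical, and it handles uncompressed WAV files only. It does not resample or downmix, it only reports whether a file already matches the expected format.

```python
import os
import tempfile
import wave

TARGET_RATE = 16000   # 16 kHz sample rate, as listed in the capabilities
TARGET_CHANNELS = 1   # mono


def is_16k_mono(path):
    """Return True if the WAV file at `path` is 16 kHz and single-channel."""
    with wave.open(path, "rb") as wav:
        return (wav.getframerate() == TARGET_RATE
                and wav.getnchannels() == TARGET_CHANNELS)


def write_silence(path, rate, channels, seconds=0.1):
    """Write a short silent 16-bit PCM WAV file (helper for the demo only)."""
    n_frames = int(rate * seconds)
    with wave.open(path, "wb") as wav:
        wav.setnchannels(channels)
        wav.setsampwidth(2)  # 2 bytes per sample = 16-bit PCM
        wav.setframerate(rate)
        wav.writeframes(b"\x00\x00" * n_frames * channels)


if __name__ == "__main__":
    with tempfile.TemporaryDirectory() as tmp:
        ok_path = os.path.join(tmp, "ok.wav")
        stereo_path = os.path.join(tmp, "stereo_44k.wav")
        write_silence(ok_path, 16000, 1)
        write_silence(stereo_path, 44100, 2)
        print(is_16k_mono(ok_path))
        print(is_16k_mono(stereo_path))
```

Files that fail the check would need to be resampled and downmixed with an audio tool of your choice before being passed to the model.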
Use Cases
Speech Transcription
Assamese Speech-to-Text
Transcribes Assamese audio files into text, suitable for transcribing voice recordings, generating subtitles, and similar scenarios.
© 2025 AIbase