I

Indicconformer Stt As Hybrid Ctc Rnnt Large

Developed by ai4bharat
IndicConformer is a Conformer-based automatic speech recognition (ASR) model with a hybrid CTC-RNNT architecture, supporting Assamese speech transcription.
Downloads 101
Release Time : 9/5/2024

Model Overview

This model employs a Conformer-Large architecture as the encoder, equipped with a hybrid CTC-RNNT decoder, capable of transcribing Assamese speech into text.

Model Features

Hybrid Decoder Architecture
Combines the advantages of CTC and RNNT decoders to improve speech recognition accuracy and robustness.
Large Model Capacity
Includes 17 Conformer modules with a model dimension of 512 and 120 million parameters, capable of handling complex speech patterns.
Assamese Language Support
Specifically optimized for Assamese speech recognition, suitable for transcription tasks in this language.

Model Capabilities

Assamese Speech Recognition
Hybrid CTC-RNNT Decoding
16kHz Mono Audio Processing

Use Cases

Speech Transcription
Assamese Speech-to-Text
Transcribes Assamese audio files into text, suitable for voice recording, subtitle generation, and other scenarios.
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
Š 2025AIbase