I

Indri 0.1 124m Tts

Developed by 11mlabs
Indri is an ultra-compact lightweight TTS model based on Transformer architecture, supporting English and Hindi text-to-speech tasks.
Downloads 182
Release Time : 11/12/2024

Model Overview

This model can generate high-quality audio while maintaining speaker style cloning consistency, supporting voice cloning through short prompts.

Model Features

Ultra-Compact Lightweight
Based on GPT-2 small architecture with only 124M parameters, scalable to any autoregressive Transformer-based architecture
Ultra-Fast Inference
Achieves speeds up to 400 tokens/s on RTX6000Ada GPU, with first token latency below 20ms
Voice Cloning Support
Enables speaker style cloning with short prompts (<5 seconds)
Multilingual Mixed Support
Supports code-mixed text input for English and Hindi

Model Capabilities

Text-to-Speech
Voice Cloning
Multilingual Mixed Processing

Use Cases

Speech Synthesis
Multilingual Audiobooks
Generates natural speech for English and Hindi content
High-quality audio output with speaker consistency
Voice Assistants
Provides speech synthesis capabilities for multilingual voice assistants
Supports fast-response voice generation
Education
Language Learning Tools
Provides pronunciation examples for language learners
Supports bilingual mixed pronunciation demonstrations
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase