I

Indri 0.1 350m Tts

Developed by 11mlabs
Indri is a novel, ultra-small, lightweight TTS model based on the Transformer architecture, supporting text-to-speech tasks in English and Hindi.
Downloads 1,088
Release Time : 11/20/2024

Model Overview

This model models audio as tokens, capable of generating high-quality audio while maintaining speaker style consistency. Supports voice cloning and code-mixed text input.

Model Features

Small and Lightweight
Based on the GPT-2 medium architecture, compact yet powerful
Ultra-fast Inference
Achieves up to 300 toks/s generation speed on RTX6000Ada GPU, with first token latency below 20ms
Voice Cloning
Supports speaker style cloning based on short prompts (<5 seconds)
Multilingual Support
Supports code-mixed input for English and Hindi
Batch Processing
Supports batch processing of approximately 300 sequences on RTX6000Ada

Model Capabilities

Text-to-speech
Voice Cloning
Multilingual Speech Synthesis
Batch Voice Generation

Use Cases

Content Creation
Audiobook Generation
Automatically generates high-quality audio versions for e-books
Offers multiple speaker style options
Educational Content
Generates multilingual speech content for educational materials
Supports mixed English and Hindi content
Business Applications
Voice Assistants
Integrates natural voice output for applications
Low-latency response
Advertising Content
Quickly generates advertising voices in different styles
Supports multiple speaker styles
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase