IndicF5 Open-source Multilingual Text-to-Speech Model - Supports 11 Indian Languages, Nearly Human-like Pronunciation

Indicf5

Developed by ai4bharat

IndicF5 is a near-human multilingual text-to-speech (TTS) model trained on 1,417 hours of high-quality speech data, supporting 11 Indian languages.

Speech Synthesis

Safetensors

Other#Multilingual Speech Synthesis #Prosody Cloning #Support for Indian Languages

Downloads 6,595

Release Time : 3/11/2025

Model Overview

IndicF5 is a high-quality text-to-speech system specifically designed for Indian languages, capable of generating synthetic audio that closely resembles human speech.

Model Features

Multilingual Support

Supports 11 Indian languages, covering major Indian language families.

High-Quality Speech Synthesis

Trained on 1,417 hours of high-quality speech data to generate synthetic audio that closely resembles human speech.

Prosody Control

Controls the prosody and speaker characteristics of generated speech through reference prompt audio.

Model Capabilities

Text-to-Speech

Multilingual Speech Synthesis

Prosody Control

Use Cases

Voice Assistants

Multilingual Voice Assistant

Develop voice assistants supporting multiple local languages for the Indian region.

Provides a natural and smooth multilingual voice interaction experience.

Education

Language Learning Tool

Provides pronunciation demonstrations for learners of Indian languages.

Helps learners master correct pronunciation and intonation.

🚀 IndicF5: High-Quality Text-to-Speech for Indian Languages

IndicF5 is a near - human polyglot Text - to - Speech (TTS) model that offers high - quality speech synthesis for multiple Indian languages.

Datasets

ai4bharat/indicvoices_r
ai4bharat/Rasa

Supported Languages

as (Assamese)
bn (Bengali)
gu (Gujarati)
mr (Marathi)
hi (Hindi)
kn (Kannada)
ml (Malayalam)
or (Odia)
pa (Punjabi)
ta (Tamil)
te (Telugu)

Pipeline Tag

text - to - speech

We release IndicF5, a near - human polyglot Text - to - Speech (TTS) model trained on 1417 hours of high - quality speech from Rasa, IndicTTS, LIMMITS, and IndicVoices - R.

IndicF5 supports 11 Indian languages:
Assamese, Bengali, Gujarati, Hindi, Kannada, Malayalam, Marathi, Odia, Punjabi, Tamil, Telugu.

🚀 Quick Start

📦 Installation

conda create -n indicf5 python=3.10 -y
conda activate indicf5
pip install git+https://github.com/ai4bharat/IndicF5.git

💻 Usage Examples

Basic Usage

To generate speech, you need to provide three inputs:

Text to synthesize – The content you want the model to speak.
A reference prompt audio – An example speech clip that guides the model’s prosody and speaker characteristics.
Text spoken in the reference prompt audio – The transcript of the reference prompt audio.

from transformers import AutoModel
import numpy as np
import soundfile as sf

# Load IndicF5 from Hugging Face
repo_id = "ai4bharat/IndicF5"
model = AutoModel.from_pretrained(repo_id, trust_remote_code=True)

# Generate speech
audio = model(
    "नमस्ते! संगीत की तरह जीवन भी खूबसूरत होता है, बस इसे सही ताल में जीना आना चाहिए.",
    ref_audio_path="prompts/PAN_F_HAPPY_00001.wav",
    ref_text="ਭਹੰਪੀ ਵਿੱਚ ਸਮਾਰਕਾਂ ਦੇ ਭਵਨ ਨਿਰਮਾਣ ਕਲਾ ਦੇ ਵੇਰਵੇ ਗੁੰਝਲਦਾਰ ਅਤੇ ਹੈਰਾਨ ਕਰਨ ਵਾਲੇ ਹਨ, ਜੋ ਮੈਨੂੰ ਖੁਸ਼ ਕਰਦੇ  ਹਨ।"
)

# Normalize and save output
if audio.dtype == np.int16:
    audio = audio.astype(np.float32) / 32768.0
sf.write("namaste.wav", np.array(audio, dtype=np.float32), samplerate=24000)
print("Audio saved succesfully.")

You can find example prompt audios used here.

📚 Documentation

Terms of Use

⚠️ Important Note

By using this model, you agree to only clone voices for which you have explicit permission. Unauthorized voice cloning is strictly prohibited. Any misuse of this model is the responsibility of the user.

References

We would like to extend our gratitude to the authors of [F5 - TTS](https://github.com/SWivid/F5 - TTS) for their invaluable contributions and inspiration to this work. Their efforts have played a crucial role in advancing the field of text - to - speech synthesis.

📖 Citation

If you use IndicF5 in your research or projects, please consider citing it:

🔹 BibTeX

@misc{AI4Bharat_IndicF5_2025,
  author       = {Praveen S V and Srija Anand and Soma Siddhartha and Mitesh M. Khapra},
  title        = {IndicF5: High - Quality Text - to - Speech for Indian Languages},
  year         = {2025},
  url          = {https://github.com/AI4Bharat/IndicF5},
}

Featured Recommended AI Models

Empowering the Future, Your AI Solution Knowledge Base

English 简体中文繁體中文にほんご