Orpheus Bangla Emotional TTS Open-Source Model - Free Multilingual Emotion Style Reading of Bangla Texts

Home

Orpheus Bangla Emotional Tts

Developed by ehzawad

Bengali text-to-speech model based on Orpheus architecture, supporting multiple emotional styles

Speech Synthesis

Safetensors

OtherOpen Source License:Apache-2.0 #Bengali TTS #Multi-emotional speech synthesis #24kHz high quality

Downloads 26

Release Time : 4/13/2025

Model Overview

This model can generate natural and fluent Bengali speech from text input, with special support for multi-emotional speech synthesis.

Model Features

Multi-emotional speech synthesis

Supports 18 different emotional styles including happiness, sadness, anger, etc.

High-quality audio

Generates natural and fluent speech with 24kHz sampling rate

Professional architecture

Based on Orpheus TTS architecture and SNAC decoder to ensure speech quality

Model Capabilities

Bengali text-to-speech

Emotional speech synthesis

High-quality audio generation

Use Cases

Voice interaction applications

Smart assistants

Provides emotional voice output for Bengali smart assistants

Enhances user experience and interaction

Audiobooks

Automatically generates emotional audiobook content

Enhances storytelling expressiveness

Assistive technology

Visual impairment assistance

Provides more natural voice reading experience for visually impaired people

Improves information accessibility

🚀 Orpheus Bangla Emotional TTS Model

This is a Text-to-Speech model for the Bangla (Bengali) language. Based on the Orpheus architecture, it can generate natural-sounding Bangla speech from text input with various emotional styles.

🚀 Quick Start

This is a Text-to-Speech model for Bangla (Bengali) language based on the Orpheus architecture. The model can generate natural-sounding Bangla speech from text input with various emotional styles.

✨ Features

Generate natural - sounding Bangla speech with various emotional styles.
Support multiple emotional tones such as happy, sad, angry, etc.

📦 Installation

No specific installation steps provided in the original document, so this section is skipped.

💻 Usage Examples

Basic Usage

from transformers import AutoModelForCausalLM, AutoTokenizer
from snac import SNAC
import torch
import soundfile as sf

# Load the models
tokenizer = AutoTokenizer.from_pretrained("ehzawad/orpheus-bangla-emotional-tts")
model = AutoModelForCausalLM.from_pretrained(
    "ehzawad/orpheus-bangla-emotional-tts", 
    torch_dtype=torch.bfloat16,
    device_map="auto"
)
snac_model = SNAC.from_pretrained("hubertsiuzdak/snac_24khz").to("cuda")

# Sample prompt in Bangla
prompt = "আপনি কেমন আছেন?"  # "How are you?" in Bangla

# For emotional speech, add the emotion tag (e.g., for happy)
emotional_prompt = "<happy>আপনি কেমন আছেন?</happy>"

# Add your inference code here
# (Follow the example_usage.py code for complete inference)

Advanced Usage

No advanced usage code provided in the original document, so this part is skipped.

📚 Documentation

🎮 Demo

Try out the model directly using our Hugging Face Space:

Model Details

Property	Details
Model Type	Text - to - Speech
Language	Bangla (Bengali)
Architecture	Orpheus TTS + SNAC decoder
Sample Rate	24kHz
Special Features	Emotional speech synthesis
Emotions	Various emotional tones including happy, sad, angry, disgusted, frustrated, excited, curious, surprised, etc.

Supported Emotions

This model supports the following emotional styles:

happy
normal
disgust
sad
frustrated
slow
excited
whisper
panicky
curious
surprise
fast
crying
deep
sleepy
angry
high
shout

🔧 Technical Details

No specific technical implementation details provided in the original document, so this section is skipped.

📄 License

This model is released under the apache - 2.0 license.

📚 Citation

If you use this model, please cite:

@misc{ehzawad2025orpheusbangla,
  author = {Ehzawad},
  title = {Orpheus Bangla Emotional TTS},
  year = {2025},
  publisher = {HuggingFace},
  howpublished = {\url{https://huggingface.co/ehzawad/orpheus-bangla-emotional-tts}}
}

👏 Acknowledgements

This model is based on the Orpheus TTS architecture developed by Canopy Labs. We extend our gratitude to the original authors for their work.

Featured Recommended AI Models

Empowering the Future, Your AI Solution Knowledge Base

English 简体中文繁體中文にほんご