🚀 Orpheus Bangla Emotional TTS Model
This is a Text-to-Speech model for the Bangla (Bengali) language. Based on the Orpheus architecture, it can generate natural-sounding Bangla speech from text input with various emotional styles.
🚀 Quick Start
This is a Text-to-Speech model for Bangla (Bengali) language based on the Orpheus architecture. The model can generate natural-sounding Bangla speech from text input with various emotional styles.
✨ Features
- Generate natural - sounding Bangla speech with various emotional styles.
- Support multiple emotional tones such as happy, sad, angry, etc.
📦 Installation
No specific installation steps provided in the original document, so this section is skipped.
💻 Usage Examples
Basic Usage
from transformers import AutoModelForCausalLM, AutoTokenizer
from snac import SNAC
import torch
import soundfile as sf
tokenizer = AutoTokenizer.from_pretrained("ehzawad/orpheus-bangla-emotional-tts")
model = AutoModelForCausalLM.from_pretrained(
"ehzawad/orpheus-bangla-emotional-tts",
torch_dtype=torch.bfloat16,
device_map="auto"
)
snac_model = SNAC.from_pretrained("hubertsiuzdak/snac_24khz").to("cuda")
prompt = "আপনি কেমন আছেন?"
emotional_prompt = "<happy>আপনি কেমন আছেন?</happy>"
Advanced Usage
No advanced usage code provided in the original document, so this part is skipped.
📚 Documentation
🎮 Demo
Try out the model directly using our Hugging Face Space:
Model Details
Property |
Details |
Model Type |
Text - to - Speech |
Language |
Bangla (Bengali) |
Architecture |
Orpheus TTS + SNAC decoder |
Sample Rate |
24kHz |
Special Features |
Emotional speech synthesis |
Emotions |
Various emotional tones including happy, sad, angry, disgusted, frustrated, excited, curious, surprised, etc. |
Supported Emotions
This model supports the following emotional styles:
- happy
- normal
- disgust
- sad
- frustrated
- slow
- excited
- whisper
- panicky
- curious
- surprise
- fast
- crying
- deep
- sleepy
- angry
- high
- shout
🔧 Technical Details
No specific technical implementation details provided in the original document, so this section is skipped.
📄 License
This model is released under the apache - 2.0
license.
📚 Citation
If you use this model, please cite:
@misc{ehzawad2025orpheusbangla,
author = {Ehzawad},
title = {Orpheus Bangla Emotional TTS},
year = {2025},
publisher = {HuggingFace},
howpublished = {\url{https://huggingface.co/ehzawad/orpheus-bangla-emotional-tts}}
}
👏 Acknowledgements
This model is based on the Orpheus TTS architecture developed by Canopy Labs. We extend our gratitude to the original authors for their work.