This project fine-tunes the F5-TTS model, focusing on Arabic speech synthesis to provide natural and fluent voice output. The model is still under continuous fine-tuning, with temporary checkpoints provided as progress updates.
Model Features
Multi-dialect Support
Covers diverse Arabic pronunciations and accents from different regions
Continuous Improvement
The model is still under continuous fine-tuning, with future versions offering higher-quality speech synthesis
High-Quality Dataset
Trained on MBZUAI/ClArTTS and Common Voice datasets
Model Capabilities
Arabic Text-to-Speech
Supports multiple Arabic accents
Generates natural and fluent speech
Use Cases
Speech Synthesis Applications
Voice Assistants
Provides natural voice interaction for Arabic-speaking users
Includes sample audio to demonstrate synthesis effects
Audiobooks
Converts Arabic text to speech for audiobook production
🚀 F5-TTS: Fine-Tuned Arabic Speech Synthesis Model
This project fine-tunes the F5-TTS model for high - quality Arabic speech synthesis, addressing regional pronunciation and accent diversity.
🚀 Quick Start
To use the fine-tuned Arabic model, follow these steps:
Usage
GitHub Repository: Follow the F5-TTS setup instructions, but replace the default model with the Arabic checkpoint and vocabulary files provided here.
✨ Features
This project fine-tunes the F5-TTS model for high-quality Arabic speech synthesis, incorporating regional diversity in pronunciation and accents.
The fine-tuning process is ongoing, and temporary checkpoints are provided as progress updates. Future iterations will include improved models with enhanced accuracy and naturalness.
📦 Installation
No specific installation steps are provided in the original README.
💻 Usage Examples
Samples for now
1- "لكن على ما يبدو ان هناك تصاعد غير مسبوق للاحداث."
2- "لذلك يجب علينا الإتحاد فى وجه كل الصدامات التى قد تؤثر علينا."
3- "كان هناك الكثير من التحديات للوصول إلى الدقه المطلوبة."
1-
2-
3-
📚 Documentation
Update
Three checkpoints have been added to the repo. The 380000 checkpoint is the latest. More data is needed to get better results, so fine-tuning will be stopped until more data is obtained, and then it will resume.
Overview
This project fine-tunes the F5-TTS model for high-quality Arabic speech synthesis, incorporating regional diversity in pronunciation and accents. The fine-tuning process is ongoing, and temporary checkpoints are provided as progress updates. Future iterations will include improved models with enhanced accuracy and naturalness.
(Final training parameters will be updated upon completion of fine-tuning.)
Datasets
Training is based on the MBZUAI/ClArTTS, so basically the model supports MSA.
🔧 Technical Details
No specific technical details are provided in the original README.
📄 License
This model is released under the CC BY-NC 4.0 license, which allows free usage, modification, and distribution for non-commercial purposes.
💡 Usage Tip
Use clear reference audio with minimal background noise.
Ensure balanced audio levels for improved synthesis quality.
Contributions in dataset expansion and model evaluation are highly valuable.
🤝 Contributions & Collaboration
This model is a work in progress, and community contributions are highly encouraged! Suggestions, improvements, and dataset contributions are welcome to refine its performance across different Arabic dialects.
Acknowledgment
This work is done using Zewail City of science and technology machine
If you have any questions or suggestions, feel free to reach out! 🚀