F5-TTS-Arabic Open-source Speech Synthesis Model - High-quality Arabic Speech Generation with Support for Multi-regional Pronunciation and Accents

F5 TTS Arabic

Developed by IbrahimSalah

A high-quality Arabic speech synthesis model fine-tuned from F5-TTS, supporting diverse pronunciations and accents from different regions.

Speech Synthesis | Supports Multiple Languages | #Arabic Speech Synthesis #Multi-dialect Support #Continuous Fine-tuning

Downloads 104

Release Time: 2/12/2025

Model Overview

This project fine-tunes the F5-TTS model, focusing on Arabic speech synthesis to provide natural and fluent voice output. The model is still under continuous fine-tuning, with temporary checkpoints provided as progress updates.

Model Features

Multi-dialect Support

Covers diverse Arabic pronunciations and accents from different regions

Continuous Improvement

The model is still under continuous fine-tuning, with future versions offering higher-quality speech synthesis

High-Quality Dataset

Trained on MBZUAI/ClArTTS and Common Voice datasets

Model Capabilities

Arabic Text-to-Speech

Supports multiple Arabic accents

Generates natural and fluent speech

Use Cases

Speech Synthesis Applications

Voice Assistants

Provides natural voice interaction for Arabic-speaking users

Includes sample audio to demonstrate synthesis effects

Audiobooks

Converts Arabic text to speech for audiobook production

🚀 F5-TTS: Fine-Tuned Arabic Speech Synthesis Model

This project fine-tunes the F5-TTS model for high-quality Arabic speech synthesis, addressing regional pronunciation and accent diversity.

🚀 Quick Start

To use the fine-tuned Arabic model, follow these steps:

Usage

GitHub Repository: Follow the F5-TTS setup instructions, but replace the default model with the Arabic checkpoint and vocabulary files provided here.
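
As a rough illustration of the step above, the sketch below builds a command line for the `f5-tts_infer-cli` tool that ships with F5-TTS, pointing it at a custom checkpoint and vocabulary file. The file names, the reference clip, and the `F5TTS_Base` model name are assumptions for illustration only and are not taken from this model card; flag names can differ between F5-TTS versions, so check `f5-tts_infer-cli --help` on your installation before running anything.

```python
import shlex


def build_infer_command(ckpt_file, vocab_file, ref_audio, ref_text, gen_text,
                        model="F5TTS_Base"):
    """Return the argument list for one f5-tts_infer-cli call.

    The default model name is an assumption; use whatever base
    configuration your installed F5-TTS version expects.
    """
    return [
        "f5-tts_infer-cli",
        "--model", model,
        "--ckpt_file", ckpt_file,    # Arabic checkpoint instead of the default
        "--vocab_file", vocab_file,  # Arabic vocabulary instead of the default
        "--ref_audio", ref_audio,
        "--ref_text", ref_text,
        "--gen_text", gen_text,
    ]


cmd = build_infer_command(
    ckpt_file="checkpoints/model_380000.pt",  # hypothetical file name
    vocab_file="checkpoints/vocab.txt",       # hypothetical file name
    ref_audio="reference.wav",                # your own clean reference clip
    ref_text="transcript of reference.wav",   # placeholder transcript
    gen_text="كان هناك الكثير من التحديات للوصول إلى الدقه المطلوبة.",
)

# Print the command for inspection; run it with subprocess.run(cmd, check=True)
# once the paths point at real files.
print(shlex.join(cmd))
```

Building the command as a list rather than a single string keeps the Arabic text and any paths with spaces intact when the list is later passed to `subprocess.run`.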

✨ Features

This project fine-tunes the F5-TTS model for high-quality Arabic speech synthesis, incorporating regional diversity in pronunciation and accents.
The fine-tuning process is ongoing, and temporary checkpoints are provided as progress updates. Future iterations will include improved models with enhanced accuracy and naturalness.

📦 Installation

No specific installation steps are provided in the original README.

💻 Usage Examples

Sample sentences for now:

1- "لكن على ما يبدو ان هناك تصاعد غير مسبوق للاحداث."
2- "لذلك يجب علينا الإتحاد فى وجه كل الصدامات التى قد تؤثر علينا."
3- "كان هناك الكثير من التحديات للوصول إلى الدقه المطلوبة."

📚 Documentation

Update

Three checkpoints have been added to the repo. The 380000 checkpoint is the latest. More data is needed to get better results, so fine-tuning will be stopped until more data is obtained, and then it will resume.
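
Since several checkpoints sit side by side, a small helper can pick the one with the highest training step. This is only a sketch: the file-naming pattern (a step number directly before the extension) and every file name in the example list are assumptions, with only the 380000 step taken from the note above.

```python
import re


def latest_checkpoint(filenames):
    """Return the checkpoint name with the largest step number, or None."""
    best_name, best_step = None, -1
    for name in filenames:
        # Assumed pattern: digits immediately before a .pt or .safetensors suffix.
        match = re.search(r"(\d+)\.(?:pt|safetensors)$", name)
        if match and int(match.group(1)) > best_step:
            best_name, best_step = name, int(match.group(1))
    return best_name


# Hypothetical listing; only the 380000 step comes from the update note.
files = ["model_300000.pt", "model_350000.pt", "model_380000.pt", "vocab.txt"]
print(latest_checkpoint(files))  # model_380000.pt
```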

Model Information

Property	Details
Model Type	F5-TTS, fine-tuned for Arabic
Base Model	SWivid/F5-TTS
Current Status	Ongoing fine-tuning (Temporary Checkpoints Available)
Training Data	MBZUAI/ClArTTS, mozilla-foundation/common_voice_17_0
(Final training parameters will be updated upon completion of fine-tuning.)

Datasets

Training is based on the MBZUAI/ClArTTS dataset, so the model essentially supports Modern Standard Arabic (MSA).

🔧 Technical Details

No specific technical details are provided in the original README.

📄 License

This model is released under the CC BY-NC 4.0 license, which allows free use, modification, and distribution for non-commercial purposes, with attribution.

💡 Usage Tip

Use clear reference audio with minimal background noise.
Ensure balanced audio levels for improved synthesis quality.
Contributions in dataset expansion and model evaluation are highly valuable.
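
One way to act on the audio-level tip is to measure the peak and RMS level of a reference clip before using it. The sketch below is a minimal, standard-library-only example that assumes 16-bit PCM samples already loaded as integers; the function name and the clipping rule are illustrative choices, not part of F5-TTS, and the model card does not state any target levels.

```python
import math


def level_report(samples, full_scale=32768):
    """Return peak and RMS level in dBFS for 16-bit PCM samples."""
    if not samples:
        raise ValueError("no samples")
    peak = max(abs(s) for s in samples)
    rms = math.sqrt(sum(s * s for s in samples) / len(samples))

    def to_dbfs(value):
        # Digital silence has no finite level in decibels.
        return 20 * math.log10(value / full_scale) if value > 0 else float("-inf")

    return {
        "peak_dbfs": to_dbfs(peak),
        "rms_dbfs": to_dbfs(rms),
        # Samples at the edge of the 16-bit range suggest clipping.
        "clipped": peak >= full_scale - 1,
    }


# Synthetic one-second 440 Hz tone at half of full scale, sampled at 16 kHz.
tone = [int(16384 * math.sin(2 * math.pi * 440 * n / 16000)) for n in range(16000)]
report = level_report(tone)
print(f"peak {report['peak_dbfs']:.1f} dBFS, rms {report['rms_dbfs']:.1f} dBFS, "
      f"clipped={report['clipped']}")
```

For a real clip, the samples could be read with Python's `wave` and `array` modules; a reference with a very low RMS level or a clipped peak is a candidate for re-recording or gain adjustment.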

🤝 Contributions & Collaboration

This model is a work in progress, and community contributions are highly encouraged! Suggestions, improvements, and dataset contributions are welcome to refine its performance across different Arabic dialects.

Acknowledgment

This work was done using a machine at Zewail City of Science and Technology.

If you have any questions or suggestions, feel free to reach out! 🚀
