F5 TTS Vietnamese 100h
F
F5 TTS Vietnamese 100h
Developed by hynt
A compact version fine-tuned based on F5-TTS, trained with 150 hours of Vietnamese speech data, for research purposes only.
Downloads 123
Release Time : 3/23/2025
Model Overview
This is a text-to-speech (TTS) model optimized for Vietnamese, fine-tuned based on the F5-TTS architecture, suitable for Vietnamese speech synthesis tasks.
Model Features
High-quality Vietnamese speech synthesis
Trained with 150 hours of carefully selected Vietnamese speech data, providing high-quality speech synthesis results.
Strict data processing
Used demucs to remove background music, filtered audio shorter than 1 second or longer than 30 seconds to ensure data quality.
Academic collaboration datasets
Includes VLSP series datasets and 50 hours of high-quality annotated data provided by UEH University.
Model Capabilities
Vietnamese text-to-speech
Speech synthesis
Voice cloning (via reference audio)
Use Cases
Academic research
Vietnamese speech synthesis research
Used for research and experiments in speech synthesis technology.
Educational applications
Vietnamese learning assistance
Provides pronunciation references for Vietnamese learners.
Featured Recommended AI Models