F

F5 TTS Vietnamese 100h

Developed by hynt
A compact version fine-tuned based on F5-TTS, trained with 150 hours of Vietnamese speech data, for research purposes only.
Downloads 123
Release Time : 3/23/2025

Model Overview

This is a text-to-speech (TTS) model optimized for Vietnamese, fine-tuned based on the F5-TTS architecture, suitable for Vietnamese speech synthesis tasks.

Model Features

High-quality Vietnamese speech synthesis
Trained with 150 hours of carefully selected Vietnamese speech data, providing high-quality speech synthesis results.
Strict data processing
Used demucs to remove background music, filtered audio shorter than 1 second or longer than 30 seconds to ensure data quality.
Academic collaboration datasets
Includes VLSP series datasets and 50 hours of high-quality annotated data provided by UEH University.

Model Capabilities

Vietnamese text-to-speech
Speech synthesis
Voice cloning (via reference audio)

Use Cases

Academic research
Vietnamese speech synthesis research
Used for research and experiments in speech synthesis technology.
Educational applications
Vietnamese learning assistance
Provides pronunciation references for Vietnamese learners.
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase