Tgiangvoice
Spark-TTS is an advanced text-to-speech system that leverages the powerful capabilities of large language models (LLMs) to achieve highly accurate and naturally fluent speech synthesis.
Downloads 16
Release Time : 4/19/2025
Model Overview
This system is designed for efficiency, flexibility, and robust performance, suitable for both research and production purposes. The model is trained on the viVoice Vietnamese dataset.
Model Features
High-quality speech synthesis
Utilizes large language models to achieve highly accurate and naturally fluent speech synthesis
Efficient and flexible
Designed for efficiency and flexibility, suitable for both research and production purposes
Vietnamese language support
Speech synthesis model specifically optimized for Vietnamese
Model Capabilities
Vietnamese text-to-speech
Voice cloning
Speech synthesis
Use Cases
Speech applications
Voice assistants
Provides natural speech output for Vietnamese voice assistants
Generates naturally fluent Vietnamese speech
Audiobooks
Converts Vietnamese text into audiobooks
High-quality speech output
Voice cloning
Clones specific voices based on a few samples
Generates output similar to the reference voice
Featured Recommended AI Models