Vi SparkTTS 0.5B
Spark-TTS is a text-to-speech system that uses a large language model (LLM) to produce accurate, natural-sounding speech.
Downloads 3,804
Release Time: 3/31/2025
Model Overview
A high-quality text-to-speech system trained on the viVoice Vietnamese dataset. It is designed for both research and production use, with an emphasis on efficiency, flexibility, and robust functionality.
Model Features
High-quality speech synthesis
Uses a large language model to generate accurate, natural-sounding speech
Trained on a professional dataset
Trained on viVoice, a professional Vietnamese dataset
Dual-purpose for research and production
Designed for both research and production environments, combining efficiency and flexibility
Model Capabilities
Vietnamese text-to-speech
Voice cloning
Speech synthesis
Use Cases
Speech synthesis applications
Voice assistants
Provides natural voice output for Vietnamese voice assistants
Highly natural voice output
Audiobooks
Converts Vietnamese text into audiobooks
Smooth, natural-sounding narration