Erax Smile UnixSex F5

Developed by erax-ai
Vietnamese text-to-speech model based on F5-TTS architecture, supporting neutral-style voice cloning
Downloads 120
Release Time: 4/18/2025

Model Overview

This Vietnamese text-to-speech model is based on the F5-TTS architecture and was fine-tuned on more than 2,700,000 Vietnamese samples. It supports zero-shot voice cloning and neutral-style voice cloning.

Model Features

Vietnamese Support
Optimized specifically for Vietnamese and fine-tuned on more than 2,700,000 Vietnamese samples
Voice Cloning
Supports zero-shot voice cloning: given a reference audio clip, it generates speech in a similar voice
Multi-style Support
Supports female, male, and neutral-style voice generation
Open-source Code
Provides a complete open-source implementation for research and further development
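
The zero-shot cloning feature above takes a reference clip, its transcript, and the text to speak. The page does not show how to invoke the model, so the following is only a minimal sketch: it assembles an argument list for the upstream F5-TTS command-line tool (`f5-tts_infer-cli`) without running it. The flag names follow the upstream F5-TTS CLI as commonly documented and should be checked against the installed version. The function name and the checkpoint, vocabulary, and audio file names are hypothetical placeholders, and it is an assumption that this model's checkpoint loads with the stock CLI.

```python
import shlex


def build_clone_command(ref_audio, ref_text, gen_text,
                        ckpt_file, vocab_file, output_dir="outputs"):
    """Build (but do not run) an argv list for the upstream F5-TTS CLI.

    ref_audio: path to a short reference clip of the voice to clone
    ref_text:  transcript of the reference clip
    gen_text:  Vietnamese text to synthesize in the cloned voice
    ckpt_file / vocab_file: hypothetical local paths to the model files
    """
    if not gen_text.strip():
        raise ValueError("gen_text must not be empty")
    return [
        "f5-tts_infer-cli",
        "--ref_audio", str(ref_audio),
        "--ref_text", ref_text,
        "--gen_text", gen_text,
        "--ckpt_file", str(ckpt_file),
        "--vocab_file", str(vocab_file),
        "--output_dir", str(output_dir),
    ]


if __name__ == "__main__":
    # All file names below are placeholders, not files shipped with the model.
    cmd = build_clone_command(
        ref_audio="reference.wav",
        ref_text="Xin chào, tôi là trợ lý ảo.",
        gen_text="Hôm nay trời đẹp.",
        ckpt_file="model.safetensors",
        vocab_file="vocab.txt",
    )
    # Print a shell-quoted command line for inspection.
    print(shlex.join(cmd))
```

Building the argument list separately from execution lets the command be reviewed before it is passed to something like `subprocess.run`.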

Model Capabilities

Vietnamese Text-to-Speech
Voice Style Cloning
Neutral Voice Generation
Multi-style Voice Synthesis

Use Cases

Voice Synthesis
News Broadcasting
Generates natural and fluent Vietnamese news broadcast voices
Refer to the audio samples provided on the model page
Audiobooks
Generates narration voices for Vietnamese e-books
Voice Cloning
Personalized Voice Assistant
Clones a specific person's voice to build a personalized voice assistant