F15
Fish Speech V1.5 is a leading text-to-speech (TTS) model trained on over 1 million hours of multilingual audio data.
Speech Synthesis Supports Multiple Languages#Million-hour level training#13 language support#Academic-grade TTS
Downloads 5,162
Release Time : 12/4/2024
Model Overview
Advanced multilingual text-to-speech synthesis system, supporting speech synthesis in 13 languages.
Model Features
Multilingual Support
Supports text-to-speech in 13 languages, including major Asian and European languages
Large-scale Training Data
Trained on over 1 million hours of multilingual audio data, with over 300,000 hours each for English and Chinese
Academic Research Support
Supported by formally published academic papers on model technology
Model Capabilities
Text-to-Speech
Multilingual speech synthesis
High-quality voice output
Use Cases
Content Creation
Audiobook Production
Convert text content into natural speech for audiobook production
High-quality multilingual voice output
Video Dubbing
Automatically generate dubbing for video content
Supports dubbing in multiple languages
Assistive Technology
Visual Impairment Assistance
Convert text information into speech output to assist visually impaired individuals
Multilingual support expands usage scope
Featured Recommended AI Models