Fish Speech V1.5
Fish Speech V1.5 is a leading text-to-speech (TTS) model, trained on over 1 million hours of multilingual audio data.
Tags: Speech Synthesis, Supports Multiple Languages, Multilingual TTS, Million-hour-level training, Academic research friendly
Downloads: 98
Release date: 2/27/2025
Model Overview
Fish Speech V1.5 is a multilingual text-to-speech system that supports 13 languages and is specially optimized for Chinese and English.
Model Features
Multilingual support
Supports text-to-speech in 13 languages, with special optimizations for Chinese and English speech synthesis.
Large-scale training data
Trained on over 1 million hours of multilingual audio data, with over 300,000 hours each for Chinese and English.
Academic research support
Related research papers are available on arXiv and can be cited in academic work.
Model Capabilities
Text-to-speech
Multilingual speech synthesis
High-quality speech output
Use Cases
Speech synthesis applications
Voice assistants
Provides natural speech output for smart devices
More natural multilingual speech experience
Audiobooks
Converts text content into speech
High-quality multilingual audio content
Educational applications
Pronunciation assistance for language learning apps
Accurate pronunciation examples