Fish Speech 1.5
Text-to-speech (TTS) model trained on over 1 million hours of multilingual audio data
Speech Synthesis
Tags: Safetensors, Supports Multiple Languages, Multilingual TTS, Million-hour Training, Non-commercial Use
Downloads 194
Release Date: 12/7/2024
Model Overview
Fish Speech 1.5 is a multilingual text-to-speech model that supports 13 languages. This release is optimized for compatibility with the Rust ecosystem.
Model Features
Multilingual Support
Supports 13 languages including Chinese, English, Japanese, and other major languages
Large-scale Training
Trained on over 1 million hours of multilingual audio data
Rust Ecosystem Compatibility
Specially optimized for the fish-speech.rs framework and Candle.rs
Secure Weight Format
Stores weights in the .safetensors format, which holds only tensor data and metadata and so avoids the arbitrary code execution risk of pickle-based checkpoints
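As a rough illustration of why this format is considered safer: a .safetensors file starts with an 8-byte little-endian length, followed by a JSON header describing each tensor, followed by the raw tensor bytes, so its metadata can be inspected without executing any code. The Python sketch below builds a tiny in-memory file and reads its header back using only the standard library. The tensor name demo.weight and both helper functions are hypothetical illustrations and are not part of Fish Speech or fish-speech.rs.

```python
import json
import struct


def build_demo_file():
    """Build a tiny in-memory .safetensors blob holding one F32 tensor.

    The tensor name "demo.weight" is a made-up example, not a real
    Fish Speech weight name.
    """
    raw = struct.pack("<2f", 1.0, 2.0)  # two float32 values, 8 bytes
    header = {
        "demo.weight": {
            "dtype": "F32",
            "shape": [2],
            # Offsets are relative to the start of the data section.
            "data_offsets": [0, len(raw)],
        }
    }
    header_bytes = json.dumps(header).encode("utf-8")
    # Layout: u64 little-endian header length, JSON header, raw tensor data.
    return struct.pack("<Q", len(header_bytes)) + header_bytes + raw


def read_header(blob):
    """Return (header_length, header_dict) without touching tensor data."""
    (header_len,) = struct.unpack("<Q", blob[:8])
    header = json.loads(blob[8:8 + header_len].decode("utf-8"))
    return header_len, header


blob = build_demo_file()
header_len, header = read_header(blob)
for name, info in header.items():
    if name == "__metadata__":  # optional free-form metadata entry
        continue
    print(name, info["dtype"], info["shape"], info["data_offsets"])
```

To inspect a real checkpoint the same way, read the first 8 bytes of the file, then read that many further bytes and parse them as JSON; the tensor data itself never needs to be loaded.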
Model Capabilities
High-quality text-to-speech
Multilingual speech synthesis
Supports speech output in all 13 supported languages
Use Cases
Speech Synthesis
Multilingual Voice Assistant
Provides natural voice output for multilingual applications
High-quality, natural-sounding speech synthesis
Audiobook Generation
Automatically converts text into audiobooks in multiple languages
Supports multiple languages and pronunciation styles