Fish Speech V1.5 Open-Source Text-to-Speech Model - Trained with Millions of Hours of Multilingual Data

F15

Developed by cocktailpeanut

Fish Speech V1.5 is a leading text-to-speech (TTS) model trained on over 1 million hours of multilingual audio data.

Downloads 5,162

Release Time : 12/4/2024

Model Overview

Advanced multilingual text-to-speech synthesis system, supporting speech synthesis in 13 languages.

Multilingual Support

Supports text-to-speech in 13 languages, including major Asian and European languages

Large-scale Training Data

Trained on over 1 million hours of multilingual audio data, with over 300,000 hours each for English and Chinese

Academic Research Support

Supported by formally published academic papers on model technology

Text-to-Speech

Multilingual speech synthesis

High-quality voice output

Content Creation

Audiobook Production

Convert text content into natural speech for audiobook production

High-quality multilingual voice output

Video Dubbing

Automatically generate dubbing for video content

Supports dubbing in multiple languages

Assistive Technology

Visual Impairment Assistance

Convert text information into speech output to assist visually impaired individuals

Multilingual support expands usage scope

Featured Recommended AI Models

Empowering the Future, Your AI Solution Knowledge Base