
Vits Cmn

Developed by BricksDisplay
VITS is an end-to-end text-to-speech model based on adversarial learning and a conditional variational autoencoder; this checkpoint supports Chinese speech synthesis.
Release Time: 1/10/2024

Model Overview

This model adopts a conditional variational autoencoder architecture that predicts speech waveforms directly from input text sequences, and it supports 44 different speakers.
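A minimal usage sketch of the text-to-waveform flow described above, assuming the checkpoint is published in a format compatible with the Hugging Face `transformers` VITS classes. The repository id `BricksDisplay/vits-cmn` is a guess based on this card, not a confirmed name; substitute the actual one.

```python
# Sketch only: the repo id below is an assumption, not confirmed by this card.
N_SPEAKERS = 44  # number of speakers stated in the model overview


def check_speaker(speaker_id: int) -> int:
    """Validate a speaker id against the 44 speakers this model supports."""
    if not 0 <= speaker_id < N_SPEAKERS:
        raise ValueError(f"speaker_id must be in [0, {N_SPEAKERS - 1}]")
    return speaker_id


def synthesize(text: str, speaker_id: int = 0):
    """Run text -> waveform with transformers' VITS classes (network required)."""
    import torch
    from transformers import AutoTokenizer, VitsModel  # transformers >= 4.33

    model = VitsModel.from_pretrained("BricksDisplay/vits-cmn")      # hypothetical id
    tokenizer = AutoTokenizer.from_pretrained("BricksDisplay/vits-cmn")
    inputs = tokenizer(text, return_tensors="pt")
    with torch.no_grad():
        # Multi-speaker VITS checkpoints take a speaker_id to select the voice.
        output = model(**inputs, speaker_id=check_speaker(speaker_id))
    # Waveform tensor of shape (batch, samples) at model.config.sampling_rate.
    return output.waveform
```

The waveform can then be written to disk with any audio library that accepts raw float samples plus a sampling rate.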

Model Features

End-to-end speech synthesis
Generates speech waveforms directly from text, without a separate acoustic-model and vocoder pipeline.
Multi-speaker support
Supports speech synthesis for 44 different speakers.
Adversarial training
Uses an adversarial training strategy to improve speech quality and naturalness.
Chinese optimization
Specifically optimized for the characteristics of Chinese speech, and accepts pinyin input.
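To illustrate the adversarial training mentioned above, here is a sketch of the least-squares GAN losses used in the original VITS paper (this card does not state the exact loss, so LS-GAN is an assumption): the discriminator scores real and generated audio, and the generator is trained to make its output score as real.

```python
# Hedged sketch of LS-GAN losses (the loss used by the original VITS paper;
# assumed, not confirmed by this card). Scores are discriminator outputs.

def discriminator_loss(real_scores, fake_scores):
    """Push real-audio scores toward 1 and generated-audio scores toward 0."""
    real_term = sum((1.0 - r) ** 2 for r in real_scores) / len(real_scores)
    fake_term = sum(f ** 2 for f in fake_scores) / len(fake_scores)
    return real_term + fake_term


def generator_loss(fake_scores):
    """The generator tries to make the discriminator score its output as real (1)."""
    return sum((1.0 - f) ** 2 for f in fake_scores) / len(fake_scores)
```

In full VITS training these adversarial terms are combined with reconstruction and KL-divergence losses from the conditional variational autoencoder.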

Model Capabilities

Chinese text-to-speech
Multi-speaker speech synthesis
High-quality speech generation

Use Cases

Voice interaction
Smart voice assistant
Provides natural Chinese speech output capabilities for smart devices.
Generates natural and fluent Chinese speech
Accessibility applications
Text-to-speech
Provides text-to-speech functionality for visually impaired users.
High-quality Chinese speech output
Multimedia production
Video dubbing
Automatically generates Chinese dubbing for video content.
Multiple speaker choices, natural speech effects