Fastspeech2 En 200 Speaker Cv4
An English text-to-speech model based on the FastSpeech 2 architecture, supporting 200 different voices, trained on the Common Voice v4 dataset.
Release Time: 3/2/2022
Model Overview
This is a multi-speaker text-to-speech model capable of converting English text into natural speech, supporting 200 different male and female voices.
Model Features
Multi-speaker support
The model supports 200 different male and female voices, allowing random speaker selection during use.
High-quality speech synthesis
Based on the FastSpeech 2 architecture, it can generate natural and fluent speech output.
Large-scale dataset training
Trained on the Common Voice v4 dataset, a crowd-sourced corpus whose variety of speakers and recording conditions helps the model generalize.
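The speaker selection described under Multi-speaker support can be sketched as follows. This is a minimal sketch, assuming voices are addressed by integer IDs from 0 to 199. The ID range, the constant `NUM_SPEAKERS`, and the helper `pick_speaker` are hypothetical names introduced for illustration, not part of the model's published interface.

```python
import random
from typing import Optional

# 200 voices as stated above; the 0-based ID range is an assumption.
NUM_SPEAKERS = 200


def pick_speaker(speaker: Optional[int] = None, seed: Optional[int] = None) -> int:
    """Return a valid speaker ID, drawing one at random when none is given."""
    if speaker is None:
        # A dedicated generator keeps the draw reproducible when a seed is passed.
        rng = random.Random(seed)
        return rng.randrange(NUM_SPEAKERS)
    if not 0 <= speaker < NUM_SPEAKERS:
        raise ValueError(
            f"speaker must be in [0, {NUM_SPEAKERS - 1}], got {speaker}"
        )
    return speaker
```

Calling `pick_speaker()` with no arguments mirrors random selection, while `pick_speaker(42)` pins a specific voice so that repeated requests sound the same.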
Model Capabilities
English text-to-speech
Multi-speaker speech synthesis
Use Cases
Speech synthesis applications
Voice assistants
Provides natural multi-voice speech output for voice assistant systems.
Generates natural and fluent speech responses
Audiobooks
Automatically converts text content into audiobooks with multiple voices.
Supports reading in 200 different voices
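For the audiobook use case, one way to use multiple voices is to give each character in a script its own speaker. The following is a minimal sketch under stated assumptions: speakers are integer IDs from 0 to 199, and the helper `assign_voices` and its input format (a list of character and line pairs) are hypothetical, introduced only for illustration. It plans which voice reads which line and does not perform synthesis itself.

```python
from typing import Dict, List, Tuple

# 200 voices as stated above; the 0-based ID range is an assumption.
NUM_SPEAKERS = 200


def assign_voices(script: List[Tuple[str, str]]) -> List[Tuple[int, str]]:
    """Give each distinct character a stable speaker ID, in order of first appearance."""
    voice_of: Dict[str, int] = {}
    plan: List[Tuple[int, str]] = []
    for character, line in script:
        if character not in voice_of:
            if len(voice_of) >= NUM_SPEAKERS:
                raise ValueError("more characters than available voices")
            # The next unused ID goes to the newly seen character.
            voice_of[character] = len(voice_of)
        plan.append((voice_of[character], line))
    return plan


script = [
    ("Narrator", "It was late."),
    ("Alice", "Who is there?"),
    ("Narrator", "Nobody answered."),
]
plan = assign_voices(script)
```

Each `(speaker_id, line)` pair in `plan` would then be passed to the synthesis step, so a character keeps the same voice throughout the book.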