Fastspeech2 En 200 Speaker Cv4

Developed by facebook
An English text-to-speech model based on the FastSpeech 2 architecture, supporting 200 different voices, trained on the Common Voice v4 dataset.
Downloads 37
Release Time: 3/2/2022

Model Overview

This is a multi-speaker text-to-speech model capable of converting English text into natural speech, supporting 200 different male and female voices.

Model Features

Multi-speaker support
The model supports 200 different male and female voices, allowing random speaker selection during use.
High-quality speech synthesis
Based on the FastSpeech 2 architecture, it can generate natural and fluent speech output.
Large-scale dataset training
Trained on the Common Voice v4 dataset, a crowd-sourced speech corpus whose range of contributors supports the model's ability to generalize across voices.
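
The random speaker selection mentioned above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: it assumes voices are addressed by zero-based integer IDs from 0 to 199, and the helper name pick_speaker is hypothetical. How the chosen ID is handed to the model depends on the toolkit used to load it and is not shown here.

```python
import random
from typing import Optional

NUM_SPEAKERS = 200  # voice count stated on this page


def pick_speaker(seed: Optional[int] = None, num_speakers: int = NUM_SPEAKERS) -> int:
    """Return a speaker index in [0, num_speakers).

    Assumption: speakers are identified by zero-based integer IDs.
    Passing a seed makes the choice reproducible across runs.
    """
    if num_speakers <= 0:
        raise ValueError("num_speakers must be positive")
    rng = random.Random(seed)  # private generator, does not touch global state
    return rng.randrange(num_speakers)


if __name__ == "__main__":
    print("random voice:", pick_speaker())
    print("reproducible voice:", pick_speaker(seed=7))
```

Fixing the seed is useful when the same voice should be reused across several utterances; leaving it unset gives a different voice on each call.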

Model Capabilities

English text-to-speech
Multi-speaker speech synthesis

Use Cases

Speech synthesis applications
Voice assistants
Provides natural multi-voice speech output for voice assistant systems.
Generates natural and fluent speech responses
Audiobooks
Automatically converts text content into audiobooks with multiple voices.
Supports reading in 200 different voices
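
For the multi-voice audiobook case, one possible approach is to give each character a stable voice so that the same character always sounds the same. The sketch below is an assumption-laden illustration, not part of the model: the functions voice_for and assign_voices are hypothetical, speaker IDs are assumed to be integers from 0 to 199, and the actual synthesis call is left out.

```python
import hashlib
from typing import List, Tuple

NUM_SPEAKERS = 200  # voice count stated on this page


def voice_for(character: str, num_speakers: int = NUM_SPEAKERS) -> int:
    """Map a character name to a stable speaker ID (assumed zero-based)."""
    key = character.strip().lower().encode("utf-8")
    digest = hashlib.sha256(key).digest()
    # Use the first 4 bytes of the hash as an integer, then wrap to the voice range.
    return int.from_bytes(digest[:4], "big") % num_speakers


def assign_voices(script: List[Tuple[str, str]]) -> List[Tuple[int, str]]:
    """Turn (character, line) pairs into (speaker_id, line) pairs."""
    return [(voice_for(character), line) for character, line in script]


if __name__ == "__main__":
    script = [
        ("Narrator", "It was a quiet morning."),
        ("Alice", "Did you hear that?"),
        ("Narrator", "Nobody answered."),
    ]
    for speaker_id, line in assign_voices(script):
        print(speaker_id, line)
```

Hashing keeps the mapping consistent between runs without storing a lookup table, though two characters can occasionally land on the same voice; an explicit character-to-voice table avoids that when the cast is known in advance.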