P

Parler Tts Mini V1 Paraspeechcaps

Developed by ajd12342
A fine-tuned text-to-speech model based on Parler-TTS Mini v1, supporting voice output control via style prompts
Downloads 139
Release Time : 2/27/2025

Model Overview

This model is fine-tuned on the ParaSpeechCaps dataset and can generate richly styled speech outputs through text style prompts (such as pitch, rhythm, clarity, emotion, etc.).

Model Features

Style Control
Supports precise control of voice output style features (such as pitch, rhythm, emotion, etc.) through text prompts
Large-Scale Style Annotation
Trained on the ParaSpeechCaps dataset, which includes rich annotations for 59 style labels
Multimodal Training
Novel training pipeline combining text and speech encoders, classifiers, and audio language models

Model Capabilities

Text-to-Speech
Speech Style Control
Multi-Style Speech Generation

Use Cases

Speech Synthesis
Emotional Speech Generation
Generates speech with specific emotions based on text prompts
Can produce speech outputs with different emotions such as sadness, happiness, etc.
Stylized Voice Creation
Creates voices with specific styles for films, games, etc.
Can control parameters like speech rate and clarity to generate professional-grade voices
Assistive Technology
Accessible Speech Synthesis
Provides customizable voice outputs for visually impaired users
Can adjust voice features according to user preferences
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase