P

Parler Tts Mini V1 Paraspeechcaps Only Base

Developed by ajd12342
A text-to-speech model capable of controlling rich speech styles through textual style prompts
Downloads 17
Release Time : 2/28/2025

Model Overview

This model is a fine-tuned text-to-speech model based on the ParaSpeechCaps-Base dataset, capable of controlling speech features such as pitch, rhythm, clarity, and emotion through style prompts.

Model Features

Rich Style Control
Precisely control speech features such as pitch, rhythm, clarity, and emotion through text prompts
High-Quality Speech Generation
Fine-tuned on a human-annotated dataset, generating high-quality speech
Diverse Style Labels
Supports 59 style labels, covering speaker-inherent styles and contextual sentence styles

Model Capabilities

Text-to-Speech
Speech Style Control
Emotional Speech Synthesis

Use Cases

Speech Synthesis Applications
Audiobook Generation
Generate expressive audiobooks based on text content and emotional prompts
Voice Assistants
Provide more natural and emotionally rich voice output for voice assistants
Assistive Technologies
Visual Impairment Assistance
Provide more natural and comprehensible voice output for visually impaired users
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase