D

Dia 1.6B

Developed by nari-labs
Dia is a 1.6 billion parameter text-to-speech model developed by Nari Labs, capable of generating highly realistic conversations directly from text, supporting emotional and tonal control, and producing non-verbal communication content.
Downloads 80.28k
Release Time : 4/20/2025

Model Overview

Dia is an open-weight text-to-dialogue model that supports emotional and tonal control through audio-conditioned output and can generate non-verbal communication content such as laughter and coughing.

Model Features

Highly Realistic Dialogue Generation
Capable of generating highly realistic dialogues directly from text, supporting emotional and tonal control.
Non-verbal Communication Generation
Can generate non-verbal communication content such as laughter, coughing, and throat clearing.
Voice Cloning
Supports voice cloning functionality, allowing users to replicate voices by uploading audio samples.
Open Weights
The model weights are fully open-source, giving users complete control over scripts and speech.

Model Capabilities

Text-to-Speech
Emotional and Tonal Control
Non-verbal Communication Generation
Voice Cloning

Use Cases

Dialogue Generation
Dia Introduction
Generate dialogue content introducing the Dia model
Highly realistic dialogue effects
Emergency Scenarios
Generate dialogue content for emergency situations
Emotionally rich speech output
Voice Cloning
Custom Voice
Clone a specific voice by uploading audio
Generate speech resembling the cloned voice
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase