K

Kokoro 82M

Developed by hexgrad
Kokoro is an open-source text-to-speech (TTS) model with 82 million parameters, renowned for its lightweight architecture and high audio quality, while also being fast and cost-effective.
Downloads 2.0M
Release Time : 12/26/2024

Model Overview

Kokoro is an Apache-licensed text-to-speech model capable of generating high-quality speech output, suitable for various scenarios from production environments to personal projects.

Model Features

Lightweight architecture
Despite its smaller parameter size, it delivers audio quality comparable to larger models.
Cost efficiency
Less than $1 per million characters of text input and under $0.06 per hour of audio output.
Multilingual support
Supports 8 languages and 54 voices, suitable for diverse application scenarios.
Open-source license
Licensed under Apache, allowing free deployment in commercial and personal projects.

Model Capabilities

Text-to-speech
Multilingual speech synthesis
Efficient audio generation

Use Cases

Commercial applications
Voice assistants
Provides high-quality speech output for commercial applications.
Efficient and low-cost speech synthesis solution.
Audiobooks
Generates natural and fluent audiobook content.
High-quality multilingual speech output.
Personal projects
Personal voice assistants
Offers customized speech output for personal projects.
Lightweight and easy-to-deploy solution.
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase