K

Kokoro 82M Light

Developed by ctranslate2-4you
A clone version based on StyleTTS2-LJSpeech, optimized for English text-to-speech tasks with reduced dependencies for simplified deployment.
Downloads 21
Release Time : 1/28/2025

Model Overview

This is a text-to-speech (TTS) model focused on generating high-quality English speech output. Compared to the original version, this repository removes certain dependencies to simplify installation and usage.

Model Features

Streamlined Dependencies
Removed munch and phonemizer dependencies, replaced with direct calls to espeak, significantly reducing dependency count
English Pronunciation Optimization
Added expand_acronym() function to improve pronunciation of specific terms (e.g., NASA)
Lightweight Deployment
Reduced approximately 80 dependencies compared to v1.0, simplifying deployment while maintaining 98% quality

Model Capabilities

English Text-to-Speech
British English Speech Synthesis
Acronym Pronunciation Optimization

Use Cases

Speech Synthesis
Audiobook Generation
Convert English text into natural speech for audiobook production
Generates near-human pronunciation speech output
Voice Assistants
Provide speech synthesis capabilities for English voice assistants
Fluid and natural English speech responses
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase