Canary TTS 150M

Developed by 2121-8
Japanese text-to-speech (TTS) system built on llm-jp/llm-jp-3-150m-instruct3. Audio quality can be adjusted through prompts.
Downloads 36
Release Time: 4/22/2025

Model Overview

An experimental Japanese speech synthesis model that uses the Parler-TTS prompt architecture and the XCodec2 audio decoder. Pitch and background noise can be adjusted through control prompts.
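
As a minimal sketch of how the two prompts might be put together, the snippet below builds a Parler-TTS-style control prompt (a natural-language description of the voice) alongside the reading prompt (the Japanese text to be spoken). The names `TTSPrompts` and `build_prompts`, the field names, and the wording of the description are hypothetical illustrations, not the model's actual API or prompt template; the model's own documentation should be consulted for the real format.

```python
from dataclasses import dataclass


@dataclass
class TTSPrompts:
    control: str  # describes how the voice should sound (hypothetical wording)
    reading: str  # the Japanese text to synthesize


def build_prompts(text: str, pitch: str = "moderate", noise: str = "very clear") -> TTSPrompts:
    """Compose a control prompt and a reading prompt.

    `pitch` and `noise` are free-form descriptors; which values the real
    model responds to is an assumption here.
    """
    if not text.strip():
        raise ValueError("reading prompt must not be empty")
    control = f"A speaker with a {pitch} pitch. The recording is {noise}."
    return TTSPrompts(control=control, reading=text.strip())


prompts = build_prompts("こんにちは、今日はいい天気ですね。", pitch="high")
print(prompts.control)
print(prompts.reading)
```

Keeping the two prompts as separate fields mirrors the split described above: the control prompt shapes pitch and background noise, while the reading prompt carries only the text to be spoken.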

Model Features

Prompt Control
Audio quality can be fine-tuned by modifying the control prompt and the reading prompt
Lightweight Design
At 150M parameters, the model is suitable for deployment in resource-constrained environments
High-Quality Audio Output
Uses the XCodec2 audio decoder to maintain speech quality
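
To give a rough sense of what "lightweight" means here, the sketch below estimates the memory needed just to hold 150M parameters at common numeric precisions. It is back-of-the-envelope arithmetic under stated assumptions: exactly 150 million parameters, no activations or caches, and nothing for the XCodec2 decoder, which is a separate component whose size is not given here. Whether the model can actually be run at each precision is also not stated in the source.

```python
PARAMS = 150_000_000  # nominal parameter count taken from the model name

# Bytes per parameter for common storage formats.
BYTES_PER_PARAM = {"float32": 4, "float16": 2, "int8": 1}


def weight_memory_mb(params: int, bytes_per_param: int) -> float:
    """Return weight storage in megabytes (1 MB = 10**6 bytes)."""
    return params * bytes_per_param / 1e6


for dtype, size in BYTES_PER_PARAM.items():
    print(f"{dtype}: {weight_memory_mb(PARAMS, size):.0f} MB")
```

Under these assumptions the language-model weights alone come to roughly 600 MB in float32 and 300 MB in float16; real memory use at inference time will be higher.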

Model Capabilities

Japanese Speech Synthesis
Pitch Adjustment
Background Noise Control
Text-to-Speech

Use Cases

Voice Interaction
Virtual Assistant
Provides natural speech output for Japanese virtual assistants
Generates speech with emotional characteristics
Content Creation
Audio Content Generation
Automatically converts Japanese text to speech
Supports speech output with different tones and intonations