Muyan-TTS-SFT-Q8_0-GGUF Open-source Text-to-Speech Model - Free Deployment for Chinese Speech Synthesis

Muyan TTS SFT Q8 0 GGUF

Developed by NikolayKozloff

This model is a GGUF format text-to-speech model converted from MYZY-AI/Muyan-TTS-SFT, supporting Chinese speech synthesis.

Speech Synthesis #Chinese speech synthesis #Lightweight deployment #Llama.cpp optimization

Downloads 20

Release Time : 4/30/2025

Model Overview

Muyan-TTS-SFT-Q8_0-GGUF is a text-to-speech (TTS) model converted from the original model to GGUF format using the llama.cpp tool, suitable for Chinese speech synthesis tasks.

Model Features

GGUF format support

The model has been converted to GGUF format and can be efficiently run using the llama.cpp tool

Chinese speech synthesis

Optimized specifically for Chinese text speech synthesis

Model Capabilities

Text-to-speech

Chinese speech synthesis

Use Cases

Speech synthesis

Chinese speech broadcast

Convert Chinese text into natural speech output

🚀 NikolayKozloff/Muyan-TTS-SFT-Q8_0-GGUF

This project offers a model converted to the GGUF format. It solves the problem of compatibility and efficient usage of the text - to - speech model in the GGUF format. By converting the original model, it enables users to leverage the benefits of the GGUF format for text - to - speech tasks.

🚀 Quick Start

This model was converted to GGUF format from MYZY-AI/Muyan-TTS-SFT using llama.cpp via the ggml.ai's GGUF-my-repo space. Refer to the original model card for more details on the model.

✨ Features

Converted to GGUF format for better compatibility and performance.
Can be used with llama.cpp for text - to - speech tasks.

📦 Installation

Install llama.cpp through brew (works on Mac and Linux)

brew install llama.cpp

💻 Usage Examples

Basic Usage

CLI:

llama-cli --hf-repo NikolayKozloff/Muyan-TTS-SFT-Q8_0-GGUF --hf-file muyan-tts-sft-q8_0.gguf -p "The meaning to life and the universe is"

Server:

llama-server --hf-repo NikolayKozloff/Muyan-TTS-SFT-Q8_0-GGUF --hf-file muyan-tts-sft-q8_0.gguf -c 2048

Advanced Usage

You can also use this checkpoint directly through the usage steps listed in the Llama.cpp repo as well.

Step 1: Clone llama.cpp from GitHub.

git clone https://github.com/ggerganov/llama.cpp

Step 2: Move into the llama.cpp folder and build it with LLAMA_CURL = 1 flag along with other hardware - specific flags (for ex: LLAMA_CUDA = 1 for Nvidia GPUs on Linux).

cd llama.cpp && LLAMA_CURL = 1 make

Step 3: Run inference through the main binary.

./llama-cli --hf-repo NikolayKozloff/Muyan-TTS-SFT-Q8_0-GGUF --hf-file muyan-tts-sft-q8_0.gguf -p "The meaning to life and the universe is"

./llama-server --hf-repo NikolayKozloff/Muyan-TTS-SFT-Q8_0-GGUF --hf-file muyan-tts-sft-q8_0.gguf -c 2048

📚 Documentation

Property	Details
Model Type	Converted to GGUF format from `MYZY-AI/Muyan-TTS-SFT`
Training Data	Refer to the original model card

💡 Usage Tip

You can follow the steps in the Llama.cpp repo for more advanced usage and customization.

Featured Recommended AI Models

Empowering the Future, Your AI Solution Knowledge Base

English 简体中文繁體中文にほんご