OpenBuddy-R1, a lightweight large language model, is open-sourced - Free support for communication in 8 languages.

Openbuddy R1 0528 Distill Qwen3 32B Preview1 QAT Q3 K M GGUF

Developed by OpenBuddy

A lightweight multilingual large language model distilled from OpenBuddy/Qwen3-32B, supporting 8 languages and using the Apache 2.0 license

Large Language Model Supports Multiple LanguagesOpen Source License:Apache-2.0 #Multilingual text generation #High compression quantization #Apache 2.0 open source

Downloads 121

Release Time : 6/12/2025

Model Overview

This is a 32B parameter large language model converted to the GGUF format via llama.cpp, suitable for text generation tasks and supporting multiple languages such as Chinese and English

Model Features

Multilingual support

Supports text generation in 8 major languages

Lightweight deployment

Achieves efficient deployment through the GGUF format and quantization technology

Open source license

Uses the Apache 2.0 license, allowing commercial use

Model Capabilities

Multilingual text generation

Long text processing

Dialogue system

Use Cases

Content creation

Multilingual article writing

Generate blog articles or news content in different languages

Dialogue system

Multilingual chatbot

Build an intelligent dialogue system supporting multiple languages

🚀 ff670/OpenBuddy-R1-0528-Distill-Qwen3-32B-Preview1-QAT-Q3_K_M-GGUF

This model is converted to GGUF format from the original model, aiming to provide more convenient and efficient text - generation services.

🚀 Quick Start

This model was converted to GGUF format from OpenBuddy/OpenBuddy-R1-0528-Distill-Qwen3-32B-Preview1-QAT using llama.cpp via the ggml.ai's GGUF-my-repo space. Refer to the original model card for more details on the model.

📦 Installation

Install llama.cpp through brew (works on Mac and Linux)

brew install llama.cpp

💻 Usage Examples

Basic Usage

Invoke the llama.cpp server or the CLI.

CLI:

llama-cli --hf-repo ff670/OpenBuddy-R1-0528-Distill-Qwen3-32B-Preview1-QAT-Q3_K_M-GGUF --hf-file openbuddy-r1-0528-distill-qwen3-32b-preview1-qat-q3_k_m.gguf -p "The meaning to life and the universe is"

Server:

llama-server --hf-repo ff670/OpenBuddy-R1-0528-Distill-Qwen3-32B-Preview1-QAT-Q3_K_M-GGUF --hf-file openbuddy-r1-0528-distill-qwen3-32b-preview1-qat-q3_k_m.gguf -c 2048

Advanced Usage

Note: You can also use this checkpoint directly through the usage steps listed in the Llama.cpp repo as well.

Step 1: Clone llama.cpp from GitHub.

git clone https://github.com/ggerganov/llama.cpp

Step 2: Move into the llama.cpp folder and build it with `LLAMA_CURL = 1` flag along with other hardware - specific flags (for ex: LLAMA_CUDA = 1 for Nvidia GPUs on Linux).

cd llama.cpp && LLAMA_CURL = 1 make

Step 3: Run inference through the main binary.

./llama-cli --hf-repo ff670/OpenBuddy-R1-0528-Distill-Qwen3-32B-Preview1-QAT-Q3_K_M-GGUF --hf-file openbuddy-r1-0528-distill-qwen3-32b-preview1-qat-q3_k_m.gguf -p "The meaning to life and the universe is"

./llama-server --hf-repo ff670/OpenBuddy-R1-0528-Distill-Qwen3-32B-Preview1-QAT-Q3_K_M-GGUF --hf-file openbuddy-r1-0528-distill-qwen3-32b-preview1-qat-q3_k_m.gguf -c 2048

📄 License

The license for this project is apache - 2.0.

Property	Details
Supported Languages	zh, en, fr, de, ja, ko, it, fi
Model Type	Text Generation
Base Model	OpenBuddy/OpenBuddy-R1-0528-Distill-Qwen3-32B-Preview1-QAT
License	apache-2.0
Tags	qwen3, llama-cpp, gguf-my-repo

Featured Recommended AI Models

Empowering the Future, Your AI Solution Knowledge Base

English 简体中文繁體中文にほんご