LongWriter-Zero-32B-i1-GGUF Open-source Model - Supports both Chinese and English, essential for scenarios such as long-form writing.

LongWriter-Zero-32B-i1-GGUF

Developed by mradermacher

The quantized LongWriter-Zero-32B model is based on the THU-KEG/LongWriter-Zero-32B base model, supports both Chinese and English, and targets long-context writing. The base model incorporates reinforcement learning techniques.

Large Language Model

Transformers

Supports Multiple Languages

Open Source License: Apache-2.0

#Long context processing #Chinese-English bilingual writing #Reinforcement learning optimization

Downloads 135

Release Time: 6/21/2025

Model Overview

This is a large language model that supports both Chinese and English. It is optimized for long-context processing, particularly long-form writing, and incorporates reinforcement learning techniques. Multiple quantized versions are provided, trading file size against quality.

Model Features

Multilingual support

Supports the processing of both English and Chinese languages

Multiple quantization versions

Provides multiple quantization versions of different sizes and qualities for selection

Long context processing

Specially optimized for performance in long context scenarios, suitable for reinforcement learning and writing tasks

Model Capabilities

Long text generation

Bilingual processing

Reinforcement learning support

Writing assistance

Use Cases

Writing

Long article creation

Assist users in creating and conceptualizing long articles

Generate coherent long texts

Reinforcement learning

Long sequence decision-making

Applied in reinforcement learning scenarios that require long context memory

Better long sequence decision-making ability

🚀 LongWriter-Zero-32B Quantized Model

This project provides quantized versions of the LongWriter-Zero-32B model, offering various quantization options for different usage scenarios.

🚀 Quick Start

If you are unsure how to use GGUF files, refer to one of TheBloke's READMEs for more details, including how to concatenate multi-part files.
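As a minimal sketch of the concatenation step mentioned above: when a quant is distributed as raw byte-split parts, joining the parts in order reproduces the original file. The function name `concat_parts` and the part file names in the usage note are hypothetical, and the sketch assumes plain byte-level splitting. It does not apply to shards produced by other splitting tools, and the files listed on this page may well be single files that need no joining.

```python
import shutil
from pathlib import Path


def concat_parts(parts, output):
    """Join byte-split parts, in the order given, into a single file.

    `parts` is an ordered list of paths; `output` is the path to write.
    Assumption: the parts are a raw byte split of one GGUF file.
    """
    output = Path(output)
    with output.open("wb") as dst:
        for part in parts:
            with Path(part).open("rb") as src:
                # Stream in 16 MiB chunks so a multi-gigabyte part is
                # never held in memory all at once.
                shutil.copyfileobj(src, dst, length=16 * 1024 * 1024)
    return output
```

For example, `concat_parts(["model.gguf.part1of2", "model.gguf.part2of2"], "model.gguf")` would write the joined file, where both part names are placeholders rather than actual files from this repository. The order of the list matters, since the parts are appended exactly as given.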

✨ Features

Multi-language Support: Supports both English and Chinese.
Reinforcement Learning: Incorporates reinforcement learning techniques for better performance.
Long Context Handling: Capable of handling long-context writing tasks.

📦 Information

Property	Details
Base Model	THU-KEG/LongWriter-Zero-32B
Datasets	THU-KEG/LongWriter-Zero-RLData
Languages	English, Chinese
Library Name	transformers
License	apache-2.0
Quantized By	mradermacher
Tags	reinforcement-learning, writing, Long Context

📚 Documentation

About

This repository provides weighted/imatrix quants of https://huggingface.co/THU-KEG/LongWriter-Zero-32B. Static quants are available at https://huggingface.co/mradermacher/LongWriter-Zero-32B-GGUF.

Provided Quants

(sorted by size, not necessarily quality. IQ-quants are often preferable over similar sized non-IQ quants)

Link	Type	Size/GB	Notes
GGUF	i1-IQ1_S	7.4	for the desperate
GGUF	i1-IQ1_M	8.0	mostly desperate
GGUF	i1-IQ2_XXS	9.1
GGUF	i1-IQ2_XS	10.1
GGUF	i1-IQ2_S	10.5
GGUF	i1-IQ2_M	11.4
GGUF	i1-Q2_K_S	11.6	very low quality
GGUF	i1-Q2_K	12.4	IQ3_XXS probably better
GGUF	i1-IQ3_XXS	12.9	lower quality
GGUF	i1-IQ3_XS	13.8
GGUF	i1-Q3_K_S	14.5	IQ3_XS probably better
GGUF	i1-IQ3_S	14.5	beats Q3_K*
GGUF	i1-IQ3_M	14.9
GGUF	i1-Q3_K_M	16.0	IQ3_S probably better
GGUF	i1-Q3_K_L	17.3	IQ3_M probably better
GGUF	i1-IQ4_XS	17.8
GGUF	i1-Q4_0	18.8	fast, low quality
GGUF	i1-Q4_K_S	18.9	optimal size/speed/quality
GGUF	i1-Q4_K_M	20.0	fast, recommended
GGUF	i1-Q4_1	20.7
GGUF	i1-Q5_K_S	22.7
GGUF	i1-Q5_K_M	23.4
GGUF	i1-Q6_K	27.0	practically like static Q6_K
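One way to read the table is to pick the largest file that fits a given size budget. The sketch below is only an illustration: the sizes are copied from the table, while the helper name `largest_quant_within` and the idea of a single budget figure are assumptions. File size is a lower bound on what is needed at run time, since context and runtime overhead come on top, and the selection ignores the Notes column, so a same-sized IQ quant may still be the better choice.

```python
# Sizes in GB, copied from the table above (type -> file size).
QUANT_SIZES_GB = {
    "i1-IQ1_S": 7.4, "i1-IQ1_M": 8.0, "i1-IQ2_XXS": 9.1, "i1-IQ2_XS": 10.1,
    "i1-IQ2_S": 10.5, "i1-IQ2_M": 11.4, "i1-Q2_K_S": 11.6, "i1-Q2_K": 12.4,
    "i1-IQ3_XXS": 12.9, "i1-IQ3_XS": 13.8, "i1-Q3_K_S": 14.5,
    "i1-IQ3_S": 14.5, "i1-IQ3_M": 14.9, "i1-Q3_K_M": 16.0,
    "i1-Q3_K_L": 17.3, "i1-IQ4_XS": 17.8, "i1-Q4_0": 18.8,
    "i1-Q4_K_S": 18.9, "i1-Q4_K_M": 20.0, "i1-Q4_1": 20.7,
    "i1-Q5_K_S": 22.7, "i1-Q5_K_M": 23.4, "i1-Q6_K": 27.0,
}


def largest_quant_within(budget_gb, sizes=QUANT_SIZES_GB):
    """Return the quant type with the largest file size <= budget_gb.

    Returns None when no listed file fits. Hypothetical helper: it
    compares file sizes only and does not model runtime memory use.
    """
    fitting = {name: size for name, size in sizes.items() if size <= budget_gb}
    if not fitting:
        return None
    return max(fitting, key=fitting.get)
```

With a 20.0 GB budget this selects `i1-Q4_K_M`, which the table also marks as recommended. With a budget below 7.4 GB it returns `None`, because no listed file is that small.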

Here is a handy graph by ikawrakow comparing some lower-quality quant types (lower is better):

And here are Artefact2's thoughts on the matter: https://gist.github.com/Artefact2/b5f810600771265fc1e39442288e8ec9

FAQ / Model Request

See https://huggingface.co/mradermacher/model_requests for some answers to questions you might have and/or if you want some other model quantized.

📄 License

This project is licensed under the Apache-2.0 license.

🙏 Thanks

I thank my company, nethype GmbH, for letting me use its servers and providing upgrades to my workstation to enable this work in my free time. Additional thanks to @nicoboss for giving me access to his private supercomputer, enabling me to provide many more imatrix quants, at much higher quality, than I would otherwise be able to.
