# High-precision text generation
Arliai QwQ 32B ArliAI RpR V4 GGUF
Apache-2.0
A 32B-parameter quantized large language model based on ArliAI/QwQ-32B-ArliAI-RpR-v4, quantized with llama.cpp at various precisions, suitable for text generation tasks.
Large Language Model English
A
bartowski
1,721
1
Thedrummer Valkyrie 49B V1 GGUF
Valkyrie-49B-v1 is a 49B-parameter large language model based on llama.cpp, offering multiple quantization versions suitable for different hardware configurations.
Large Language Model
T
bartowski
64.03k
9
Autogressive 32B
Apache-2.0
Autoregressive-32B is a Multiverse-32B baseline model built based on autoregressive modeling, providing strong support for text generation tasks.
Large Language Model
Transformers

A
Multiverse4FM
1,945
1
Thedrummer Rivermind Lux 12B V1 GGUF
This is a 12B-parameter large language model, processed with llama.cpp's imatrix quantization, offering multiple quantized versions to accommodate different hardware requirements.
Large Language Model
T
bartowski
1,353
1
INTELLECT 2 GGUF
Apache-2.0
INTELLECT 2 is a large language model launched by PrimeIntellect, supporting a context length of 40960 tokens, trained using the QwQ architecture and GRPO reinforcement learning framework.
Large Language Model
I
lmstudio-community
467
5
Paxinium 12b Model Stock
This is a project that uses the mergekit tool to merge multiple pre-trained language models with a 12B parameter scale. The aim is to integrate the advantages of different models and improve language processing capabilities.
Large Language Model
Transformers

P
DreadPoor
212
3
Wanabi 24b V1 GGUF
Apache-2.0
A large-scale language model fine-tuned specifically for Japanese novel writing support
Large Language Model Japanese
W
kawaimasa
274
2
Open Thoughts OpenThinker2 32B GGUF
Apache-2.0
Quantized version of OpenThinker2-32B, using llama.cpp for imatrix quantization, supports multiple quantization types, suitable for text generation tasks.
Large Language Model
O
bartowski
1,332
10
Loyalmaid 12B
LoyalMaid-12B is a large language model with a 12B parameter scale, obtained by fusing multiple pre-trained language models using the mergekit tool.
Large Language Model
Transformers

L
yamatazen
83
5
ABEJA QwQ32b Reasoning Japanese V1.0
Apache-2.0
A Japanese reasoning model developed based on Qwen2.5-32B-Instruct, integrating the ChatVector of QwQ-32B and optimizing Japanese reasoning performance.
Large Language Model
Transformers Japanese

A
abeja
583
10
Community Request 02 12B
Fused from multiple 12B-parameter large language models, equipped with text generation and dialogue capabilities
Large Language Model
Transformers

C
Nitral-AI
53
4
Mistral Small 3.1 24B Instruct 2503 HF Imat GGUF
Apache-2.0
A 24B-parameter instruction-tuned model based on the Mistral architecture, suitable for text generation tasks
Large Language Model
M
qwp4w3hyb
2,573
3
Gemma 3 27b It Abliterated Q8 0 GGUF
This is a GGUF format model converted from mlabonne/gemma-3-27b-it-abliterated, suitable for the llama.cpp framework.
Large Language Model
G
KnutJaegersberg
196
2
Thedrummer Cydonia 24B V2.1 GGUF
Other
Cydonia-24B-v2.1 is a 24B parameter large language model, processed with llama.cpp's imatrix quantization, offering multiple quantized versions to suit different hardware requirements.
Large Language Model
T
bartowski
4,417
7
QWQ Stock
A merged model based on multiple Qwen series 32B parameter models, enhanced with Model Stock method for improved multilingual processing capabilities
Large Language Model
Transformers

Q
wanlige
368
7
Lamarckvergence 14B
Apache-2.0
Lamarckvergence-14B is a pre-trained language model merged via mergekit, combining Lamarck-14B-v0.7 and Qwenvergence-14B-v12-Prose-DS. It ranks first among models with fewer than 15B parameters on the Open LLM Leaderboard.
Large Language Model
Transformers English

L
suayptalha
15.36k
24
Nera Noctis 12B GGUF
Other
Llamacpp imatrix quantized version of Nera_Noctis-12B, based on Nitral-AI/Nera_Noctis-12B model, supporting English text generation tasks.
Large Language Model English
N
bartowski
64
6
Meta Llama 3.1 70B Instruct FP8
FP8 quantized version of Meta-Llama-3.1-70B-Instruct, suitable for multilingual commercial and research purposes, especially ideal for assistant-like chat scenarios.
Large Language Model
Transformers Supports Multiple Languages

M
RedHatAI
71.73k
45
Bielik 7B Instruct V0.1
Bielik-7B-Instruct-v0.1 is a Polish large language model fine-tuned for instructions, based on Bielik-7B-v0.1, developed by the SpeakLeash team in collaboration with ACK Cyfronet AGH, specializing in Polish language understanding and processing tasks.
Large Language Model
Transformers Other

B
speakleash
656
57
Mistral Evolved 11b V0.1 GGUF
Apache-2.0
Quantized version of Mistral-Evolved-11b-v0.1, using llama.cpp for quantization, offering multiple quantization options to suit different needs.
Large Language Model
M
bartowski
1,761
7
Kunoichi DPO V2 7B GGUF Imatrix
A 7B-parameter large language model based on the Mistral architecture, trained with DPO (Direct Preference Optimization), demonstrating excellent performance in multiple benchmarks
Large Language Model
K
Lewdiculous
3,705
39
Quartetanemoi 70B T0.0001
Other
QuartetAnemoi-70B-t0.0001 is a 70B-parameter large language model that merges multiple excellent models through a custom NearSwap algorithm, excelling in storytelling while avoiding clichés.
Large Language Model
Transformers

Q
alchemonaut
16
37
Daringmaid 20B V1.1 GGUF
Daring Maid-20B-V1.1 is an upgraded version based on DaringMaid-20B, with the main update being the replacement of Noromaid-13b from v0.1.1 to v0.3, along with a slight increase in Noromaid's weight to ensure better compatibility.
Large Language Model English
D
Kooten
190
5
Daringmaid 20B
Daring Maid-20B is a text generation model based on the fusion of multiple excellent models, aiming to create a smarter and more instruction-following Noromaid model.
Large Language Model
Transformers English

D
Kooten
163
14
Featured Recommended AI Models