# Online repackaging
LGAI EXAONE EXAONE 4.0 1.2B GGUF
EXAONE-4.0-1.2B is a 1.2B parameter language model released by LGAI-EXAONE, offering multiple quantization versions to meet different hardware requirements.
Large Language Model
L
bartowski
402
1
LGAI EXAONE EXAONE 4.0 32B GGUF
The quantized version of the EXAONE-4.0-32B model by LGAI-EXAONE, quantized using the llama.cpp tool, aiming to provide more flexible usage options for users with different hardware conditions.
Large Language Model
L
bartowski
708
2
Menlo Lucy GGUF
The Lucy model is a large language model developed by Menlo. After quantization, it can reduce resource requirements while ensuring performance and improve operating efficiency.
Large Language Model
M
bartowski
674
3
Google Medgemma 4b It GGUF
Other
This is the Llamacpp imatrix quantized version of Google's medgemma-4b-it model, offering multiple quantization options suitable for users with different needs.
Large Language Model
G
bartowski
348
3
Thedrummer Snowpiercer 15B V2 GGUF
MIT
This is a quantized version of TheDrummer's Snowpiercer-15B-v2 model, quantized using the llama.cpp tool, offering multiple quantization types to meet different performance and quality requirements.
Large Language Model
T
bartowski
1,235
1
Pinkpixel Crystal Think V2 GGUF
Apache-2.0
This is a quantized version of PinkPixel's Crystal-Think-V2 model, offering multiple quantization types to meet different hardware and performance requirements.
Large Language Model English
P
bartowski
128
1
Skywork Skywork SWE 32B GGUF
Apache-2.0
Skywork-SWE-32B is a large language model with 32B parameters. It is quantized by Llamacpp imatrix and can run efficiently in resource-constrained environments.
Large Language Model
S
bartowski
921
2
Nvidia AceReason Nemotron 1.1 7B GGUF
Other
This is a quantized version of the NVIDIA AceReason - Nemotron - 1.1 - 7B model, which optimizes the model's running efficiency on different hardware while maintaining certain performance and quality.
Large Language Model Supports Multiple Languages
N
bartowski
1,303
1
Delta Vector Austral 24B Winton GGUF
Apache-2.0
A quantized version of the Austral-24B-Winton model of Delta-Vector, quantized using the llama.cpp tool, suitable for efficient operation on different hardware configurations.
Large Language Model English
D
bartowski
421
1
Sophosympatheia StrawberryLemonade L3 70B V1.0 GGUF
StrawberryLemonade-L3-70B-v1.0 is a quantized large language model designed to run efficiently under different hardware conditions.
Large Language Model English
S
bartowski
1,406
1
Akhil Theerthala Kuvera 8B V0.1.0 GGUF
MIT
Kuvera-8B is an 8B parameter large language model focused on the fields of finance and personal finance, offering multiple quantization versions to meet different hardware requirements.
Large Language Model English
A
bartowski
793
1
Microsoft Phi 4 Mini Reasoning GGUF
MIT
This is a quantized version of the Microsoft Phi - 4 - mini - reasoning model, which is quantized using the llamacpp tool to improve the model's operating efficiency and performance in different hardware environments.
Large Language Model Supports Multiple Languages
M
bartowski
1,667
7
Zed Industries Zeta GGUF
Apache-2.0
This is the Llamacpp imatrix quantized version of the zeta model from zed-industries, which solves the problem of efficiently running the model under different hardware conditions and provides multiple quantization types for users to choose from.
Large Language Model
Z
bartowski
561
12
Arcee Ai Virtuoso Small V2 GGUF
Apache-2.0
A quantized version of the arcee-ai/Virtuoso-Small-v2 model based on llama.cpp, offering multiple quantization types to meet different hardware and performance requirements.
Large Language Model
A
bartowski
1,976
10
L3.3 MS Nevoria 70b GGUF
A quantized version based on the Steelskull/L3.3-MS-Nevoria-70b model, using llama.cpp for imatrix quantization, supporting multiple quantization levels for different hardware environments.
Large Language Model
L
bartowski
5,252
12
Featured Recommended AI Models
Qwen2.5 VL 7B Abliterated Caption It I1 GGUF
Apache-2.0
Quantized version of Qwen2.5-VL-7B-Abliterated-Caption-it, supporting multilingual image description tasks.
Image-to-Text
Transformers Supports Multiple Languages

Q
mradermacher
167
1
Nunchaku Flux.1 Dev Colossus
Other
The Nunchaku quantized version of the Colossus Project Flux, designed to generate high-quality images based on text prompts. This model minimizes performance loss while optimizing inference efficiency.
Image Generation English
N
nunchaku-tech
235
3
Qwen2.5 VL 7B Abliterated Caption It GGUF
Apache-2.0
This is a static quantized version based on the Qwen2.5-VL-7B model, focusing on image captioning generation tasks and supporting multiple languages.
Image-to-Text
Transformers Supports Multiple Languages

Q
mradermacher
133
1
Olmocr 7B 0725 FP8
Apache-2.0
olmOCR-7B-0725-FP8 is a document OCR model based on the Qwen2.5-VL-7B-Instruct model. It is fine-tuned using the olmOCR-mix-0225 dataset and then quantized to the FP8 version.
Image-to-Text
Transformers English

O
allenai
881
3
Lucy 128k GGUF
Apache-2.0
Lucy-128k is a model developed based on Qwen3-1.7B, focusing on proxy-based web search and lightweight browsing, and can run efficiently on mobile devices.
Large Language Model
Transformers English

L
Mungert
263
2