Model Selection

Online repackaging

# Online repackaging

LGAI EXAONE EXAONE 4.0 1.2B GGUF

EXAONE-4.0-1.2B is a 1.2B parameter language model released by LGAI-EXAONE, offering multiple quantization versions to meet different hardware requirements.

Large Language Model

LGAI EXAONE EXAONE 4.0 32B GGUF

The quantized version of the EXAONE-4.0-32B model by LGAI-EXAONE, quantized using the llama.cpp tool, aiming to provide more flexible usage options for users with different hardware conditions.

Large Language Model

Menlo Lucy GGUF

The Lucy model is a large language model developed by Menlo. After quantization, it can reduce resource requirements while ensuring performance and improve operating efficiency.

Large Language Model

Google Medgemma 4b It GGUF

This is the Llamacpp imatrix quantized version of Google's medgemma-4b-it model, offering multiple quantization options suitable for users with different needs.

Large Language Model

Thedrummer Snowpiercer 15B V2 GGUF

This is a quantized version of TheDrummer's Snowpiercer-15B-v2 model, quantized using the llama.cpp tool, offering multiple quantization types to meet different performance and quality requirements.

Large Language Model

Pinkpixel Crystal Think V2 GGUF

This is a quantized version of PinkPixel's Crystal-Think-V2 model, offering multiple quantization types to meet different hardware and performance requirements.

Large Language Model English

Skywork Skywork SWE 32B GGUF

Skywork-SWE-32B is a large language model with 32B parameters. It is quantized by Llamacpp imatrix and can run efficiently in resource-constrained environments.

Large Language Model

Nvidia AceReason Nemotron 1.1 7B GGUF

This is a quantized version of the NVIDIA AceReason - Nemotron - 1.1 - 7B model, which optimizes the model's running efficiency on different hardware while maintaining certain performance and quality.

Large Language Model Supports Multiple Languages

Delta Vector Austral 24B Winton GGUF

A quantized version of the Austral-24B-Winton model of Delta-Vector, quantized using the llama.cpp tool, suitable for efficient operation on different hardware configurations.

Large Language Model English

Sophosympatheia StrawberryLemonade L3 70B V1.0 GGUF

StrawberryLemonade-L3-70B-v1.0 is a quantized large language model designed to run efficiently under different hardware conditions.

Large Language Model English

Akhil Theerthala Kuvera 8B V0.1.0 GGUF

Kuvera-8B is an 8B parameter large language model focused on the fields of finance and personal finance, offering multiple quantization versions to meet different hardware requirements.

Large Language Model English

Microsoft Phi 4 Mini Reasoning GGUF

This is a quantized version of the Microsoft Phi - 4 - mini - reasoning model, which is quantized using the llamacpp tool to improve the model's operating efficiency and performance in different hardware environments.

Large Language Model Supports Multiple Languages

Zed Industries Zeta GGUF

This is the Llamacpp imatrix quantized version of the zeta model from zed-industries, which solves the problem of efficiently running the model under different hardware conditions and provides multiple quantization types for users to choose from.

Large Language Model

Arcee Ai Virtuoso Small V2 GGUF

A quantized version of the arcee-ai/Virtuoso-Small-v2 model based on llama.cpp, offering multiple quantization types to meet different hardware and performance requirements.

Large Language Model

L3.3 MS Nevoria 70b GGUF

A quantized version based on the Steelskull/L3.3-MS-Nevoria-70b model, using llama.cpp for imatrix quantization, supporting multiple quantization levels for different hardware environments.

Large Language Model

Featured Recommended AI Models

Qwen2.5 VL 7B Abliterated Caption It I1 GGUF

Quantized version of Qwen2.5-VL-7B-Abliterated-Caption-it, supporting multilingual image description tasks.

Transformers Supports Multiple Languages

Nunchaku Flux.1 Dev Colossus

The Nunchaku quantized version of the Colossus Project Flux, designed to generate high-quality images based on text prompts. This model minimizes performance loss while optimizing inference efficiency.

Image Generation English

Qwen2.5 VL 7B Abliterated Caption It GGUF

This is a static quantized version based on the Qwen2.5-VL-7B model, focusing on image captioning generation tasks and supporting multiple languages.

Transformers Supports Multiple Languages

Olmocr 7B 0725 FP8

olmOCR-7B-0725-FP8 is a document OCR model based on the Qwen2.5-VL-7B-Instruct model. It is fine-tuned using the olmOCR-mix-0225 dataset and then quantized to the FP8 version.

Transformers English

Lucy-128k is a model developed based on Qwen3-1.7B, focusing on proxy-based web search and lightweight browsing, and can run efficiently on mobile devices.

Large Language Model

Transformers English

AIbase

Empowering the Future, Your AI Solution Knowledge Base

English 简体中文繁體中文にほんご

© 2025AIbase