# Extreme quantization optimization
## MiniCPM4-8B GGUF
License: Apache-2.0 · Author: Mungert · Downloads: 906 · Likes: 2
Tags: Large Language Model, Transformers, multilingual

MiniCPM4-8B is an efficient large language model designed specifically for edge devices. Through innovations in four dimensions (model architecture, training data, training algorithms, and inference systems), it achieves extreme efficiency improvements.
## Gemma 3 1B It MAX HORROR Imatrix GGUF
License: Apache-2.0 · Author: DavidAU · Downloads: 1,279 · Likes: 2
Tags: Large Language Model

A horror-optimized build of Google's Gemma-3 1B instruction-tuned model, combining extreme quantization with a horror-tuned importance matrix (imatrix) and supporting a 32k context window.
## Gemma 3 12B It MAX HORROR Imatrix GGUF
License: Apache-2.0 · Author: DavidAU · Downloads: 5,072 · Likes: 13
Tags: Large Language Model

A horror-styled build of Google's Gemma-3 12B instruction-tuned model, featuring Neo Imatrix technology and extreme quantization, and supporting a 128k context length.
## Gemma 3 4B It MAX NEO Imatrix GGUF
License: Apache-2.0 · Author: DavidAU · Downloads: 2,558 · Likes: 7
Tags: Large Language Model

An extreme-quantization build of Google's Gemma-3 4B instruction-tuned model, enhanced with NEO Imatrix technology, supporting a 128k context length and suitable for general-purpose tasks.