Gemma 3 12B is a large language model developed by Google, offering multiple quantized versions suitable for different hardware environments and performance requirements.
## Model Features

- **Multiple Quantized Versions**: quantized variants from Q2_K to Q8_0 are provided to match different hardware and performance requirements.
- **Local Deployment**: the GGUF format supports local deployment without relying on cloud services (see the sketch after this list).
- **High Performance**: optimized quantization reduces resource consumption while maintaining model performance.
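As a minimal sketch of local deployment, assuming you have a llama.cpp build and have downloaded one of the quantized files (the Q4_K_M file name here is a stand-in for whichever version you choose):

```sh
# Run the model fully offline with llama.cpp's CLI.
# The file name is an assumption -- substitute the quantization you downloaded.
./llama-cli -m ./gemma-3-12b-it-Q4_K_M.gguf \
  -p "Explain the GGUF format in one paragraph." \
  -n 256   # cap generation at 256 tokens
```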
## Model Capabilities

- Text Generation
- Dialogue Systems
- Code Generation
- Text Summarization
## Use Cases

### Natural Language Processing

- **Dialogue Systems**: build intelligent chatbots that offer a natural, fluid interaction experience (see the server sketch after this list).
- **Text Summarization**: automatically generate concise summaries of long texts to speed up information gathering.

### Code Generation

- **Code Completion**: help developers generate code snippets quickly and improve programming efficiency.
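As a sketch of the dialogue use case, llama.cpp's bundled `llama-server` exposes an OpenAI-compatible chat endpoint; the model path and port below are assumptions:

```sh
# Start a local chat server (OpenAI-compatible API).
./llama-server -m ./gemma-3-12b-it-Q4_K_M.gguf --port 8080

# In a second shell, send a chat turn to the running server.
curl http://localhost:8080/v1/chat/completions \
  -H "Content-Type: application/json" \
  -d '{"messages": [{"role": "user", "content": "Summarize GGUF in one sentence."}]}'
```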
# 🚀 google/gemma-3-12b-it - GGUF
This repository contains GGUF-format model files for google/gemma-3-12b-it, making the model available to GGUF-compatible runtimes such as llama.cpp.
## 🚀 Quick Start

### Access Gemma on Hugging Face

To access Gemma on Hugging Face, you are required to review and agree to Google's usage license. Make sure you are logged in to Hugging Face and accept the license on the model page; requests are processed immediately.
## Model Information

| Property | Details |
| --- | --- |
| Library Name | transformers |
| Pipeline Tag | image-text-to-text |
| Base Model | google/gemma-3-12b-it |
| Tags | TensorBlock, GGUF |
| License | gemma |
## ✨ Features
The files were quantized using machines provided by TensorBlock, and they are compatible with llama.cpp as of commit b4882.
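A minimal sketch of obtaining a compatible build from source (llama.cpp publishes release tags in the `bNNNN` form, so the tag below is assumed to correspond to the commit named above; build flags vary by platform):

```sh
git clone https://github.com/ggml-org/llama.cpp
cd llama.cpp
git checkout b4882            # assumed release tag for the stated commit
cmake -B build
cmake --build build --config Release
```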
## 📦 Installation

### Command line

First, install the Hugging Face Hub CLI:

```sh
pip install -U "huggingface_hub[cli]"
```
Then, download the individual model file to a local directory:
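For example (the repository ID and file name below are assumptions; substitute the actual repo and the quantization you want):

```sh
huggingface-cli download tensorblock/gemma-3-12b-it-GGUF \
  --include "gemma-3-12b-it-Q4_K_M.gguf" \
  --local-dir ./models
```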