G

Gemma 3 12b It Qat Compressed Tensors

Developed by gaunernst
Gemma 3 is Google's lightweight cutting-edge open model family, built on the same research and technology used to create Gemini models. This model is multimodal, capable of processing both text and image inputs to generate text outputs.
Downloads 867
Release Time : 4/8/2025

Model Overview

The Gemma 3 12B model is an instruction-tuned version employing Quantization-Aware Training (QAT) and compressed tensor formats, significantly reducing memory requirements while maintaining quality comparable to bfloat16. Suitable for various text generation and image understanding tasks.

Model Features

Multimodal capability
Can process both text and image inputs to generate text outputs
Large context window
Supports context lengths of up to 128K tokens
Quantization-aware training
Uses QAT technology to reduce memory requirements while preserving model quality
Multilingual support
Supports processing in over 140 languages

Model Capabilities

Text generation
Image content analysis
Multilingual processing
Question answering systems
Document summarization
Logical reasoning

Use Cases

Content generation
Poetry creation
Generates poems based on user prompts
Can produce creative poems aligned with given themes
Document summarization
Automatically generates concise summaries of long documents
Accurately extracts key information
Visual understanding
Image captioning
Analyzes image content and generates textual descriptions
Accurately identifies main elements and scenes in images
Education
Math problem solving
Solves mathematical problems and logical reasoning
Achieves 82.6 score on GSM8K benchmark
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase