G

Gemma 3 4b Pt

Developed by axolotl-mirrors
Gemma 3 is a lightweight, state-of-the-art open model family launched by Google, built on the same research and technology as the Gemini model. It supports multimodality, can process text and image inputs and generate text outputs, and is suitable for a variety of text generation and image understanding tasks.
Downloads 4,332
Release Time : 3/30/2025

Model Overview

Gemma 3 is a multimodal model that can process text and image inputs and generate text outputs, suitable for a variety of text generation and image understanding tasks.

Model Features

Multimodal processing
Can process text and image inputs and generate text outputs.
Large context window
Has a large context window of 128K and supports over 140 languages.
Resource-friendly
Relatively small model size, can be deployed in resource-constrained environments such as laptops, desktops, or self-owned cloud infrastructure.

Model Capabilities

Text generation
Image understanding
Multilingual support
Multimodal processing

Use Cases

Text generation
Text summarization
Generate a summary of the text.
Question-answering system
Answer questions raised by users.
Image understanding
Image description
Generate a text description of the image.
Scored 116 in the COCOcap benchmark test.
Document understanding
Understand the content in the document.
Scored 85.6 in the DocVQA benchmark test.
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase