Model Selection

Multimodal Text-Image Understanding

# Multimodal Text-Image Understanding

Gemma 3 12b It Qat Q4 0 Gguf

Gemma 3 is Google's lightweight cutting-edge open-source multimodal model supporting image-text input and text output, featuring a 128K context window and 140+ language support.

Featured Recommended AI Models

AIbase

Empowering the Future, Your AI Solution Knowledge Base

English 简体中文繁體中文にほんご

© 2025AIbase