G

Gemma 3n E2B

Developed by google
Gemma 3n is a lightweight and state - of - the - art open - source model family launched by Google, supporting multimodal input and output.
Downloads 206
Release Time : 6/12/2025

Model Overview

Gemma 3n is a lightweight open - source model built on the same research and technology as the Gemini model, supporting text, audio, and visual (image and video) input, suitable for various tasks and data formats.

Model Features

Multimodal support
Capable of processing text, image, video, and audio input and generating text output.
Architectural innovation
Uses the MatFormer architecture, allowing nested sub - models in the E4B model.
Resource - efficient
By offloading low - utilization matrices from the accelerator, the model's memory footprint is comparable to that of a traditional 2B model.

Model Capabilities

Text generation
Image analysis
Video analysis
Audio analysis
Multimodal reasoning

Use Cases

Content creation
Image description generation
Generate detailed text descriptions based on the input image.
Generate accurate and detailed image descriptions.
Research and education
Multimodal learning
Utilize multimodal input for educational and research tasks.
Improve the efficiency of learning and research.
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase