G

Gemma 3n E4B It 4bit MLX

Developed by NexaAI
Gemma 3n is a multimodal lightweight open-source model based on the Google Gemma model, supporting text, image, video, and audio inputs. It is optimized for low-resource devices.
Downloads 122
Release Time : 7/13/2025

Model Overview

Gemma 3n is a lightweight open-source model launched by Google, using the same technology as Gemini. It supports multimodal inputs and text outputs, suitable for low-resource devices.

Model Features

Multimodal support
Capable of processing text, image, audio, and video inputs and generating text outputs.
Low-resource optimization
Using selective parameter activation technology to reduce resource requirements and suitable for running on low-resource devices.
Efficient parameter management
Runs with an effective scale of 2 billion and 4 billion parameters, lower than the total number of parameters.
Multilingual support
Trained with data in over 140 spoken languages, with strong multilingual processing capabilities.

Model Capabilities

Text generation
Image content analysis
Audio data processing
Video content understanding
Multilingual text processing

Use Cases

Content generation
Document summarization
Input a long document and generate a concise summary.
Efficiently generate accurate and coherent summaries.
Question answering
Input a question and generate a detailed answer.
Performs excellently in multiple benchmark tests.
Multimodal analysis
Image description generation
Input an image and generate a detailed text description.
Supports multiple resolutions and generates high-quality descriptions.
Audio transcription
Input audio data and generate a text transcription.
Encodes 6.25 tokens per second, supporting mono.
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase