Chameleon 7b
Meta Chameleon is a hybrid-modality early-fusion foundational model developed by FAIR, supporting multimodal processing of images and text.
Downloads 20.97k
Release Time : 3/26/2024
Model Overview
Chameleon is a hybrid-modality early-fusion foundational model capable of processing joint inputs of images and text, suitable for multimodal understanding and generation tasks.
Model Features
Hybrid Modality Early Fusion
Supports early fusion processing of images and text for better multimodal understanding.
Multi-scale Parameter Sizes
Offers model options with 7B and 30B parameter scales.
Research-oriented
Focuses on advancing research in multimodal foundational models.
Model Capabilities
Image Understanding
Text Generation
Multimodal Reasoning
Cross-modal Information Fusion
Use Cases
Multimodal Research
Visual Question Answering
Question answering system based on image and text inputs.
Image Caption Generation
Generates textual descriptions based on image content.
Featured Recommended AI Models