C

Cambrian 8b

Developed by nyu-visionx
Cambrian is an open-source multimodal LLM (Large Language Model) designed with a vision-centric approach.
Downloads 565
Release Time : 6/16/2024

Model Overview

Cambrian is a multimodal large language model focused on vision tasks, supporting joint processing of images and text.

Model Features

Multimodal Capability
Supports joint processing of images and text, capable of understanding and generating multimodal content.
Open-source
The model is fully open-source, adhering to the LLAMA 3 Community License.
Large-scale Training Data
Trained using 2.5M aligned data and 7M carefully selected instruction tuning data.

Model Capabilities

Image Understanding
Text Generation
Multimodal Reasoning
Visual Question Answering

Use Cases

Education
Visual Question Answering
Answers questions based on image content, suitable for interactive learning in educational settings.
Can accurately understand image content and generate relevant answers.
Content Generation
Multimodal Content Creation
Generates descriptive text based on images or image descriptions based on text.
Generated content is highly relevant to the input image or text.
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
Š 2025AIbase