Cambrian-8b: An Open-Source, Vision-Centric Multimodal Large Language Model for Diverse Applications

Cambrian 8b

Developed by nyu-visionx

Cambrian is an open-source multimodal LLM (Large Language Model) designed with a vision-centric approach.

Downloads 565

Release Time: 6/16/2024

Model Overview

Cambrian is a multimodal large language model focused on vision tasks, supporting joint processing of images and text.

Multimodal Capability

Processes images and text jointly, and can both understand and generate multimodal content.

Open-source

The model is fully open-source and released under the Llama 3 Community License.

Large-scale Training Data

Trained on 2.5M alignment samples and 7M carefully selected instruction-tuning samples.

Image Understanding

Text Generation

Multimodal Reasoning

Visual Question Answering

Education

Visual Question Answering

Answers questions based on image content, suitable for interactive learning in educational settings.

Can accurately understand image content and generate relevant answers.

Content Generation

Multimodal Content Creation

Generates descriptive text from images, or image descriptions from a text prompt.

Generated content is highly relevant to the input image or text.

Property	Details
Model Type	Cambrian is an open-source multimodal LLM with vision-centric designs.
Model Date	Cambrian-1-8B was trained in June 2024.
