llava-v1.5-7b Open-source Multimodal Chatbot - Free Experience of Image-text Interactive Dialogue

Home

Llava V1.5 7b

Developed by liuhaotian

LLaVA is an open-source multimodal chatbot, fine-tuned based on LLaMA/Vicuna, supporting image-text interaction.

Image-to-Text

Transformers

#Multimodal Instruction Following #Academic VQA Benchmark #Open-source Chatbot

Downloads 1.4M

Release Time : 10/5/2023

Model Overview

An open-source chatbot trained with GPT-generated multimodal instruction-following data through fine-tuning LLaMA/Vicuna, equipped with image-text understanding and generation capabilities.

Model Features

Multimodal Understanding

Processes both image and text inputs for cross-modal interaction.

Instruction Following

Capable of understanding and executing complex multimodal instructions.

Open-source Fine-tuning

Based on an open-source model architecture, supports further customization and optimization.

Model Capabilities

Image caption generation

Visual Question Answering

Multimodal dialogue

Instruction following

Cross-modal reasoning

Use Cases

Academic Research

Multimodal Model Research

Used to explore joint visual-language representation learning.

Intelligent Assistant

Image-Text Interactive Assistant

Builds dialogue systems capable of understanding image content.

Property	Details
Model Type	LLaVA is an open - source chatbot trained by fine - tuning LLaMA/Vicuna on GPT - generated multimodal instruction - following data. It is an auto - regressive language model, based on the transformer architecture.
Model Date	LLaVA - v1.5 - 7B was trained in September 2023.
Paper or Resources for More Information	[https://llava - vl.github.io/](https://llava - vl.github.io/)

Featured Recommended AI Models

Empowering the Future, Your AI Solution Knowledge Base

English 简体中文繁體中文にほんご

Llava V1.5 7b

Model Overview

Model Features

Model Capabilities

Use Cases

🚀 LLaVA Model Card

🚀 Quick Start

✨ Features

📚 Documentation

Model Details

License

Intended Use

Primary Intended Uses

Primary Intended Users

Training Dataset

Evaluation Dataset