L

Llava V1.6 Vicuna 13b

Developed by liuhaotian
LLaVA is an open-source multimodal chatbot, fine-tuned on large language models with multimodal instruction-following data.
Downloads 7,080
Release Time : 1/31/2024

Model Overview

LLaVA is an autoregressive language model based on the Transformer architecture, primarily used for researching large multimodal models and chatbots.

Model Features

Multimodal Capability
Combines image and text inputs to generate text outputs.
Instruction Following
Capable of understanding and executing complex multimodal instructions.
Open-source
The model is open-source and available for research and development.

Model Capabilities

Image-Text Understanding
Multimodal Dialogue
Visual Question Answering
Instruction Following

Use Cases

Research
Multimodal Model Research
Used to study the behavior and performance of large multimodal models.
Education
Visual Question Answering System
Build systems capable of answering questions about image content.
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
Š 2025AIbase