L

Llava V1.6 Vicuna 13b Gguf

Developed by cjpais
LLaVA is an open-source multimodal chatbot based on the Transformer architecture, offering various model versions that balance size and quality through quantization techniques.
Downloads 630
Release Time : 2/17/2024

Model Overview

LLaVA is an open-source chatbot trained by fine-tuning LLMs on multimodal instruction-following data, supporting image-to-text and text-to-text tasks.

Model Features

Multimodal Capability
Combines visual and language understanding to handle interactive tasks involving images and text.
Quantization Options
Offers multiple quantization versions from 3-bit to 8-bit, balancing model size and inference quality.
Instruction Following
Fine-tuned with extensive instruction data to better understand and execute complex instructions.

Model Capabilities

Image Understanding
Multimodal Dialogue
Visual Question Answering
Instruction Following

Use Cases

Research
Multimodal Model Research
Used for research in the intersection of computer vision and natural language processing.
Application Development
Intelligent Chatbot
Develop dialogue systems capable of understanding image content.
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase