Llava V1.6 Vicuna 13b Gguf
LLaVA is an open-source multimodal chatbot based on the Transformer architecture, offering various model versions that balance size and quality through quantization techniques.
Downloads 630
Release Time : 2/17/2024
Model Overview
LLaVA is an open-source chatbot trained by fine-tuning LLMs on multimodal instruction-following data, supporting image-to-text and text-to-text tasks.
Model Features
Multimodal Capability
Combines visual and language understanding to handle interactive tasks involving images and text.
Quantization Options
Offers multiple quantization versions from 3-bit to 8-bit, balancing model size and inference quality.
Instruction Following
Fine-tuned with extensive instruction data to better understand and execute complex instructions.
Model Capabilities
Image Understanding
Multimodal Dialogue
Visual Question Answering
Instruction Following
Use Cases
Research
Multimodal Model Research
Used for research in the intersection of computer vision and natural language processing.
Application Development
Intelligent Chatbot
Develop dialogue systems capable of understanding image content.
Featured Recommended AI Models