Llava 1.6 Mistral 7b Gguf
LLaVA is an open-source multimodal chatbot, trained by fine-tuning LLM on multimodal instruction-following data. This version is the GGUF quantized version, offering multiple quantization options.
Downloads 9,652
Release Time : 2/1/2024
Model Overview
A multimodal model based on Mistral-7B-Instruct-v0.2, supporting image and text inputs to generate text outputs. Primarily used for research on large multimodal models and chatbots.
Model Features
Multimodal Capability
Supports processing both image and text inputs to generate relevant text outputs
Multiple Quantization Options
Offers various quantization versions from 3-bit to 8-bit to meet different hardware requirements
Optimized Projector
Updated quantization parameters and projector to improve model performance
Model Capabilities
Image Understanding
Multimodal Dialogue
Visual Question Answering
Instruction Following
Use Cases
Research
Multimodal Model Research
Used for research at the intersection of computer vision and natural language processing
Chatbot Development
Develop intelligent dialogue systems capable of understanding image content
Education
Visual-Assisted Learning
Helps students understand complex concepts through images
Featured Recommended AI Models
Š 2025AIbase