L

Llava V1.6 Vicuna 7b Gguf

Developed by cjpais
LLaVA is an open-source multimodal chatbot trained by fine-tuning LLM on multimodal instruction-following data. This version is the GGUF quantized version, offering multiple quantization options.
Downloads 493
Release Time : 2/17/2024

Model Overview

LLaVA is an autoregressive language model based on the Transformer architecture, capable of processing both image and text inputs to generate text outputs. Primarily used for research on large multimodal models and chatbots.

Model Features

Multimodal Capability
Can process both image and text inputs to generate relevant text outputs
Multiple Quantization Options
Offers various quantized versions from 3-bit to 8-bit to meet different hardware and performance needs
Open Source
Licensed under Apache-2.0, allowing free use and modification

Model Capabilities

Image Understanding
Text Generation
Multimodal Dialogue
Visual Question Answering

Use Cases

Research
Multimodal Model Research
Used for research at the intersection of computer vision and natural language processing
Application Development
Intelligent Chatbot
Develop dialogue systems capable of understanding image content
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
Š 2025AIbase