Llava V1.6 34B Gguf
LLaVA 1.6 34B is an open-source multimodal chatbot model developed by fine-tuning a large language model on multimodal instruction-following data. It supports image-to-text and text-to-text generation tasks.
Downloads 1,965
Release Time : 2/1/2024
Model Overview
LLaVA is an autoregressive language model based on the Transformer architecture, primarily used for academic research in multimodal large models and chatbots.
Model Features
Multimodal Support
Capable of processing both image and text inputs to generate text outputs
Multiple Quantization Versions
Offers various quantization versions from 3-bit to 8-bit to meet different hardware requirements
High-Quality Fine-Tuning
Fine-tuned on extensive multimodal instruction-following data
Model Capabilities
Image Understanding
Multimodal Dialogue
Visual Question Answering
Image Caption Generation
Use Cases
Academic Research
Multimodal Model Research
Used for research in the intersection of computer vision and natural language processing
Application Development
Intelligent Chatbot
Develop dialogue systems capable of understanding image content
Featured Recommended AI Models
Š 2025AIbase