L

Llava V1.6 34B Gguf

Developed by cjpais
LLaVA 1.6 34B is an open-source multimodal chatbot model developed by fine-tuning a large language model on multimodal instruction-following data. It supports image-to-text and text-to-text generation tasks.
Downloads 1,965
Release Time : 2/1/2024

Model Overview

LLaVA is an autoregressive language model based on the Transformer architecture, primarily used for academic research in multimodal large models and chatbots.

Model Features

Multimodal Support
Capable of processing both image and text inputs to generate text outputs
Multiple Quantization Versions
Offers various quantization versions from 3-bit to 8-bit to meet different hardware requirements
High-Quality Fine-Tuning
Fine-tuned on extensive multimodal instruction-following data

Model Capabilities

Image Understanding
Multimodal Dialogue
Visual Question Answering
Image Caption Generation

Use Cases

Academic Research
Multimodal Model Research
Used for research in the intersection of computer vision and natural language processing
Application Development
Intelligent Chatbot
Develop dialogue systems capable of understanding image content
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
Š 2025AIbase