# Open-source Chatbot
Spydaz Web AI Llava
LLaVa is an open-source multimodal chatbot, fine-tuned on GPT-generated multimodal instruction-following data based on LLaMA/Vicuna, specifically optimized for chat/instruction-following as a multimodal version of LLM.
Image-to-Text
Transformers Supports Multiple Languages

_
LeroyDyer
30
1
Llava NeXT Video 7B DPO Hf
LLaVA-NeXT-Video is an open-source multimodal chatbot optimized through mixed training on video and image data, possessing excellent video understanding capabilities.
Video-to-Text
Transformers English

L
llava-hf
12.61k
9
Denseconnector V1.5 8B
DenseConnector is an open-source chatbot, fine-tuned based on LLaMA/Vicuna and trained using GPT-generated multimodal instruction-following data.
Image-to-Text
Transformers

D
HuanjinYao
17
7
Vsft Llava 1.5 7b Hf Trl
A multimodal vision-language model based on LLaVA-1.5-7B trained through Visual Supervised Fine-Tuning (VSFT), supporting image understanding and dialogue generation
Image-to-Text
Transformers English

V
HuggingFaceH4
65
14
Llava V1.5 7b Gguf
LLaVA is an open-source multimodal chatbot, fine-tuned on LLaMA/Vicuna and trained with GPT-generated multimodal instruction-following data.
Image-to-Text
L
granddad
13
0
Llava V1.6 Vicuna 7b
LLaVA is an open-source multimodal chatbot, fine-tuned on large language models using multimodal instruction-following data.
Text-to-Image
Transformers

L
liuhaotian
31.65k
123
Llava V1.6 34b
Apache-2.0
LLaVA is an open-source multimodal chatbot, fine-tuned based on a large language model, supporting interactions with both images and text.
Text-to-Image
L
liuhaotian
9,033
351
Llava Phi 2 3b
MIT
LLaVa-Phi-2-3B is an open-source multimodal chatbot model, fine-tuned based on the Phi-2 architecture, capable of processing image and text inputs to generate natural language responses.
Text-to-Image
Transformers English

L
marianna13
153
13
Llava V1.5 13b Lora
LLaVA is an open-source multimodal chatbot, fine-tuned from LLaMA/Vicuna and trained on GPT-generated multimodal instruction-following data.
Text-to-Image
Transformers

L
liuhaotian
143
26
Llava V1.5 13B AWQ
LLaVA is an open-source multimodal chatbot, fine-tuned on GPT-generated multimodal instruction-following data based on LLaMA/Vicuna.
Text-to-Image
Transformers

L
TheBloke
141
35
Llava V1.5 Mlp2x 336px Pretrain Vicuna 13b V1.5
LLaVA is an open-source multimodal chatbot, fine-tuned on GPT-generated multimodal instruction-following data based on LLaMA/Vicuna.
Text-to-Image
Transformers

L
liuhaotian
66
2
Llava V1.5 Mlp2x 336px Pretrain Vicuna 7b V1.5
LLaVA is an open-source multimodal chatbot, fine-tuned based on LLaMA/Vicuna and trained with GPT-generated multimodal instruction-following data.
Text-to-Image
Transformers

L
liuhaotian
173
17
Llava V1.5 7b
LLaVA is an open-source multimodal chatbot, fine-tuned based on LLaMA/Vicuna, supporting image-text interaction.
Image-to-Text
Transformers

L
liuhaotian
1.4M
448
Llava Pretrain Vicuna 7b V1.3
LLaVA is an open-source multimodal chatbot, fine-tuned on GPT-generated multimodal instruction-following data based on LLaMA/Vicuna.
Text-to-Image
Transformers

L
liuhaotian
54
1
Llava Llama 2 7b Chat Lightning Lora Preview
LLaVA is an open-source multimodal chatbot, fine-tuned based on LLaMA/Vicuna and trained with GPT-generated multimodal instruction-following data.
Text-to-Image
Transformers

L
liuhaotian
251
12
Featured Recommended AI Models