# Scene Graph Generation
LLaVA-SpaceSGG
Apache-2.0
LLaVA-SpaceSGG is a visual question-answering model based on LLaVA-v1.5-13b that focuses on scene graph generation. It interprets image content and generates structured scene descriptions.
Text-to-Image English
wumengyangok
36
0
Vinvl Base Image Captioning
Apache-2.0
This is Microsoft's VinVL base pre-trained model, designed for image captioning, with strong vision-language understanding capabilities.
Image-to-Text
michelecafagna26
45
1