# Open Weights
Openvision Vit Huge Patch14 84
Apache-2.0
OpenVision is a fully open, cost-effective family of advanced vision encoders designed for multimodal learning.
Image Classification
Transformers

O
UCSC-VLAA
19
0
Idefics2 8b
Apache-2.0
Idefics2 is an open-source multimodal model capable of accepting arbitrary sequences of image and text inputs to generate text outputs. It shows significant improvements in OCR, document understanding, and visual reasoning.
Image-to-Text
Transformers English

I
HuggingFaceM4
14.99k
603
Featured Recommended AI Models