# Vision-Language Unified Model
GOT CPU
Apache-2.0
GOT-OCR2.0 is a multilingual general OCR model that employs an end-to-end architecture to achieve advanced text recognition capabilities.
Image-to-Text
Transformers Other

G
srimanth-d
960
11
Florence 2 Base Ft
MIT
Florence-2 is an advanced vision foundation model developed by Microsoft, employing a prompt-based approach to handle a wide range of vision and vision-language tasks.
Image-to-Text
Transformers

F
lodestones
14
0
Featured Recommended AI Models