# Vision-Language Unified Model

GOT CPU
Apache-2.0
GOT-OCR2.0 is a multilingual, general-purpose OCR model that uses an end-to-end architecture for text recognition.
Image-to-Text, Transformers, Other
srimanth-d
960
11
Florence 2 Base Ft
MIT
Florence-2 is a vision foundation model developed by Microsoft that uses a prompt-based approach to handle a wide range of vision and vision-language tasks.
Image-to-Text, Transformers
lodestones
14
0
© 2025 AIbase