AIbase
Home
AI Tools
AI Models
MCP
AI NEWS
EN
Model Selection
Tags
GUI positioning

# GUI positioning

GUI Actor 2B Qwen2 VL
MIT
GUI-Actor-2B is a vision-language model based on Qwen2-VL-2B, specifically designed for graphical user interface (GUI) positioning tasks. By adding an attention-based action head and fine-tuning, it performs well in multiple GUI positioning benchmark tests.
Text-to-Image Transformers
G
microsoft
163
9
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
English简体中文繁體中文にほんご
© 2025AIbase