Paligemma 3b Ft Waveui 896
A UI element detection model fine-tuned from PaliGemma 3B 896-resolution weights, specializing in object detection tasks
Downloads 43
Release Time : 7/24/2024
Model Overview
This model is fine-tuned on the WaveUI dataset and excels at UI element detection, serving as a key component in building agent plans
Model Features
High-Precision UI Element Detection
Achieves an average IoU of 0.49 on the test set, significantly outperforming mainstream closed-source models
Optimized for WaveUI Dataset
Trained specifically on a dataset of approximately 80,000 annotated UI elements
896 High-Resolution Support
Fine-tuned from the 896-resolution PaliGemma model, suitable for high-precision detection needs
Model Capabilities
UI Element Detection
Object Detection
Vision-Language Understanding
Use Cases
Agent Development
UI Automation Testing
Automatically identifies and locates UI elements in application interfaces
Improves automation efficiency and accuracy in testing
Intelligent Interaction Agent
Provides interface element recognition capabilities for agents
Enhances agent interaction with graphical interfaces
Featured Recommended AI Models