# Robotic Vision
Spaceqwen2.5 VL 3B Instruct
Apache-2.0
A multimodal vision-language model fine-tuned based on Qwen2.5-VL-3B-Instruct, focusing on spatial reasoning capabilities
Text-to-Image English
S
remyxai
7,446
7
Euclid Convnext Xxlarge 120524
Apache-2.0
A multimodal large language model specifically trained to enhance low-level geometric perception, improving geometric analysis capabilities through high-fidelity synthetic visual descriptions
Text-to-Image
Transformers English

E
euclid-multimodal
22
4
Featured Recommended AI Models