E

Euclid Convnext Xxlarge 120524

Developed by euclid-multimodal
A multimodal large language model specifically trained to enhance low-level geometric perception, improving geometric analysis capabilities through high-fidelity synthetic visual descriptions
Downloads 22
Release Time : 12/3/2024

Model Overview

A multimodal model combining ConvNeXt visual encoder with Qwen-2.5 language model, trained on 1.6 million synthetic geometric images and Q&A pairs, excelling in precise geometric relationship detection and analysis

Model Features

High-fidelity Geometric Perception
Trained on synthetic geometric images with precise Q&A annotations, achieving millimeter-level geometric relationship recognition
Curriculum Learning Architecture
Adopts progressive training strategy, gradually improving model capabilities from simple geometric elements to complex relationships
Multimodal Fusion
Innovatively aligns ConvNeXt visual features with language model through two-layer MLP

Model Capabilities

Point-line relationship detection
Point-circle relationship detection
Angle classification
Length comparison
Geometric annotation understanding
Geometric proof verification
Geometric equation solving

Use Cases

Industrial Inspection
Mechanical Part Dimension Measurement
Automatically detects key dimensional relationships in part drawings
Achieves 90.82% accuracy in length comparison tasks
Medical Imaging
Anatomical Structure Analysis
Identifies geometric features of organs in medical images
EdTech
Geometry Proof Assistance
Verifies steps in student-submitted geometric proofs
Achieves 70.52% accuracy in proof verification tasks
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
Š 2025AIbase