INFRL Qwen2.5 VL 72B Preview Ggufs Fully Quantized
An improved vision-language model based on Qwen2.5-VL-72B-Instruct, excelling in multiple visual reasoning benchmarks
Downloads 230
Release Time : 5/14/2025
Model Overview
A multimodal model with enhanced visual reasoning capabilities, achieving the best performance among open-source models in mathematical visual understanding tasks
Model Features
Exceptional Visual Reasoning Capabilities
Top performance in visual reasoning benchmarks such as MathVision, MathVista, and MathVerse
Reinforcement Learning Optimization
Utilizes rule-based reward reinforcement learning to enhance visual comprehension
Multimodal Understanding
Capable of processing both visual and linguistic information for complex cross-modal reasoning
Model Capabilities
Visual Question Answering
Mathematical Problem Visual Understanding
Chart Analysis
Cross-modal Reasoning
Use Cases
EdTech
Visual Solution for Math Problems
Analyzing math problems containing diagrams and formulas
Achieved 77.8% accuracy on the MathVista test set
Scientific Research
Scientific Chart Analysis
Understanding and interpreting complex charts in research papers
Featured Recommended AI Models