I

INFRL Qwen2.5 VL 72B Preview Q8 With Bf16 Output And Bf16 Embedding.gguf

Developed by GeorgyGUF
An improved multimodal vision-language model based on Qwen2.5-VL-72B-Instruct, excelling in multiple visual reasoning benchmarks
Downloads 64
Release Time : 5/10/2025

Model Overview

A multimodal model with enhanced visual reasoning capabilities, achieving state-of-the-art performance among open-source models in mathematical visual understanding tasks

Model Features

Exceptional Visual Reasoning Capabilities
Top performance in visual reasoning benchmarks such as MathVision, MathVista, and MathVerse
Reinforcement Learning Optimization
Utilizes rule-based reward reinforcement learning to enhance model performance
Leader Among Open-source Models
Outperforms commercial models like GPT4o and Gemini in multiple visual reasoning tests

Model Capabilities

Visual Question Answering
Mathematical Problem Visual Understanding
Multimodal Reasoning
Image Content Analysis

Use Cases

EdTech
Visual Math Problem Solving
Solving math problems containing diagrams and formulas
Achieved 77.8% accuracy on the MathVista test set
Research Evaluation
Vision-Language Model Benchmarking
Used to evaluate visual reasoning capabilities of multimodal models
Provides an evaluation framework consistent with LLM-Judge
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
Š 2025AIbase