
Skywork R1V2 38B

Developed by Skywork
Skywork-R1V2-38B is an open-source multimodal reasoning model that reports leading results among open-source models on several benchmarks, with strong visual reasoning and text comprehension capabilities.
Downloads 1,778
Release Time: 4/25/2025

Model Overview

A high-performance open-source vision-language model combining visual reasoning and text comprehension, leading other open-source models in benchmarks such as MMMU and OlympiadBench.

Model Features

Multimodal Reasoning Capability
Scored 73.6% on the MMMU benchmark, reported as the highest among open-source models.
Outstanding Visual Understanding
Reached 62.6% on OlympiadBench, significantly outperforming other open-source models.
Comparable to Commercial Models
Performed strongly on the MathVision, MMMU-Pro, and MathVista benchmarks, approaching the results of commercial closed-source models.
Open Source Accessibility
Fully open-source, available via Hugging Face and ModelScope model repositories.
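Because the model is distributed through Hugging Face, fetching it locally can be sketched as below. This is a minimal sketch, not an official procedure: the repository identifier `Skywork/Skywork-R1V2-38B`, the helper name `download_model`, and the target directory are assumptions made for illustration and should be checked against the model page. The sketch relies on the `huggingface_hub` package and, by default, fetches only the small JSON configuration files, since the full weights of a 38B-parameter model are large.

```python
from pathlib import Path

# Assumed repository identifier; confirm it on the Hugging Face model page.
REPO_ID = "Skywork/Skywork-R1V2-38B"


def download_model(target_dir: str = "./Skywork-R1V2-38B",
                   config_only: bool = True) -> Path:
    """Fetch files for REPO_ID from the Hugging Face Hub into target_dir.

    Hypothetical helper for illustration; requires `pip install huggingface_hub`.
    """
    # Imported lazily so this module still loads without huggingface_hub installed.
    from huggingface_hub import snapshot_download

    # With config_only=True only JSON files (configs, indexes) are downloaded,
    # not the weight shards. Passing None places no filter on the download.
    patterns = ["*.json"] if config_only else None

    local_path = snapshot_download(
        repo_id=REPO_ID,
        local_dir=target_dir,
        allow_patterns=patterns,
    )
    return Path(local_path)
```

Calling `download_model()` inspects the configuration first; `download_model(config_only=False)` would pull the whole repository and needs correspondingly more disk space and time.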

Model Capabilities

Multimodal Reasoning
Visual Question Answering
Image Understanding
Complex Problem Solving
Cross-modal Information Processing

Use Cases

Education
Math Problem Solving
Analyze and solve problems containing mathematical formulas and diagrams.
Achieved 74.0% accuracy on the MathVista benchmark.
Science Problem Solving
Understand scientific charts and answer related questions.
Achieved 62.6% accuracy on the OlympiadBench benchmark.
Research
Multimodal Research
Used for cutting-edge research in vision-language models.
© 2025 AIbase