
Skywork VL Reward 7B

Developed by Skywork
Skywork-VL-Reward-7B is a 7B-parameter multimodal reward model built on the Qwen2.5-VL-7B-Instruct architecture, with an added value head for reward-model training.
Downloads 30
Release Time: 4/25/2025

Model Overview

This is an efficient reward model for multimodal understanding and reasoning, designed to support multimodal reinforcement learning.
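The value head mentioned in the description can be pictured as a small layer that turns a hidden-state vector into a single scalar reward. The following is a minimal sketch only, in plain Python: the `ValueHead` class, the `score_sequence` helper, and the choice to read the reward from the final token's hidden state are illustrative assumptions, not details taken from the Skywork release.

```python
import random


class ValueHead:
    """Hypothetical scalar head: maps one hidden-state vector to one reward."""

    def __init__(self, hidden_size, seed=0):
        rng = random.Random(seed)
        # Small random initial weights; a real head would be learned in training.
        self.weight = [rng.gauss(0.0, 0.02) for _ in range(hidden_size)]
        self.bias = 0.0

    def __call__(self, hidden_state):
        if len(hidden_state) != len(self.weight):
            raise ValueError("hidden_state has the wrong size")
        # Linear projection: dot(weight, hidden_state) + bias.
        return sum(w * h for w, h in zip(self.weight, hidden_state)) + self.bias


def score_sequence(hidden_states, head):
    """Score a response from the hidden state of its last token (an assumption)."""
    if not hidden_states:
        raise ValueError("hidden_states must not be empty")
    return head(hidden_states[-1])
```

In this sketch the backbone (here, whatever produces `hidden_states`) is left untouched and only the final projection differs from a text-generating model, which is the usual reason a value head is described as an add-on.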

Model Features

Multimodal Understanding
Capable of processing both image and text information for multimodal understanding and reasoning.
High Performance
Achieved SOTA results on VL-RewardBench (73.1) and RewardBench (90.1).
Open-Source Contribution
Provides a powerful multimodal reward model for the open-source community.

Model Capabilities

Multimodal Understanding
Image-Text Analysis
Reward Model Training

Use Cases

Multimodal Reinforcement Learning
Multimodal Reward Model Training
Used for training multimodal reinforcement learning models by providing reward signals.
Achieved a SOTA score of 73.1 on VL-RewardBench.
Image-Text Understanding
Image-Text Analysis
Analyzes combined image and text information to provide understanding and reasoning capabilities.
Scored 90.1 on RewardBench.
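To give a sense of what benchmark scores like these measure, here is a rough sketch that assumes the score is a pairwise preference accuracy: the percentage of comparison pairs in which the reward model scores the preferred response above the rejected one. The function name `pairwise_accuracy` and the example numbers are made up for illustration and are not results from the model.

```python
def pairwise_accuracy(pairs):
    """Fraction of (chosen_score, rejected_score) pairs ranked correctly."""
    if not pairs:
        raise ValueError("pairs must not be empty")
    # A pair counts as correct when the preferred response gets the higher reward.
    correct = sum(1 for chosen, rejected in pairs if chosen > rejected)
    return correct / len(pairs)


# Made-up reward scores: (score for preferred response, score for rejected response).
example_scores = [(2.1, 0.3), (0.5, 1.2), (1.7, 1.1), (-0.2, -0.9)]
accuracy = pairwise_accuracy(example_scores)
print(f"{accuracy:.1%}")  # prints 75.0%
```

Under this assumption, only the ordering of the two scores in each pair matters, not their absolute values.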