Skywork O1 Open PRM Qwen 2.5 1.5B

Developed by Skywork
Skywork o1 Open-PRM-Qwen-2.5-1.5B is an incremental process reward model (PRM) trained on Qwen2.5-Math-1.5B-Instruct, designed to improve complex problem solving at a small model scale.
Downloads 4,368
Release Time: 11/26/2024

Model Overview

This model is part of the Skywork o1 open model series. It focuses on enhancing reasoning abilities through incremental process rewards and is particularly suited to math and code-related reasoning tasks.

Model Features

Incremental Process Reward
Enhances reasoning by scoring intermediate solution steps with a process reward model (PRM), which is particularly useful for complex, multi-step problem solving.
Small-Scale Efficiency
Achieves strong reasoning performance with only 1.5B parameters.
Mathematical Reasoning Ability
Performs exceptionally well on math datasets such as GSM8K and MATH.
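
To make the process-reward idea concrete, here is a minimal sketch of one common way step-level scores are used: picking the best of several candidate solutions. It does not call the model itself. The step scores are made up, and the names `AGGREGATORS` and `select_best` as well as the three aggregation strategies are illustrative assumptions, not the method documented for this model.

```python
from math import prod
from typing import Callable, Dict, List

# Hypothetical ways to collapse per-step scores into one score per solution.
# Which strategy (if any) suits this model is an assumption, not a documented fact.
AGGREGATORS: Dict[str, Callable[[List[float]], float]] = {
    "min": min,                           # a solution is as good as its weakest step
    "prod": prod,                         # treat step scores as independent probabilities
    "last": lambda scores: scores[-1],    # trust only the score after the final step
}


def select_best(candidates: Dict[str, List[float]], how: str = "min") -> str:
    """Return the name of the candidate whose step scores aggregate highest.

    Assumes every candidate has at least one step score.
    """
    aggregate = AGGREGATORS[how]
    return max(candidates, key=lambda name: aggregate(candidates[name]))


# Made-up step scores for two candidate solutions to the same problem.
candidates = {
    "solution_a": [0.9, 0.4, 0.8],  # shaky middle step, confident ending
    "solution_b": [0.7, 0.7, 0.6],  # consistently moderate
}

best_by_min = select_best(candidates, "min")
best_by_last = select_best(candidates, "last")
print(best_by_min, best_by_last)
```

With these illustrative numbers, the choice depends on the aggregation strategy: penalizing the weakest step favors the consistent solution, while looking only at the final step favors the one with the confident ending.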

Model Capabilities

Mathematical problem solving
Code generation
Complex reasoning
Multi-step problem solving

Use Cases

Education
Math Problem Solving
Helps students solve math problems by providing detailed step-by-step solutions.
Achieves 94.5% accuracy on the GSM8K dataset.
Programming
Code Generation and Optimization
Generates code solutions based on problem descriptions.
Performs well on code datasets such as HumanEval.