Nemotron Research Reasoning Qwen 1.5B

Developed by NVIDIA
An open-weight model with 1.5 billion parameters, designed for complex reasoning tasks. It performs strongly in mathematics, coding, science, and logic puzzles.
Downloads 1,236
Release Date: 5/28/2025

Model Overview

Nemotron-Research-Reasoning-Qwen-1.5B is a leading open-weight model with 1.5 billion parameters, designed for complex reasoning tasks. It is trained on diverse datasets with the ProRL algorithm and performs strongly in mathematics, coding, science, and logic puzzles.

Model Features

ProRL algorithm
Extends reinforcement learning training to more than 2,000 steps, allowing deeper exploration of reasoning strategies.
Group Relative Policy Optimization (GRPO)
ProRL builds on GRPO and introduces three key techniques: entropy collapse mitigation; decoupled clipping and dynamic sampling policy optimization (DAPO); and KL regularization with reference policy reset.
Excellent reasoning ability
Performs strongly on mathematics, coding, STEM reasoning, logic puzzles, and instruction following, significantly outperforming comparable models.
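
To make the GRPO and decoupled-clipping ideas above more concrete, here is a minimal sketch in plain Python. It is not NVIDIA's training code: the function names, the reward values, and the clip bounds `eps_low` and `eps_high` are illustrative assumptions, and KL regularization and reference policy reset are left out for brevity.

```python
from statistics import mean, pstdev


def group_relative_advantages(rewards, eps=1e-6):
    """Score each sampled answer relative to the other answers for the same prompt.

    GRPO replaces a learned value baseline with the group mean and
    standard deviation of the rewards.
    """
    mu = mean(rewards)
    sigma = pstdev(rewards)
    return [(r - mu) / (sigma + eps) for r in rewards]


def has_learning_signal(rewards):
    """Dynamic-sampling style filter: a group whose rewards are all identical
    yields zero advantages, so it contributes no gradient and can be skipped."""
    return max(rewards) != min(rewards)


def decoupled_clip_objective(ratio, advantage, eps_low=0.2, eps_high=0.4):
    """PPO-style clipped surrogate with separate lower and upper clip bounds.

    A larger upper bound lets low-probability tokens grow more before being
    clipped, which is one way to slow entropy collapse. The bound values
    here are assumptions chosen for illustration.
    """
    clipped = min(max(ratio, 1.0 - eps_low), 1.0 + eps_high)
    return min(ratio * advantage, clipped * advantage)


# Hypothetical rewards for four sampled answers to one prompt (1 = correct).
rewards = [1.0, 0.0, 0.0, 1.0]
advantages = group_relative_advantages(rewards)
keep_group = has_learning_signal(rewards)
```

In this sketch, correct answers receive a positive advantage and incorrect ones a negative advantage of equal size, while a group of all-correct or all-incorrect answers is filtered out before the update.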

Model Capabilities

Mathematical problem solving
Coding challenges
Scientific problem reasoning
Logical puzzle solving
STEM reasoning
Instruction following

Use Cases

Education
Mathematical competition problem solving
Used to solve mathematical competition problems such as AIME and AMC
Achieved 48.13% and 33.33% pass@1 on AIME24 and AIME25, respectively
Programming competition problem solving
Used to solve programming competition problems such as Codeforces
Achieved 34.50% pass@1 on the Codeforces benchmark
Research
STEM problem research
Used to solve complex problems in the STEM field
Achieved 41.78% pass@1 on the GPQA benchmark
Logical puzzle research
Used to solve complex logical puzzles
Achieved 59.06% pass@1 on the reasoning benchmark
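
The pass@1 figures above report the fraction of problems solved by a single sampled answer. The sketch below shows one common way to estimate pass@k from several samples per problem, using the standard unbiased estimator. The page does not say how many samples per problem were drawn for these benchmarks, so the sample counts and results here are hypothetical, as are the function names.

```python
from math import comb


def pass_at_k(n, c, k):
    """Unbiased pass@k estimate for one problem.

    n: number of sampled answers, c: number of correct answers,
    k: attempt budget. For k = 1 this reduces to c / n.
    """
    if not 0 <= c <= n or not 1 <= k <= n:
        raise ValueError("require 0 <= c <= n and 1 <= k <= n")
    if n - c < k:
        # Fewer than k wrong answers: any k samples include a correct one.
        return 1.0
    return 1.0 - comb(n - c, k) / comb(n, k)


def benchmark_pass_at_k(results, k=1):
    """Average the per-problem estimates over a benchmark.

    results: list of (n, c) pairs, one per problem.
    """
    scores = [pass_at_k(n, c, k) for n, c in results]
    return sum(scores) / len(scores)


# Hypothetical benchmark: three problems, 16 samples each.
hypothetical_results = [(16, 16), (16, 8), (16, 0)]
score = benchmark_pass_at_k(hypothetical_results, k=1)
print(f"pass@1 = {score:.2%}")  # prints "pass@1 = 50.00%"
```

Averaging over many samples per problem reduces the variance of the reported score compared with judging a single answer per problem.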