A

Acereason Nemotron 7B

Developed by nvidia
A math and code reasoning model trained through reinforcement learning, based on DeepSeek-R1-Distilled-Qwen-7B, excelling in mathematical and code reasoning tasks
Downloads 4,278
Release Time : 5/22/2025

Model Overview

AceReason-Nemotron-7B is a math and code reasoning model fully trained through reinforcement learning (RL), with DeepSeek-R1-Distilled-Qwen-7B as its base model. This model has achieved significant improvements in mathematical and code reasoning tasks.

Model Features

Reinforcement Learning Training
Fully trained through reinforcement learning (RL), significantly improving mathematical and code reasoning capabilities
Mathematical Reasoning Capability
Achieved 69.0% on AIME 2024 (14.5% improvement) and 53.6% on AIME 2025 (17.4% improvement)
Code Reasoning Capability
Achieved 51.8% on LiveCodeBench v5 (8% improvement) and 44.1% on LiveCodeBench v6 (7% improvement)
Training Method Innovation
First trained with pure math prompts via RL, then with pure code prompts via RL, yielding significant results

Model Capabilities

Mathematical Reasoning
Code Generation
Complex Problem Solving
Step-by-step Reasoning

Use Cases

Math Competitions
AIME Math Competition Problem Solving
Solving complex problems in AIME math competitions
Achieved 69.0% accuracy on AIME 2024
Programming Competitions
LiveCodeBench Programming Problem Solving
Solving programming problems in LiveCodeBench
Achieved 51.8% accuracy on LiveCodeBench v5
Educational Assistance
Math Learning Assistance
Helping students understand complex math concepts and problem-solving methods
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase