A

Acereason Nemotron 14B

Developed by nvidia
AceReason-Nemotron-14B is a math and code reasoning model trained through reinforcement learning, based on DeepSeek-R1-Distilled-Qwen-14B, excelling in math and code reasoning tasks.
Downloads 7,863
Release Time : 5/20/2025

Model Overview

AceReason-Nemotron-14B is a math and code reasoning model fully trained via reinforcement learning (RL), with DeepSeek-R1-Distilled-Qwen-14B as its base model. It demonstrates outstanding performance in math and code reasoning tasks. Through extensive ablation studies, the RL training process was systematically investigated, leading to a simple yet effective method: first training on pure math prompts via RL, followed by training on pure code prompts via RL.

Model Features

Reinforcement Learning Training
Fully trained via reinforcement learning (RL), significantly enhancing math and code reasoning capabilities.
Phased Training Approach
First trained on pure math prompts via RL, then on pure code prompts via RL, optimizing model performance.
High-Performance Reasoning
Demonstrates outstanding performance on benchmarks such as AIME 2024, AIME 2025, and LiveCodeBench.

Model Capabilities

Mathematical Reasoning
Code Generation
Text Generation
Reinforcement Learning

Use Cases

Mathematical Reasoning
Math Competition Problem Solving
Solves complex math competition problems, such as those from AIME 2024 and AIME 2025.
Achieved 78.6% on AIME 2024 (8.9% improvement) and 67.4% on AIME 2025 (17.4% improvement).
Code Generation
Code Competition Problem Solving
Generates Python code to solve code competition problems.
Achieved 61.1% on LiveCodeBench v5 (8% improvement) and 54.9% on LiveCodeBench v6 (7% improvement).
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase