OpenCodeReasoning-Nemotron-32B开源大模型 - 支持长文本代码生成推理，商用免费！

首页

Opencodereasoning Nemotron 32B

由 nvidia 开发

OpenCodeReasoning-Nemotron-32B 是一个基于 Qwen2.5-32B-Instruct 的大型语言模型，专门针对代码生成推理能力进行训练，支持 32K token 的上下文长度，可自由用于商业/非商业用途。

大型语言模型

Transformers

支持多种语言开源协议:Apache-2.0 #代码生成推理 #32K长上下文 #商业授权

下载量 1,176

发布时间 : 4/15/2025

模型简介

该模型专注于代码生成和推理任务，经过专项训练以提高在编程问题解决方面的能力，特别适合处理复杂的代码生成和算法推理问题。

模型特点

32K 上下文长度

支持长达 32K token 的上下文处理能力，适合处理复杂的代码生成任务。

代码生成推理专项训练

针对代码生成和推理任务进行专项训练，提高了在编程问题解决方面的能力。

商业/非商业自由使用

模型可自由用于商业和非商业用途，无使用限制。

模型能力

代码生成

算法推理

编程问题解决

使用案例

编程教育

编程题目解答

帮助学生理解和解决编程题目，提供详细的代码解决方案。

在 LiveCodeBench 和 CodeContest 等基准测试中表现优异。

软件开发

算法实现

快速生成高效的算法实现代码，提升开发效率。

在复杂算法任务中表现出色。

🚀 OpenCodeReasoning-Nemotron-32B 概述

OpenCodeReasoning-Nemotron-32B 是一个大语言模型（LLM），它基于 Qwen2.5 - 32B - Instruct（即参考模型）衍生而来。这是一个经过推理后训练的代码生成推理模型，支持 32K 令牌的上下文长度。该模型可用于商业或非商业用途。

评估结果

✨ 主要特性

基于 Qwen2.5 - 32B - Instruct 模型进行衍生和优化。
专为代码生成推理进行后训练。
支持 32K 令牌的上下文长度。
可用于商业和非商业用途。

📦 安装指南

文档未提供具体安装步骤，故跳过此章节。

💻 使用示例

基础用法

要对编码问题进行推理，请使用以下代码：

import transformers
import torch

model_id = "nvidia/OpenCodeReasoning-Nemotron-32B"

pipeline = transformers.pipeline(
    "text-generation",
    model=model_id,
    model_kwargs={"torch_dtype": torch.bfloat16},
    device_map="auto",
)

prompt = """You are a helpful and harmless assistant. You should think step-by-step before responding to the instruction below.

Please use python programming language only.

You must use ```python for just the final solution code block with the following format:
```python
# Your code here

{user} """

messages = [ { "role": "user", "content": prompt.format(user="Write a program to calculate the sum of the first $N$ fibonacci numbers")}, ]

outputs = pipeline( messages, max_new_tokens=32768, ) print(outputs[0]["generated_text"][-1]['content'])


## 📚 详细文档

### 来自 [OpenCodeReasoning](https://arxiv.org/abs/2504.01943) 的结果
以下结果是每个基准测试进行 **64 次评估** 的平均值。

| 模型 | LiveCodeBench 平均值 | CodeContest 全部 |
| ---- | ---- | ---- |
| DeepSeek - R1 | 65.6 | 26.2 |
| QwQ - 32B | 61.3 | 20.2 |
| **蒸馏 7B+ 模型** |  |  |
| Bespoke - Stratos - 7B | 14.7 | 2.0 |
| OpenThinker - 7B | 25.5 | 5.0 |
| R1 - Distill - Qwen - 7B | 38.0 | 11.1 |
| OlympicCoder - 7B | 40.9 | 10.6 |
| **OCR - Qwen - 7B** | **48.5** | **16.3** |
| **OCR - Qwen - 7B - Instruct** | **51.3** | **18.1** |
| **蒸馏 14B+ 模型** |  |  |
| R1 - Distill - Qwen - 14B | 51.3 | 17.6 |
| **OCR - Qwen - 14B** | **57.7** | **22.6** |
| **OCR - Qwen - 14B - Instruct** | **59.4** | **23.6** |
| **蒸馏 32B+ 模型** |  |  |
| Bespoke - Stratos - 32B | 30.1 | 6.3 |
| OpenThinker - 32B | 54.1 | 16.4 |
| R1 - Distill - Qwen - 32B | 58.1 | 18.3 |
| OlympicCoder - 32B | 57.4 | 18.0 |
| **OCR - Qwen - 32B** | **61.8** | **24.6** |
| **OCR - Qwen - 32B - Instruct** | **61.7** | **24.4** |

### 复现结果
- [模型](https://huggingface.co/collections/nvidia/opencodereasoning-2-68168f37cd7c6beb1e3f92e7)
- [数据集](https://huggingface.co/datasets/nvidia/OpenCodeReasoning)
- [论文](https://arxiv.org/abs/2504.01943)

### 模型信息
| 属性 | 详情 |
| ---- | ---- |
| 模型类型 | 密集型仅解码器 Transformer 模型 |
| 网络架构 | Qwen - 32B - Instruct |
| 输入类型 | 文本 |
| 输入格式 | 字符串 |
| 输入参数 | 一维（1D） |
| 输入相关其他属性 | 上下文长度可达 32,768 个令牌 |
| 输出类型 | 文本 |
| 输出格式 | 字符串 |
| 输出参数 | 一维（1D） |
| 输出相关其他属性 | 上下文长度可达 32,768 个令牌 |
| 运行时引擎 | NeMo 2.3.0 |
| 推荐硬件微架构兼容性 | NVIDIA Ampere、NVIDIA Hopper |
| 首选/支持的操作系统 | Linux |
| 模型版本 | 1.0 (4/25/2025)，包含 OpenCodeReasoning - Nemotron - 7B、OpenCodeReasoning - Nemotron - 14B、OpenCodeReasoning - Nemotron - 32B、OpenCodeReasoning - Nemotron - 32B - IOI |
| 推理引擎 | vLLM |
| 测试硬件 | NVIDIA H100 - 80GB |

### 训练和评估数据集
#### 训练数据集
OpenCodeReasoning - Nemotron - 32B 的训练语料库是 [OpenCodeReasoning](https://huggingface.co/datasets/nvidia/OpenCodeReasoning) 数据集，它由竞赛编程问题和 DeepSeek - R1 生成的响应组成。
- 数据收集方法：混合方式（自动化、人工、合成）
- 标注方法：混合方式（自动化、人工、合成）
- 属性：来自 OpenCodeReasoning 的 736k 个样本（https://huggingface.co/datasets/nvidia/OpenCodeReasoning）

#### 评估数据集
使用下一节列出的数据集对 OpenCodeReasoning - Nemotron - 32B 进行评估。
- 数据收集方法：混合方式（自动化、人工、合成）
- 标注方法：混合方式（自动化、人工、合成）

### 许可证/使用条款
本模型的使用受 [Apache 2.0](https://huggingface.co/nvidia/OpenCode - Nemotron - 2 - 7B/blob/main/LICENSE) 条款约束。

### 部署地域
全球

### 使用场景
该模型适用于构建大语言模型的开发者和研究人员。

### 发布日期
2025 年 4 月 25 日通过 Huggingface（https://huggingface.co/nvidia/OpenCodeReasoning - Nemotron - 32B/）发布

### 参考资料
[2504.01943] OpenCodeReasoning: Advancing Data Distillation for Competitive Coding

### 伦理考量
NVIDIA 认为可信 AI 是一项共同责任，已制定相关政策和实践，以支持广泛的 AI 应用开发。开发者在按照服务条款下载或使用此模型时，应与内部模型团队合作，确保该模型符合相关行业和使用场景的要求，并解决意外的产品滥用问题。请在此报告安全漏洞或 NVIDIA AI 相关问题。

### 引用
如果您觉得这些数据有用，请引用：

@article{ahmad2025opencodereasoning, title={OpenCodeReasoning: Advancing Data Distillation for Competitive Coding}, author={Wasi Uddin Ahmad, Sean Narenthiran, Somshubra Majumdar, Aleksander Ficek, Siddhartha Jain, Jocelyn Huang, Vahid Noroozi, Boris Ginsburg}, year={2025}, eprint={2504.01943}, archivePrefix={arXiv}, primaryClass={cs.CL}, url={https://arxiv.org/abs/2504.01943}, }