🚀 AceMath: Frontier Models for Mathematical Reasoning
AceMath is a family of frontier models for mathematical reasoning, built on the Qwen2.5 family of base models. The AceMath-1.5B/7B/72B-Instruct models excel at solving English math problems with Chain-of-Thought (CoT) reasoning, while the AceMath-7B/72B-RM reward models are specialized in evaluating and scoring math solutions.
🚀 Quick Start
To quickly get started with AceMath, you can use the following code example to interact with the model:
```python
from transformers import AutoModelForCausalLM, AutoTokenizer

model_name = "nvidia/AceMath-72B-Instruct"
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForCausalLM.from_pretrained(model_name, torch_dtype="auto", device_map="auto")

prompt = "Jen enters a lottery by picking $4$ distinct numbers from $S=\\{1,2,3,\\cdots,9,10\\}.$ $4$ numbers are randomly chosen from $S.$ She wins a prize if at least two of her numbers were $2$ of the randomly chosen numbers, and wins the grand prize if all four of her numbers were the randomly chosen numbers. The probability of her winning the grand prize given that she won a prize is $\\tfrac{m}{n}$ where $m$ and $n$ are relatively prime positive integers. Find $m+n$."
messages = [{"role": "user", "content": prompt}]

# Render the conversation with the chat template and append the generation prompt.
text = tokenizer.apply_chat_template(
    messages,
    tokenize=False,
    add_generation_prompt=True
)
model_inputs = tokenizer([text], return_tensors="pt").to("cuda")

generated_ids = model.generate(
    **model_inputs,
    max_new_tokens=2048
)
# Strip the prompt tokens so only the newly generated solution remains.
generated_ids = [
    output_ids[len(input_ids):] for input_ids, output_ids in zip(model_inputs.input_ids, generated_ids)
]
response = tokenizer.batch_decode(generated_ids, skip_special_tokens=True)[0]
```
✨ Features
- Powerful Math Reasoning: The AceMath-1.5B/7B/72B-Instruct models solve a wide range of English mathematical problems using CoT reasoning.
- Accurate Solution Evaluation: The AceMath-7B/72B-RM reward models effectively evaluate and score mathematical solutions.
- High Performance: AceMath models outperform many leading proprietary and open-access math models on a variety of math reasoning benchmarks.
📦 Installation
Installation only requires a Python environment with the `transformers` library, which can be installed with:
```bash
pip install transformers
```
💻 Usage Examples
Basic Usage
The Instruct-model usage is identical to the Quick Start example above; see that section for the full snippet.
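Scoring Solutions with the Reward Models
The AceMath-7B/72B-RM reward models score candidate solutions rather than generate text. The snippet below is a minimal sketch, assuming the reward model loads as a single-logit sequence-classification head and scores a (question, candidate solution) conversation rendered with the chat template; the exact interface is documented on the RM model cards, and the `score_solution` helper name is purely illustrative.
```python
import torch
from transformers import AutoModelForSequenceClassification, AutoTokenizer

# Assumption: the reward model is exposed as a single-logit sequence-classification
# head. Check the AceMath-7B/72B-RM model cards for the exact loading interface.
rm_name = "nvidia/AceMath-7B-RM"
rm_tokenizer = AutoTokenizer.from_pretrained(rm_name)
reward_model = AutoModelForSequenceClassification.from_pretrained(
    rm_name, torch_dtype="auto", device_map="auto"
)

def score_solution(question: str, solution: str) -> float:
    """Return a scalar reward for one candidate solution (illustrative helper)."""
    messages = [
        {"role": "user", "content": question},
        {"role": "assistant", "content": solution},
    ]
    text = rm_tokenizer.apply_chat_template(messages, tokenize=False)
    inputs = rm_tokenizer([text], return_tensors="pt").to(reward_model.device)
    with torch.no_grad():
        # A single-logit head yields one score per sequence.
        return reward_model(**inputs).logits[0, 0].item()

print(score_solution("What is 2 + 2?", "2 + 2 = 4. The answer is 4."))
```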
📚 Documentation
The AceMath-1.5B/7B/72B-Instruct models are developed from the Qwen2.5-Math-1.5B/7B/72B-Base models through a multi-stage supervised fine-tuning (SFT) process: first on general-purpose SFT data, then on math-specific SFT data. All training data is released to support further research in this field.
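To illustrate the shape of this multi-stage recipe (not the authors' actual training code), here is a minimal sketch assuming a recent version of the TRL library and two hypothetical JSONL files standing in for the released general-purpose and math-specific SFT data:
```python
from datasets import load_dataset
from trl import SFTConfig, SFTTrainer

# Hypothetical dataset files standing in for the released SFT data;
# each JSONL record is assumed to hold a chat-formatted "messages" field.
general_sft = load_dataset("json", data_files="general_sft.jsonl", split="train")
math_sft = load_dataset("json", data_files="math_sft.jsonl", split="train")

# Stage 1: general-purpose SFT starting from a Qwen2.5-Math base model.
stage1 = SFTTrainer(
    model="Qwen/Qwen2.5-Math-7B",
    train_dataset=general_sft,
    args=SFTConfig(output_dir="acemath-sft-stage1", num_train_epochs=1),
)
stage1.train()
stage1.save_model("acemath-sft-stage1")

# Stage 2: math-specific SFT continuing from the stage-1 checkpoint.
stage2 = SFTTrainer(
    model="acemath-sft-stage1",
    train_dataset=math_sft,
    args=SFTConfig(output_dir="acemath-sft-stage2", num_train_epochs=1),
)
stage2.train()
stage2.save_model("acemath-sft-stage2")
```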
We recommend using the AceMath models only for solving math problems. For other tasks, we also release AceInstruct-1.5B/7B/72B, a series of general-purpose SFT models that handle code, math, and general-knowledge tasks. These models are built upon the Qwen2.5-1.5B/7B/72B-Base models.
For more information about AceMath, check our website and paper.
🔧 Technical Details
The AceMath models are built on Qwen2.5 base models. The Instruct models are fine-tuned with a multi-stage SFT process, and the RM models are designed to evaluate and score mathematical solutions.
📄 License
All models in the AceMath family are for non-commercial use only, subject to the [Terms of Use](https://openai.com/policies/row-terms-of-use/) governing the data generated by OpenAI. The AceMath models are released under the [Creative Commons Attribution-NonCommercial 4.0 International](https://spdx.org/licenses/CC-BY-NC-4.0) license.
All Resources
- AceMath Instruction Models
- AceMath Reward Models
- Evaluation & Training Data
- General Instruction Models
Benchmark Results (AceMath-Instruct + AceMath-72B-RM)
We compare AceMath to leading proprietary and open-access math models on a range of math reasoning benchmarks. Our AceMath-7B-Instruct largely outperforms the previous best-in-class Qwen2.5-Math-7B-Instruct (average pass@1: 67.2 vs. 62.9) and comes close to the performance of the 10× larger Qwen2.5-Math-72B-Instruct (67.2 vs. 68.2). Notably, our AceMath-72B-Instruct outperforms the state-of-the-art Qwen2.5-Math-72B-Instruct (71.8 vs. 68.2), GPT-4o (67.4), and Claude 3.5 Sonnet (65.6) by a clear margin. We also report the rm@8 accuracy (best of 8) achieved by our reward model, AceMath-72B-RM, which sets a new record on these reasoning benchmarks (excluding OpenAI's o1 model, which relies on scaled inference-time computation).
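rm@8 means sampling 8 candidate solutions from the Instruct model and reporting the accuracy of the single candidate the reward model scores highest. The sketch below shows this selection step; it reuses `model` and `tokenizer` from the Quick Start snippet and a `score_solution` helper like the one sketched earlier, and the sampling parameters are illustrative rather than the paper's evaluation settings.
```python
# Best-of-n (rm@8) selection: sample n candidates, keep the one the RM scores highest.
def best_of_n(question: str, n: int = 8, max_new_tokens: int = 2048) -> str:
    messages = [{"role": "user", "content": question}]
    text = tokenizer.apply_chat_template(messages, tokenize=False, add_generation_prompt=True)
    inputs = tokenizer([text], return_tensors="pt").to(model.device)
    outputs = model.generate(
        **inputs,
        do_sample=True,            # sampling produces diverse candidates
        temperature=0.7,           # illustrative value, not the paper's setting
        num_return_sequences=n,
        max_new_tokens=max_new_tokens,
    )
    # Drop the shared prompt tokens, then decode each candidate solution.
    candidates = tokenizer.batch_decode(
        outputs[:, inputs.input_ids.shape[1]:], skip_special_tokens=True
    )
    # Rank candidates with the reward model and return the top-scoring one.
    return max(candidates, key=lambda c: score_solution(question, c))
```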
Correspondence to
Zihan Liu (zihanl@nvidia.com), Yang Chen (yachen@nvidia.com), Wei Ping (wping@nvidia.com)
Citation
If you find our work helpful, we’d appreciate it if you could cite us.
```bibtex
@article{acemath2024,
  title={AceMath: Advancing Frontier Math Reasoning with Post-Training and Reward Modeling},
  author={Liu, Zihan and Chen, Yang and Shoeybi, Mohammad and Catanzaro, Bryan and Ping, Wei},
  journal={arXiv preprint},
  year={2024}
}
```