OpenMath2-Llama3.1-70B Open-Source Large Math Language Model - Accurately Solve Various Math Problems

Openmath2 Llama3.1 70B

Developed by nvidia

OpenMath2-Llama3.1-70B is a specialized large language model for mathematics, fine-tuned using the OpenMathInstruct-2 dataset based on the Llama3.1-70B-Base model.

Large Language Model

Transformers

English#Mathematical Problem Solving #High-Precision Mathematical Reasoning #Open-Source Math Fine-Tuning

Downloads 923

Release Time : 9/30/2024

Model Overview

This model focuses on mathematical problem-solving and outperforms the original Llama3.1-70B-Instruct model on multiple mathematical benchmarks.

Model Features

Enhanced Mathematical Performance

3.9% higher than Llama3.1-70B-Instruct on the MATH benchmark

Open-Source Process

Fully open-source data generation and model training pipeline

Specialized Math Capabilities

Focused on problem-solving in the field of mathematics

Model Capabilities

Mathematical Problem Solving

Mathematical Expression Calculation

Mathematical Reasoning

Use Cases

Education

Math Problem Solving

Helps students solve various math problems

Achieved 94.9% accuracy on the GSM8K test set

Academic Research

Math Competition Problem Solving

Solves problems from competitions like AMC and AIME

Scored 20/40 correct on AMC 2023

🚀 OpenMath2-Llama3.1-70B

OpenMath2-Llama3.1-70B is a model that addresses the need for high - performance in mathematical problem - solving. It is obtained by finetuning Llama3.1-70B-Base with OpenMathInstruct-2, offering enhanced mathematical capabilities compared to its base model.

🚀 Quick Start

OpenMath2-Llama3.1-70B is obtained by finetuning Llama3.1-70B-Base with OpenMathInstruct-2.

The model outperforms Llama3.1-70B-Instruct on MATH by 3.9%.

✨ Features

Performance Comparison

Model	GSM8K	MATH	AMC 2023	AIME 2024	Omni - MATH
Llama3.1-8B-Instruct	84.5	51.9	9/40	2/30	12.7
OpenMath2-Llama3.1-8B (nemo \| HF)	91.7	67.8	16/40	3/30	22.0
+ majority@256	94.1	76.1	23/40	3/30	24.6
Llama3.1-70B-Instruct	95.8	67.9	19/40	6/30	19.0
OpenMath2-Llama3.1-70B (nemo \| HF)	94.9	71.9	20/40	4/30	23.1
+ majority@256	96.0	79.6	24/40	6/30	27.6

Open - Source Resources

The pipeline used to produce the data and models is fully open - sourced!

💻 Usage Examples

Basic Usage

import transformers
import torch

model_id = "nvidia/OpenMath2-Llama3.1-70B"

pipeline = transformers.pipeline(
    "text-generation",
    model=model_id,
    model_kwargs={"torch_dtype": torch.bfloat16},
    device_map="auto",
)

messages = [
    {
        "role": "user", 
        "content": "Solve the following math problem. Make sure to put the answer (and only answer) inside \\boxed{}.\n\n" + 
        "What is the minimum value of $a^2+6a-7$?"},
]

outputs = pipeline(
    messages,
    max_new_tokens=4096,
)
print(outputs[0]["generated_text"][-1]['content'])

📚 Documentation

Model Details

Property	Details
Model Type	OpenMath2-Llama3.1-70B
Base Model	meta-llama/Llama-3.1-70B
Training Data	nvidia/OpenMathInstruct-2
Library Name	transformers
License	llama3.1

Usage Notes

Our models are trained with the same "chat format" as Llama3.1 - instruct models (same system/user/assistant tokens). Please note that these models have NOT been instruction - tuned on general data and thus might not provide good answers outside of the math domain.

We recommend using instructions in our repo to run inference with these models.

Reproducing Results

We provide all instructions to fully reproduce our results.

📄 License

By accessing this model, you are agreeing to the LLama 3.1 terms and conditions of the license, acceptable use policy and Meta’s privacy policy

📚 Citation

If you find our work useful, please consider citing us!

@article{toshniwal2024openmath2,
  title   = {OpenMathInstruct-2: Accelerating AI for Math with Massive Open-Source Instruction Data},
  author  = {Shubham Toshniwal and Wei Du and Ivan Moshkov and  Branislav Kisacanin and Alexan Ayrapetyan and Igor Gitman},
  year    = {2024},
  journal = {arXiv preprint arXiv:2410.01560}
}

⚠️ Important Note

These models have NOT been instruction - tuned on general data and thus might not provide good answers outside of the math domain.

💡 Usage Tip

We recommend using instructions in our repo to run inference with these models.

Featured Recommended AI Models

Empowering the Future, Your AI Solution Knowledge Base

English 简体中文繁體中文にほんご