MetaMath-Llemma-7B Open-Source Mathematical Large Model - Free Deployment, Significantly Improve the Ability to Solve Mathematical Problems

Metamath Llemma 7B

Developed by meta-math

MetaMath-Llemma-7B is a math-specialized large language model fully fine-tuned on the MetaMathQA dataset based on the Llemma-7B foundation model, significantly improving mathematical problem-solving capabilities.

Large Language Model

Transformers

Open Source License:Apache-2.0 #Enhanced Mathematical Reasoning #Step-by-Step Thinking Prompts #Specialized for Education

Downloads 3,192

Release Time : 11/19/2023

Model Overview

This model focuses on solving mathematical problems, with significantly improved performance on mathematical datasets such as GSM8K and MATH through enhanced training data.

Model Features

Enhanced Mathematical Capabilities

Fine-tuned on the MetaMathQA dataset, significantly improving mathematical problem-solving abilities, with MATH performance increasing from 19.8 to 30.0.

Optimized Based on Llemma-7B

Uses the Llemma-7B model, specifically optimized for mathematics, as the foundation, showing significant performance improvements over the standard LLaMA-2-7B.

Training Data Validation

All enhanced data originates from the GSM8K and MATH training sets, excluding test set content to ensure fair evaluation.

Model Capabilities

Mathematical Problem Solving

Step-by-Step Reasoning

Mathematical Expression Processing

Use Cases

Education

Mathematical Problem Solving

Helps students understand and solve various mathematical problems

Achieves 69.2% accuracy on GSM8K

Research

Mathematical Reasoning Research

Used to study the mathematical reasoning capabilities of large language models

Achieves 30.0% accuracy on MATH

🚀 MetaMath-Llemma-7B

MetaMath-Llemma-7B is a fully fine - tuned model on the MetaMathQA datasets, based on the powerful Llemma - 7B model, aiming to enhance mathematical question - answering capabilities.

🚀 Quick Start

See our paper at https://arxiv.org/abs/2309.12284.

View the project page: [https://meta - math.github.io/](https://meta - math.github.io/)

✨ Features

All MetaMathQA data are augmented from the training sets of GSM8K and MATH. None of the augmented data is from the testing set.
MetaMath - Llemma - 7B is fully fine - tuned on the MetaMathQA datasets and based on the Llemma - 7B model, which can boost the MATH performance from 19.8 to 30.0.

📦 Installation

pip install transformers==4.35.0
pip install torch==2.0.1
pip install sentencepiece==0.1.99
pip install tokenizers==0.13.3
pip install accelerate==0.21.0
pip install bitsandbytes==0.40.0
pip install vllm
pip install fraction
pip install protobuf

💻 Usage Examples

Basic Usage

Prompting template:

"Below is an instruction that describes a task. "
"Write a response that appropriately completes the request.\n\n"
"### Instruction:\n{instruction}\n\n### Response: Let's think step by step."

where you need to use your query question to replace the {instruction}.

📚 Documentation

Experiments

Property	Details
Model Type	MetaMath-Llemma-7B, MetaMath-Mistral-7B etc.
Training Data	meta-math/MetaMathQA

Model	GSM8k Pass@1	MATH Pass@1
MPT-7B	6.8	3.0
Falcon-7B	6.8	2.3
LLaMA-1-7B	11.0	2.9
LLaMA-2-7B	14.6	2.5
MPT-30B	15.2	3.1
LLaMA-1-13B	17.8	3.9
GPT-Neo-2.7B	19.5	--
Falcon-40B	19.6	2.5
Baichuan-chat-13B	23.9	--
Vicuna-v1.3-13B	27.6	--
LLaMA-2-13B	28.7	3.9
InternLM-7B	31.2	--
ChatGLM-2-6B	32.4	--
GPT-J-6B	34.9	--
LLaMA-1-33B	35.6	3.9
LLaMA-2-34B	42.2	6.24
RFT-7B	50.3	--
LLaMA-1-65B	50.9	10.6
Qwen-7B	51.6	--
WizardMath-7B	54.9	10.7
LLaMA-2-70B	56.8	13.5
WizardMath-13B	63.9	14.0
MAmmoTH-7B (COT)	50.5	10.4
MAmmoTH-7B (POT+COT)	53.6	31.5
Arithmo-Mistral-7B	74.7	25.3
MetaMath-7B	66.5	19.8
MetaMath-13B	72.3	22.4
🔥 MetaMath-Llemma-7B	69.2	30.0
🔥 MetaMath-Mistral-7B	77.7	28.2

📄 License

This project is licensed under the Apache-2.0 license.

📚 Citation

@article{yu2023metamath,
  title={MetaMath: Bootstrap Your Own Mathematical Questions for Large Language Models},
  author={Yu, Longhui and Jiang, Weisen and Shi, Han and Yu, Jincheng and Liu, Zhengying and Zhang, Yu and Kwok, James T and Li, Zhenguo and Weller, Adrian and Liu, Weiyang},
  journal={arXiv preprint arXiv:2309.12284},
  year={2023}
}

@article{azerbayev2023llemma,
  title={Llemma: An open language model for mathematics},
  author={Azerbayev, Zhangir and Schoelkopf, Hailey and Paster, Keiran and Santos, Marco Dos and McAleer, Stephen and Jiang, Albert Q and Deng, Jia and Biderman, Stella and Welleck, Sean},
  journal={arXiv preprint arXiv:2310.10631},
  year={2023}
}

⚠️ Important Note

All MetaMathQA data are augmented from the training sets of GSM8K and MATH. None of the augmented data is from the testing set. You can check the original_question in meta-math/MetaMathQA, each item is from the GSM8K or MATH train set.

Featured Recommended AI Models

Empowering the Future, Your AI Solution Knowledge Base

English 简体中文繁體中文にほんご