WizardMath-7B-V1.1: an open-source math large language model, free to deploy for solving complex math problems

Wizardmath 7B V1.1

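Developed by WizardLMTeam

WizardMath-7B-V1.1 is a 7B math LLM trained from Mistral-7B. At release it was the state-of-the-art 7B math model, reaching 83.2 pass@1 on GSM8k and 33.0 pass@1 on MATH.

Large language model

Transformers

English #Math reasoning #Reinforced Evol-Instruct #Leading 7B model

Downloads: 175.35k

Release date: December 19, 2023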

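Model Overview

WizardMath strengthens the mathematical reasoning of large language models through Reinforced Evol-Instruct (RLEIF) and focuses on solving math problems.

Model Features

Reinforced Evol-Instruct

Uses the RLEIF method to improve the model's mathematical reasoning.

High performance

State of the art among 7B math models on GSM8k and MATH at release (December 2023).

Open source

The model and code are publicly available for research and application.

Model Capabilities

Math problem solving

Mathematical reasoning

Text generation

Use Cases

Education

Math problem solving

Helps students work through complex math problems.

Achieves 83.2 pass@1 on GSM8k.

Research

Mathematical reasoning research

Supports research on the mathematical reasoning abilities of large language models.

Achieves 33.0 pass@1 on MATH.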

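🚀 WizardMath: Empowering Mathematical Reasoning for Large Language Models via Reinforced Evol-Instruct (RLEIF)

WizardMath is a project focused on improving the mathematical reasoning of large language models. Models trained with its Reinforced Evol-Instruct (RLEIF) method perform strongly on math problem solving, which makes them a solid base for applying LLMs in mathematics.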

📢 News

[December 19, 2023] 🔥 We released WizardMath-7B-V1.1, trained from Mistral-7B. It is the state-of-the-art 7B math LLM at release, achieving 83.2 pass@1 on GSM8k and 33.0 pass@1 on MATH. You can chat with it using this [Demo].
[December 19, 2023] 🔥 WizardMath-7B-V1.1 outperforms ChatGPT 3.5, Gemini Pro, Mixtral MoE, and Claude Instant on GSM8k pass@1.
[December 19, 2023] 🔥 WizardMath-7B-V1.1 is comparable to ChatGPT 3.5 and Gemini Pro on MATH pass@1, and surpasses Mixtral MoE.

🌟 Model Performance Comparison

Model	Checkpoint	Paper	GSM8k	MATH	Demo
WizardMath-7B-V1.1	🤗 HF Link	📃 WizardMath	83.2	33.0	[Demo]
WizardMath-70B-V1.0	🤗 HF Link	📃 WizardMath	81.6	22.7
WizardMath-13B-V1.0	🤗 HF Link	📃 WizardMath	63.9	14.0
WizardMath-7B-V1.0	🤗 HF Link	📃 WizardMath	54.9	10.7

[December 19, 2023] WizardMath-7B-V1.1 compared with other open-source 7B-scale math LLMs

Model	GSM8k Pass@1	MATH Pass@1
MPT-7B	6.8	3.0
Llama 1-7B	11.0	2.9
Llama 2-7B	12.3	2.8
Yi-6B	32.6	5.8
Mistral-7B	37.8	9.1
Qwen-7B	47.8	9.3
RFT-7B	50.3	--
MAmmoTH-7B (CoT)	50.5	10.4
WizardMath-7B-V1.0	54.9	10.7
Abel-7B-001	59.7	13
MetaMath-7B	66.5	19.8
Arithmo-Mistral-7B	74.7	25.3
MetaMath-Mistral-7B	77.7	28.2
Abel-7B-002	80.4	29.5
WizardMath-7B-V1.1	83.2	33.0

[December 19, 2023] WizardMath-7B-V1.1 compared with larger open-source (30B to 70B) LLMs

Model	GSM8k Pass@1	MATH Pass@1
Llemma-34B	51.5	25.0
Minerva-62B	52.4	27.6
Llama 2-70B	56.8	13.5
DeepSeek 67B	63.4	--
Grok 33B	62.9	23.9
MAmmoTH-70B	72.4	21.1
Yi-34B	67.9	15.9
Mixtral 8x7B	74.4	28.4
MetaMath-70B	82.3	26.6
WizardMath-7B-V1.1	83.2	33.0
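
The pass@1 columns above report the percentage of test problems answered correctly with a single attempt per problem. The page does not describe the scoring procedure, so the following is only a minimal sketch, assuming one generated answer per problem and exact string matching on an already-extracted final answer. The function name `pass_at_1` and the toy data are hypothetical.

```python
def pass_at_1(predictions, references):
    """Percentage of problems whose single predicted answer equals the reference.

    Assumption: each problem has exactly one prediction and answers are
    compared as stripped strings. Real evaluations usually normalize
    numbers and expressions before comparing.
    """
    if len(predictions) != len(references):
        raise ValueError("predictions and references must have the same length")
    if not references:
        raise ValueError("need at least one problem")
    correct = sum(
        1 for pred, ref in zip(predictions, references)
        if pred.strip() == ref.strip()
    )
    return 100.0 * correct / len(references)


# Toy data for illustration only; not taken from GSM8k or MATH.
preds = ["18", "3", "70000", "540"]
refs = ["18", "3", "65000", "540"]
score = pass_at_1(preds, refs)
print(f"pass@1 = {score:.1f}")  # 3 of 4 correct -> pass@1 = 75.0
```

Read this way, a score of 83.2 on GSM8k means roughly 83 of every 100 test problems were solved on the first try.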

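⚠️ Data Contamination Check

Before training, we carefully and rigorously checked all training data and used multiple deduplication methods to verify that no GSM8k or MATH test-set data leaked into it.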
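
The page does not say which deduplication methods were used. As one common illustration of the idea, the sketch below flags any training sample that shares a word-level n-gram with a test sample. The helper names (`ngrams`, `find_contaminated`), the window size `n=8`, and the toy data are assumptions for this example, not the authors' procedure.

```python
import re


def ngrams(text, n=8):
    """Return the set of word-level n-grams of a lowercased text."""
    tokens = re.findall(r"\w+", text.lower())
    return {tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1)}


def find_contaminated(train_samples, test_samples, n=8):
    """Indices of training samples sharing at least one n-gram with the test set."""
    test_grams = set()
    for sample in test_samples:
        test_grams |= ngrams(sample, n)
    return [
        i for i, sample in enumerate(train_samples)
        if ngrams(sample, n) & test_grams
    ]


# Toy data for illustration only; not taken from GSM8k or MATH.
test_set = ["A farmer has 12 cows and buys 7 more cows every month for 3 months."]
train_set = [
    "Question: A farmer has 12 cows and buys 7 more cows every month for 3 months. How many cows?",
    "What is the sum of the first ten positive integers?",
]
flagged = find_contaminated(train_set, test_set)
print(flagged)  # [0]: only the first training sample overlaps the test set
```

Exact n-gram overlap misses paraphrased leaks, which is one reason to combine several checks rather than rely on a single one.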

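⚠️ Note on System Prompt Usage

Please use exactly the same system prompts as we do. We do not guarantee the accuracy of quantized versions.

Default version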

"Below is an instruction that describes a task. Write a response that appropriately completes the request.\n\n### Instruction:\n{instruction}\n\n### Response:"

CoT version (❗ we do not recommend the CoT prompt for simple math questions)

"Below is an instruction that describes a task. Write a response that appropriately completes the request.\n\n### Instruction:\n{instruction}\n\n### Response: Let's think step by step."

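Because the two prompts differ only in the trailing "Let's think step by step." suffix, they can be produced from a single template. The sketch below is a minimal illustration. The template text is copied from the prompts above, while the names `DEFAULT_TEMPLATE`, `COT_SUFFIX`, and `build_prompt` are hypothetical.

```python
# Template text copied from the prompts above; names are illustrative.
DEFAULT_TEMPLATE = (
    "Below is an instruction that describes a task. "
    "Write a response that appropriately completes the request.\n\n"
    "### Instruction:\n{instruction}\n\n### Response:"
)
COT_SUFFIX = " Let's think step by step."


def build_prompt(instruction, cot=False):
    """Fill the template with a question; append the CoT suffix if requested."""
    prompt = DEFAULT_TEMPLATE.format(instruction=instruction)
    return prompt + COT_SUFFIX if cot else prompt


print(build_prompt("What is 12 * 7?"))
print(build_prompt("Prove that the sum of two even numbers is even.", cot=True))
```

Keeping the template in one place makes it harder to drift from the exact wording the model was trained with.
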
💻 WizardMath Inference Demo Script

We provide the WizardMath inference demo code here.
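
The linked demo script is not reproduced on this page. As a rough stand-in, the sketch below shows one way to run the model with the Hugging Face Transformers library. It is not the authors' script. The repository id `WizardLMTeam/WizardMath-7B-V1.1` is an assumption inferred from the developer name, `generate_answer` is a hypothetical helper, and `device_map="auto"` additionally requires the `accelerate` package.

```python
# Default prompt from the system prompt section of this page.
PROMPT = (
    "Below is an instruction that describes a task. "
    "Write a response that appropriately completes the request.\n\n"
    "### Instruction:\n{instruction}\n\n### Response:"
)


def generate_answer(question,
                    model_id="WizardLMTeam/WizardMath-7B-V1.1",  # assumed repo id
                    max_new_tokens=512):
    """Greedy-decode an answer for one question (sketch, not the official demo)."""
    # Imported lazily so this file can be loaded without the heavy dependencies.
    import torch
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(model_id)
    model = AutoModelForCausalLM.from_pretrained(
        model_id,
        torch_dtype=torch.float16,  # half precision to reduce memory use
        device_map="auto",          # needs the accelerate package
    )
    inputs = tokenizer(PROMPT.format(instruction=question), return_tensors="pt")
    inputs = inputs.to(model.device)
    with torch.no_grad():
        output_ids = model.generate(
            **inputs, max_new_tokens=max_new_tokens, do_sample=False
        )
    # Drop the prompt tokens and decode only the newly generated part.
    new_tokens = output_ids[0][inputs["input_ids"].shape[1]:]
    return tokenizer.decode(new_tokens, skip_special_tokens=True).strip()


# Example call (downloads the model weights on first use):
# print(generate_answer("What is the sum of the first 10 positive integers?"))
```

Greedy decoding (`do_sample=False`) is used here so that repeated runs give the same answer, which matches the single-attempt spirit of pass@1.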

📚 Citation

If you use the data, methods, or code from this repository, please cite it.

@article{luo2023wizardmath,
  title={WizardMath: Empowering Mathematical Reasoning for Large Language Models via Reinforced Evol-Instruct},
  author={Luo, Haipeng and Sun, Qingfeng and Xu, Can and Zhao, Pu and Lou, Jianguang and Tao, Chongyang and Geng, Xiubo and Lin, Qingwei and Chen, Shifeng and Zhang, Dongmei},
  journal={arXiv preprint arXiv:2308.09583},
  year={2023}
}