WizardCoder-Python-34B-V1.0开源代码生成模型

首页

Wizardcoder Python 7B V1.0

由 vanillaOVO 开发

WizardCoder-Python-34B-V1.0 是一个高性能的代码生成模型，基于Llama2架构，专注于Python代码生成任务。

大型语言模型

Transformers

其他#代码生成 #数学推理 #多任务问答

下载量 2,206

发布时间 : 6/19/2024

模型简介

该模型在代码生成领域表现出色，特别擅长Python代码生成，适用于开发辅助、自动化编程等场景。

模型特点

高性能代码生成

在HumanEval基准测试中达到73.2 pass@1，超越GPT4和Claude2等模型。

多尺寸模型选择

提供从1B到34B不同参数规模的模型版本，适应不同计算需求。

开源可商用

基于Llama2许可证发布，允许商业使用。

模型能力

Python代码生成

代码补全

代码解释

编程问题解答

使用案例

开发辅助

自动化代码生成

根据自然语言描述自动生成Python代码片段

在HumanEval测试中达到73.2%的正确率

编程教育

帮助学生理解编程概念和解决编程问题

软件开发

代码补全

在IDE中提供智能代码补全建议

🚀 WizardCoder与WizardLM系列模型项目

本项目主要围绕WizardCoder、WizardMath和WizardLM等一系列模型展开，这些模型在代码生成、数学推理、通用问答等多个领域展现出了卓越的性能，为自然语言处理和人工智能研究提供了强大的工具和参考。

项目说明

这是官方仓库的副本，仅用于研究目的以复现结果。若存在版权问题，请联系我们。

项目链接

🤗 HF仓库 •🐱 Github仓库 • 🐦 Twitter • 📃 [WizardLM] • 📃 [WizardCoder] • 📃 [WizardMath]

👋 加入我们的 Discord

✨ 主要特性

模型信息

属性	详情
许可证	llama2
评估指标	code_eval
库名称	transformers
标签	code

模型索引

名称：WizardCoder-Python-34B-V1.0
结果：
- 任务类型：文本生成
- 数据集：openai_humaneval（HumanEval）
- 指标：
  - 名称：pass@1
  - 类型：pass@1
  - 值：0.555
  - 验证状态：未验证

📚 详细文档

WizardCoder系列模型表现

模型	检查点	论文	HumanEval	MBPP	演示	许可证
WizardCoder-Python-34B-V1.0	🤗 HF链接	📃 [WizardCoder]	73.2	61.2	演示	Llama2
WizardCoder-15B-V1.0	🤗 HF链接	📃 [WizardCoder]	57.3	50.6	--	OpenRAIL - M
WizardCoder-Python-13B-V1.0	🤗 HF链接	📃 [WizardCoder]	64.0	55.6	--	Llama2
WizardCoder-Python-7B-V1.0	🤗 HF链接	📃 [WizardCoder]	55.5	51.6	演示	Llama2
WizardCoder-3B-V1.0	🤗 HF链接	📃 [WizardCoder]	34.8	37.4	--	OpenRAIL - M
WizardCoder-1B-V1.0	🤗 HF链接	📃 [WizardCoder]	23.8	28.6	--	OpenRAIL - M

WizardMath系列模型表现

我们的 WizardMath-70B-V1.0 模型在GSM8K基准测试中略微优于一些闭源大语言模型，包括 ChatGPT 3.5、Claude Instant 1 和 PaLM 2 540B。
我们的 WizardMath-70B-V1.0 模型在 GSM8k基准测试中达到了 81.6 pass@1，比当前最优的开源大语言模型高出 24.8 分；在 MATH基准测试中达到了 22.7 pass@1，比当前最优的开源大语言模型高出 9.2 分。

模型	检查点	论文	GSM8k	MATH	在线演示	许可证
WizardMath-70B-V1.0	🤗 HF链接	📃 [WizardMath]	81.6	22.7	演示	Llama 2
WizardMath-13B-V1.0	🤗 HF链接	📃 [WizardMath]	63.9	14.0	演示	Llama 2
WizardMath-7B-V1.0	🤗 HF链接	📃 [WizardMath]	54.9	10.7	演示	Llama 2

WizardLM系列模型表现

[08/09/2023] 我们发布了 WizardLM-70B-V1.0 模型。完整模型权重。

模型	检查点	论文	MT - Bench	AlpacaEval	GSM8k	HumanEval	许可证
WizardLM-70B-V1.0	🤗 HF链接	📃即将发布	7.78	92.91%	77.6%	50.6	Llama 2许可证
WizardLM-13B-V1.2	🤗 HF链接		7.06	89.17%	55.3%	36.6	Llama 2许可证
WizardLM-13B-V1.1	🤗 HF链接		6.76	86.32%		25.0	非商业用途
WizardLM-30B-V1.0	🤗 HF链接		7.01			37.8	非商业用途
WizardLM-13B-V1.0	🤗 HF链接		6.35	75.31%		24.0	非商业用途
WizardLM-7B-V1.0	🤗 HF链接	📃 [WizardLM]				19.1	非商业用途

模型对比

🔥 下图显示，我们的 WizardCoder-Python-34B-V1.0在该基准测试中排名第二，超越了GPT4 (2023/03/15, 73.2 vs. 67.0)、ChatGPT - 3.5 (73.2 vs. 72.5) 和Claude2 (73.2 vs. 71.2)。

提示格式

"Below is an instruction that describes a task. Write a response that appropriately completes the request.\n\n### Instruction:\n{instruction}\n\n### Response:"

推理演示脚本

我们在此处提供了推理演示代码。

📄 许可证

本项目使用llama2许可证。

🔗 引用信息

如果您使用了本仓库中的数据、方法或代码，请引用以下论文：

@article{luo2023wizardcoder,
  title={WizardCoder: Empowering Code Large Language Models with Evol-Instruct},
  author={Luo, Ziyang and Xu, Can and Zhao, Pu and Sun, Qingfeng and Geng, Xiubo and Hu, Wenxiang and Tao, Chongyang and Ma, Jing and Lin, Qingwei and Jiang, Daxin},
  journal={arXiv preprint arXiv:2306.08568},
  year={2023}
}