Nous-Hermes-2-Mistral-7B-DPO-AWQ开源AI模型 - 经优化测试表现出色好用！

首页

Nous Hermes 2 Mistral 7B DPO AWQ

由 solidrust 开发

Nous Hermes 2是基于Mistral 7B DPO的新一代旗舰级7B Hermes模型，经过DPO优化，在多个测试基准上表现优异。

大型语言模型

Transformers

英语开源协议:Apache-2.0 #GPT4级对话 #DPO优化 #7B轻量化

下载量 84

发布时间 : 2/22/2024

模型简介

该模型是基于Mistral 7B架构的大语言模型，经过DPO（直接偏好优化）训练，专注于指令遵循和对话生成任务。

模型特点

DPO优化

经过直接偏好优化训练，在AGIEval、BigBench Reasoning等基准测试中表现更优

高质量训练数据

使用100万条GPT-4质量或更优的指令/对话数据进行训练

AWQ量化支持

支持4位AWQ量化，在保持质量的同时提高推理效率

ChatML格式支持

使用标准化的ChatML提示模板，便于对话系统集成

模型能力

文本生成

对话系统

指令遵循

推理能力

使用案例

对话系统

智能助手

构建能够理解复杂指令并生成自然回应的AI助手

在多个基准测试中表现优于基础模型

教育应用

教学辅助

用于生成教学内容和解答学生问题

🚀 Nous Hermes 2 - Mistral 7B - DPO

Nous Hermes 2 - Mistral 7B - DPO 是一款文本生成模型，基于 Mistral 7B 架构，经过 DPO 优化，在多个基准测试中表现出色，能处理多种自然语言任务。

🚀 快速开始

安装必要的包

pip install --upgrade autoawq autoawq-kernels

Python 代码示例

from awq import AutoAWQForCausalLM
from transformers import AutoTokenizer, TextStreamer

model_path = "solidrust/Nous-Hermes-2-Mistral-7B-DPO-AWQ"
system_message = "You are Hermes, incarnated a powerful AI."

# Load model
model = AutoAWQForCausalLM.from_quantized(model_path,
                                          fuse_layers=True)
tokenizer = AutoTokenizer.from_pretrained(model_path,
                                          trust_remote_code=True)
streamer = TextStreamer(tokenizer,
                        skip_prompt=True,
                        skip_special_tokens=True)

# Convert prompt to tokens
prompt_template = """\
<|im_start|>system
{system_message}<|im_end|>
<|im_start|>user
{prompt}<|im_end|>
<|im_start|>assistant"""

prompt = "You're standing on the surface of the Earth. "\
        "You walk one mile south, one mile west and one mile north. "\
        "You end up exactly where you started. Where are you?"

tokens = tokenizer(prompt_template.format(system_message=system_message,prompt=prompt),
                  return_tensors='pt').input_ids.cuda()

# Generate output
generation_output = model.generate(tokens,
                                  streamer=streamer,
                                  max_new_tokens=512)

✨ 主要特性

微调与量化：经过微调与 4 位量化处理，采用 AWQ 方法，提高效率。
多框架支持：兼容 Transformers、PyTorch 等框架。
高质量训练：基于 100 万条 GPT - 4 质量或更高的指令/对话进行训练，使用合成数据和其他高质量数据集。
特定提示模板：采用 ChatML 提示模板，便于使用。

📦 安装指南

安装必要的包，使用以下命令：

pip install --upgrade autoawq autoawq-kernels

💻 使用示例

基础用法

from awq import AutoAWQForCausalLM
from transformers import AutoTokenizer, TextStreamer

model_path = "solidrust/Nous-Hermes-2-Mistral-7B-DPO-AWQ"
system_message = "You are Hermes, incarnated a powerful AI."

# Load model
model = AutoAWQForCausalLM.from_quantized(model_path,
                                          fuse_layers=True)
tokenizer = AutoTokenizer.from_pretrained(model_path,
                                          trust_remote_code=True)
streamer = TextStreamer(tokenizer,
                        skip_prompt=True,
                        skip_special_tokens=True)

# Convert prompt to tokens
prompt_template = """\
<|im_start|>system
{system_message}<|im_end|>
<|im_start|>user
{prompt}<|im_end|>
<|im_start|>assistant"""

prompt = "You're standing on the surface of the Earth. "\
        "You walk one mile south, one mile west and one mile north. "\
        "You end up exactly where you started. Where are you?"

tokens = tokenizer(prompt_template.format(system_message=system_message,prompt=prompt),
                  return_tensors='pt').input_ids.cuda()

# Generate output
generation_output = model.generate(tokens,
                                  streamer=streamer,
                                  max_new_tokens=512)

📚 详细文档

模型信息

属性	详情
模型类型	Nous-Hermes-2-Mistral-7B-DPO
基础模型	teknium/OpenHermes-2.5-Mistral-7B
训练数据集	teknium/OpenHermes-2.5
量化者	Suparious
模型创建者	NousResearch
推理	不支持
提示模板	'<

关于 AWQ

AWQ 是一种高效、准确且极快的低比特权重量化方法，目前支持 4 位量化。与 GPTQ 相比，它在基于 Transformer 的推理中速度更快，并且在质量上与最常用的 GPTQ 设置相当或更好。

AWQ 模型目前仅在 Linux 和 Windows 系统上支持，且仅支持 NVIDIA GPU。macOS 用户请使用 GGUF 模型。

支持的平台和框架包括：

Text Generation Webui - 使用 Loader: AutoAWQ
vLLM - 版本 0.2.2 或更高版本支持所有模型类型
Hugging Face Text Generation Inference (TGI)
Transformers 版本 4.35.0 及更高版本，适用于任何支持 Transformers 的代码或客户端
AutoAWQ - 用于 Python 代码

提示模板：ChatML

<|im_start|>system
{system_message}<|im_end|>
<|im_start|>user
{prompt}<|im_end|>
<|im_start|>assistant

📄 许可证

本模型采用 Apache - 2.0 许可证。

BibTeX 引用

@misc{Nous-Hermes-2-Mistral-7B-DPO, 
      url={[https://huggingface.co/NousResearch/Nous-Hermes-2-Mistral-7B-DPO](https://huggingface.co/NousResearch/Nous-Hermes-2-Mistral-7B-DPO)}, 
      title={Nous Hermes 2 Mistral 7B DPO}, 
      author={"Teknium", "theemozilla", "karan4d", "huemin_art"}
}