Nous-Hermes-2-Mistral-7B-DPO-AWQ開源AI模型 - 經優化測試表現出色好用！

Home

Nous Hermes 2 Mistral 7B DPO AWQ

Developed by solidrust

Nous Hermes 2是基於Mistral 7B DPO的新一代旗艦級7B Hermes模型，經過DPO優化，在多個測試基準上表現優異。

大型語言模型

Transformers

EnglishOpen Source License:Apache-2.0 #GPT4級對話 #DPO優化 #7B輕量化

Downloads 84

Release Time : 2/22/2024

Model Overview

該模型是基於Mistral 7B架構的大語言模型，經過DPO（直接偏好優化）訓練，專注於指令遵循和對話生成任務。

Model Features

DPO優化

經過直接偏好優化訓練，在AGIEval、BigBench Reasoning等基準測試中表現更優

高質量訓練數據

使用100萬條GPT-4質量或更優的指令/對話數據進行訓練

AWQ量化支持

支持4位AWQ量化，在保持質量的同時提高推理效率

ChatML格式支持

使用標準化的ChatML提示模板，便於對話系統集成

Model Capabilities

文本生成

對話系統

指令遵循

推理能力

Use Cases

對話系統

智能助手

構建能夠理解複雜指令並生成自然回應的AI助手

在多個基準測試中表現優於基礎模型

教育應用

教學輔助

用於生成教學內容和解答學生問題

🚀 Nous Hermes 2 - Mistral 7B - DPO

Nous Hermes 2 - Mistral 7B - DPO 是一款文本生成模型，基於 Mistral 7B 架構，經過 DPO 優化，在多個基準測試中表現出色，能處理多種自然語言任務。

🚀 快速開始

安裝必要的包

pip install --upgrade autoawq autoawq-kernels

Python 代碼示例

from awq import AutoAWQForCausalLM
from transformers import AutoTokenizer, TextStreamer

model_path = "solidrust/Nous-Hermes-2-Mistral-7B-DPO-AWQ"
system_message = "You are Hermes, incarnated a powerful AI."

# Load model
model = AutoAWQForCausalLM.from_quantized(model_path,
                                          fuse_layers=True)
tokenizer = AutoTokenizer.from_pretrained(model_path,
                                          trust_remote_code=True)
streamer = TextStreamer(tokenizer,
                        skip_prompt=True,
                        skip_special_tokens=True)

# Convert prompt to tokens
prompt_template = """\
<|im_start|>system
{system_message}<|im_end|>
<|im_start|>user
{prompt}<|im_end|>
<|im_start|>assistant"""

prompt = "You're standing on the surface of the Earth. "\
        "You walk one mile south, one mile west and one mile north. "\
        "You end up exactly where you started. Where are you?"

tokens = tokenizer(prompt_template.format(system_message=system_message,prompt=prompt),
                  return_tensors='pt').input_ids.cuda()

# Generate output
generation_output = model.generate(tokens,
                                  streamer=streamer,
                                  max_new_tokens=512)

✨ 主要特性

微調與量化：經過微調與 4 位量化處理，採用 AWQ 方法，提高效率。
多框架支持：兼容 Transformers、PyTorch 等框架。
高質量訓練：基於 100 萬條 GPT - 4 質量或更高的指令/對話進行訓練，使用合成數據和其他高質量數據集。
特定提示模板：採用 ChatML 提示模板，便於使用。

📦 安裝指南

安裝必要的包，使用以下命令：

pip install --upgrade autoawq autoawq-kernels

💻 使用示例

基礎用法

from awq import AutoAWQForCausalLM
from transformers import AutoTokenizer, TextStreamer

model_path = "solidrust/Nous-Hermes-2-Mistral-7B-DPO-AWQ"
system_message = "You are Hermes, incarnated a powerful AI."

# Load model
model = AutoAWQForCausalLM.from_quantized(model_path,
                                          fuse_layers=True)
tokenizer = AutoTokenizer.from_pretrained(model_path,
                                          trust_remote_code=True)
streamer = TextStreamer(tokenizer,
                        skip_prompt=True,
                        skip_special_tokens=True)

# Convert prompt to tokens
prompt_template = """\
<|im_start|>system
{system_message}<|im_end|>
<|im_start|>user
{prompt}<|im_end|>
<|im_start|>assistant"""

prompt = "You're standing on the surface of the Earth. "\
        "You walk one mile south, one mile west and one mile north. "\
        "You end up exactly where you started. Where are you?"

tokens = tokenizer(prompt_template.format(system_message=system_message,prompt=prompt),
                  return_tensors='pt').input_ids.cuda()

# Generate output
generation_output = model.generate(tokens,
                                  streamer=streamer,
                                  max_new_tokens=512)

📚 詳細文檔

模型信息

屬性	詳情
模型類型	Nous-Hermes-2-Mistral-7B-DPO
基礎模型	teknium/OpenHermes-2.5-Mistral-7B
訓練數據集	teknium/OpenHermes-2.5
量化者	Suparious
模型創建者	NousResearch
推理	不支持
提示模板	'<

關於 AWQ

AWQ 是一種高效、準確且極快的低比特權重量化方法，目前支持 4 位量化。與 GPTQ 相比，它在基於 Transformer 的推理中速度更快，並且在質量上與最常用的 GPTQ 設置相當或更好。

AWQ 模型目前僅在 Linux 和 Windows 系統上支持，且僅支持 NVIDIA GPU。macOS 用戶請使用 GGUF 模型。

支持的平臺和框架包括：

Text Generation Webui - 使用 Loader: AutoAWQ
vLLM - 版本 0.2.2 或更高版本支持所有模型類型
Hugging Face Text Generation Inference (TGI)
Transformers 版本 4.35.0 及更高版本，適用於任何支持 Transformers 的代碼或客戶端
AutoAWQ - 用於 Python 代碼

提示模板：ChatML

<|im_start|>system
{system_message}<|im_end|>
<|im_start|>user
{prompt}<|im_end|>
<|im_start|>assistant

📄 許可證

本模型採用 Apache - 2.0 許可證。

BibTeX 引用

@misc{Nous-Hermes-2-Mistral-7B-DPO, 
      url={[https://huggingface.co/NousResearch/Nous-Hermes-2-Mistral-7B-DPO](https://huggingface.co/NousResearch/Nous-Hermes-2-Mistral-7B-DPO)}, 
      title={Nous Hermes 2 Mistral 7B DPO}, 
      author={"Teknium", "theemozilla", "karan4d", "huemin_art"}
}