h2ogpt-gm-oasst1-en-2048-open-llama-3b开源大语言模型

首页

H2ogpt Gm Oasst1 En 2048 Open Llama 3b

由 h2oai 开发

基于OpenAssistant/oasst1数据集微调的3B参数大语言模型，使用H2O LLM Studio训练

大型语言模型

Transformers

英语开源协议:Apache-2.0 #英语对话生成 #指令微调模型 #3B参数量级

下载量 139

发布时间 : 6/28/2023

模型简介

该模型是基于Open Llama 3B架构微调的大语言模型，专注于英语文本生成任务，采用Apache-2.0许可证发布。

模型特点

高效微调

使用H2O LLM Studio工具对基础模型进行高效微调

对话优化

基于OpenAssistant对话数据集优化，适合对话场景

开源许可

采用Apache-2.0许可证，允许商业用途

模型能力

文本生成

对话系统

问答系统

使用案例

对话系统

智能客服

用于构建自动客服对话系统

内容生成

文章创作

辅助生成各类文本内容

🚀 模型卡片

本模型基于H2O LLM Studio训练，使用了特定的基础模型和数据集，可借助transformers库在GPU机器上使用。以下将详细介绍模型的相关信息，包括使用方法、架构、配置等。

🚀 快速开始

本模型使用 H2O LLM Studio 进行训练。

基础模型：openlm-research/open_llama_3b
数据集准备：OpenAssistant/oasst1 个性化处理

✨ 主要特性

基于特定基础模型：以openlm-research/open_llama_3b为基础模型进行训练。
使用特定数据集：使用OpenAssistant/oasst1数据集，并进行了个性化处理。
支持transformers库：可方便地在支持GPU的机器上使用transformers库调用该模型。

📦 安装指南

要在配备GPU的机器上使用transformers库调用此模型，首先要确保已安装transformers、accelerate和torch库。

pip install transformers==4.30.2
pip install accelerate==0.20.3
pip install torch==2.0.0

💻 使用示例

基础用法

import torch
from transformers import pipeline

generate_text = pipeline(
    model="h2oai/h2ogpt-gm-oasst1-en-2048-open-llama-3b",
    torch_dtype="auto",
    trust_remote_code=True,
    use_fast=False,
    device_map={"": "cuda:0"},
)

res = generate_text(
    "Why is drinking water so healthy?",
    min_new_tokens=2,
    max_new_tokens=1024,
    do_sample=False,
    num_beams=1,
    temperature=float(0.3),
    repetition_penalty=float(1.2),
    renormalize_logits=True
)
print(res[0]["generated_text"])

你可以在预处理步骤后打印一个示例提示，以查看它是如何输入到分词器中的：

print(generate_text.preprocess("Why is drinking water so healthy?")["prompt_text"])

<|prompt|>Why is drinking water so healthy?</s><|answer|>

高级用法

你也可以从加载的模型和分词器自行构建管道，并考虑预处理步骤：

from transformers import AutoModelForCausalLM, AutoTokenizer

model_name = "h2oai/h2ogpt-gm-oasst1-en-2048-open-llama-3b"  # either local folder or huggingface model name
# Important: The prompt needs to be in the same format the model was trained with.
# You can find an example prompt in the experiment logs.
prompt = "<|prompt|>How are you?</s><|answer|>"

tokenizer = AutoTokenizer.from_pretrained(
    model_name,
    use_fast=False,
    trust_remote_code=True,
)
model = AutoModelForCausalLM.from_pretrained(
    model_name,
    torch_dtype="auto",
    device_map={"": "cuda:0"},
    trust_remote_code=True,
)
model.cuda().eval()
inputs = tokenizer(prompt, return_tensors="pt", add_special_tokens=False).to("cuda")

# generate configuration can be modified to your needs
tokens = model.generate(
    **inputs,
    min_new_tokens=2,
    max_new_tokens=1024,
    do_sample=False,
    num_beams=1,
    temperature=float(0.3),
    repetition_penalty=float(1.2),
    renormalize_logits=True
)[0]

tokens = tokens[inputs["input_ids"].shape[1]:]
answer = tokenizer.decode(tokens, skip_special_tokens=True)
print(answer)

📚 详细文档

模型架构

LlamaForCausalLM(
  (model): LlamaModel(
    (embed_tokens): Embedding(32000, 3200, padding_idx=0)
    (layers): ModuleList(
      (0-25): 26 x LlamaDecoderLayer(
        (self_attn): LlamaAttention(
          (q_proj): Linear(in_features=3200, out_features=3200, bias=False)
          (k_proj): Linear(in_features=3200, out_features=3200, bias=False)
          (v_proj): Linear(in_features=3200, out_features=3200, bias=False)
          (o_proj): Linear(in_features=3200, out_features=3200, bias=False)
          (rotary_emb): LlamaRotaryEmbedding()
        )
        (mlp): LlamaMLP(
          (gate_proj): Linear(in_features=3200, out_features=8640, bias=False)
          (down_proj): Linear(in_features=8640, out_features=3200, bias=False)
          (up_proj): Linear(in_features=3200, out_features=8640, bias=False)
          (act_fn): SiLUActivation()
        )
        (input_layernorm): LlamaRMSNorm()
        (post_attention_layernorm): LlamaRMSNorm()
      )
    )
    (norm): LlamaRMSNorm()
  )
  (lm_head): Linear(in_features=3200, out_features=32000, bias=False)
)

模型配置

本模型使用H2O LLM Studio进行训练，并采用了 cfg.yaml 中的配置。访问 H2O LLM Studio 以了解如何训练你自己的大语言模型。

🔧 技术细节

本模型基于openlm-research/open_llama_3b基础模型，使用OpenAssistant/oasst1数据集进行个性化训练。在训练过程中，通过特定的配置和预处理步骤，使得模型能够更好地适应特定的任务和数据。模型架构为LlamaForCausalLM，包含多个LlamaDecoderLayer，通过自注意力机制和多层感知机进行特征提取和文本生成。

📄 许可证

本模型采用 Apache-2.0 许可证。

⚠️ 免责声明

在使用本仓库提供的大语言模型之前，请仔细阅读本免责声明。使用该模型即表示你同意遵守以下条款和条件。

偏差与冒犯性：大语言模型是在各种互联网文本数据上进行训练的，这些数据可能包含有偏差、种族主义、冒犯性或其他不适当的内容。使用此模型即表示你承认并接受生成的内容有时可能会表现出偏差或产生冒犯性或不适当的内容。本仓库的开发者不认可、支持或推广任何此类内容或观点。
局限性：大语言模型是基于人工智能的工具，而非人类。它可能会产生不正确、无意义或不相关的回复。用户有责任批判性地评估生成的内容，并自行决定是否使用。
自担风险：使用此大语言模型的用户必须对使用该工具可能产生的任何后果承担全部责任。本仓库的开发者和贡献者不对因使用或滥用所提供的模型而导致的任何损害、损失或伤害承担责任。
道德考量：鼓励用户负责任且合乎道德地使用大语言模型。使用此模型即表示你同意不将其用于宣扬仇恨言论、歧视、骚扰或任何形式的非法或有害活动。
问题报告：如果你遇到大语言模型生成的任何有偏差、冒犯性或其他不适当的内容，请通过提供的渠道向仓库维护者报告。你的反馈将有助于改进模型并减轻潜在问题。
免责声明的变更：本仓库的开发者保留随时修改或更新本免责声明的权利，无需事先通知。用户有责任定期查看免责声明，以了解任何变更。

使用本仓库提供的大语言模型即表示你同意接受并遵守本免责声明中规定的条款和条件。如果你不同意本免责声明的任何部分，则应避免使用该模型及其生成的任何内容。