🚀 Pythia 2.8B Fine-Tuned on Synthetic Instructions
This model is Pythia 2.8B (deduped) fine-tuned on a synthetic instruction-following dataset. Given a question-style prompt, it generates a detailed answer, making it suitable for instruction-following text generation.
🚀 Quick Start
Requirements
Running inference with this model requires roughly 7 GB of GPU memory.
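Before loading the model, you can check that enough GPU memory is available. The following is a minimal sketch of such a check (a convenience addition, assuming a single visible CUDA device), not part of the original card:

```python
import torch

if torch.cuda.is_available():
    # Total memory of the first visible GPU, in GiB.
    total_gib = torch.cuda.get_device_properties(0).total_memory / 1024**3
    print(f"GPU 0 total memory: {total_gib:.1f} GiB")
    if total_gib < 7:
        print("Warning: less than ~7 GiB of GPU memory; inference may fail or be very slow.")
else:
    print("No CUDA device found; the example below will fall back to CPU.")
```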
Code example
```python
import torch
from transformers import AutoTokenizer, pipeline, StoppingCriteria, StoppingCriteriaList

device = torch.device("cuda:0") if torch.cuda.is_available() else torch.device("cpu")

model_name = "lambdalabs/pythia-2.8b-deduped-synthetic-instruct"
max_new_tokens = 2048
stop_token = "<|stop|>"


class KeywordsStoppingCriteria(StoppingCriteria):
    """Stop generation as soon as the last generated token is one of the given keyword ids."""

    def __init__(self, keywords_ids: list):
        self.keywords = keywords_ids

    def __call__(
        self, input_ids: torch.LongTensor, scores: torch.FloatTensor, **kwargs
    ) -> bool:
        if input_ids[0][-1] in self.keywords:
            return True
        return False


tokenizer = AutoTokenizer.from_pretrained(
    model_name,
)
tokenizer.pad_token = tokenizer.eos_token
tokenizer.add_tokens([stop_token])

# Id of the "<|stop|>" token, watched by the stopping criteria above.
stop_ids = [tokenizer.encode(w)[0] for w in [stop_token]]
stop_criteria = KeywordsStoppingCriteria(stop_ids)

generator = pipeline(
    "text-generation",
    model=model_name,
    device=device,
    max_new_tokens=max_new_tokens,
    torch_dtype=torch.float16,
    stopping_criteria=StoppingCriteriaList([stop_criteria]),
)

# The model expects prompts in the "Question: ...\nAnswer:" format.
example = "How can I make an omelette."
text = "Question: {}\nAnswer:".format(example)

result = generator(
    text,
    num_return_sequences=1,
)
output = result[0]["generated_text"]
print(output)
```
Example output
```
Question: How can I make an omelette.
Answer:To make an omelette, start by cracking two eggs into a bowl and whisking them together. Add a splash of milk and a pinch of salt and pepper. Heat a non-stick pan over medium-high heat and add a tablespoon of butter. Once the butter has melted, pour in the egg mixture. As the eggs set, use a spatula to lift the edges and let the uncooked egg run underneath. When the eggs are cooked through and no visible liquid egg remains, top with your desired fillings and fold the omelette in half before sliding it onto a plate.<|stop|>
```
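Note that the returned text still ends with the `<|stop|>` marker that the stopping criteria watch for. A minimal post-processing sketch (not part of the original card; the helper name is hypothetical) trims it before display:

```python
# Hypothetical helper: drop the "<|stop|>" marker (and anything after it) from the output.
def strip_stop_token(text: str, stop_token: str = "<|stop|>") -> str:
    return text.split(stop_token)[0].rstrip()

print(strip_stop_token(output))
```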
✨ Key Features
- Built on the Transformer architecture (Pythia 2.8B deduped), with strong performance on natural language processing tasks.
- Fine-tuned on a synthetic instruction dataset, so it can generate high-quality answers to question-style prompts.
📦 Installation
The original documentation does not provide installation steps; the code example above only assumes a working `torch` and `transformers` environment.
📚 Documentation
Model Details
Training Details
The model was trained on the Dahoas/synthetic-instruct-gptj-pairwise dataset. We split the original dataset into a training set (the first 32,000 examples) and a validation set (the remaining 1,144 examples).
The model was fine-tuned for 4 epochs. Training took 5 hours on 8× A100 80GB GPUs, with batch_size_per_gpu set to 2 (a global batch size of 16) and a learning rate of 0.00001, decayed linearly to zero by the final training step. You can view the Weights and Biases run here.
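For reference, here is a minimal sketch of the data split described above. The split indices come from this card; loading via the `datasets` library, the "train" split name, the "prompt"/"chosen" field names, and the training text template are assumptions about the upstream dataset and training setup:

```python
from datasets import load_dataset

# Split described in the card: the first 32,000 examples for training,
# the remaining 1,144 for validation. The split name and field names below
# are assumptions about the upstream dataset layout.
ds = load_dataset("Dahoas/synthetic-instruct-gptj-pairwise", split="train")
train_ds = ds.select(range(32_000))
val_ds = ds.select(range(32_000, len(ds)))


def to_text(example):
    # Render each pair in the same "Question: ...\nAnswer: ... <|stop|>" format
    # used by the inference example above (the exact training template is an assumption).
    return {"text": "Question: {}\nAnswer:{}<|stop|>".format(example["prompt"], example["chosen"])}


train_ds = train_ds.map(to_text)
val_ds = val_ds.map(to_text)
print(len(train_ds), len(val_ds))  # expected: 32000 1144
```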
📄 License
This model is released under the Apache 2.0 license. You can try a demo of the model, hosted by Lambda Cloud, via the link below.