Pythia-2.8B Deduped Synthetic Instruct: an open-source model for instruction-following text generation


Pythia 2.8b Deduped Synthetic Instruct

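Developed by lambdalabs

An instruction-following model fine-tuned from the deduplicated Pythia-2.8B checkpoint on a synthetic instruction dataset

Large language model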

Transformers

English | License: Apache-2.0 | #instruction-tuning #english-qa #synthetic-data-training

Downloads: 46

Released: 3/4/2023

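Model overview

This language model is the deduplicated Pythia-2.8B checkpoint fine-tuned on a synthetic instruction dataset. It is designed to generate text responses that follow the given instruction.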

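Model features

Instruction tuning

Fine-tuned on a synthetic instruction dataset to improve instruction following and response generation

Efficient inference

Inference runs in roughly 7 GB of GPU memory

Stop token support

Supports a custom stop token, which makes it easier to control the length of the generated text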

Model capabilities

Text generation

Instruction following

Question answering

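Use cases

Educational assistance

Tutorial generation

Generates step-by-step instructions, such as cooking methods

The example in this card shows detailed steps for making an omelette

Virtual assistant

Task guidance

Answers user questions about how to complete a specific task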

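🚀 Pythia 2.8B fine-tuned on synthetic instructions

This model is the pretrained EleutherAI/pythia-2.8b-deduped checkpoint fine-tuned on the Dahoas/synthetic-instruct-gptj-pairwise dataset. Given a question or instruction in English, it generates a text response that follows it.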

🚀 Quick start

Runtime requirements

Running inference with this model requires about 7 GB of GPU memory.

Code example
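As a rough sanity check on the 7 GB figure, the sketch below estimates the memory taken by the weights alone. It assumes a nominal 2.8 billion parameters (taken from the model name, not an exact count) stored as float16 at 2 bytes each. The function name `estimate_weight_memory_gb` is introduced here for illustration only.

```python
def estimate_weight_memory_gb(num_params: float, bytes_per_param: int = 2) -> float:
    """Approximate memory for the model weights alone, in GB (10^9 bytes)."""
    return num_params * bytes_per_param / 1e9


# Assumption: nominal parameter count taken from the model name "2.8B".
weights_gb = estimate_weight_memory_gb(2.8e9)  # float16 -> 2 bytes per parameter
print(f"float16 weights: ~{weights_gb:.1f} GB")
```

The weights account for most of the stated requirement. The remainder is plausibly taken up by activations, the attention cache, and GPU runtime overhead, all of which vary with prompt length and hardware.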

import torch

from transformers import AutoTokenizer, pipeline, StoppingCriteria, StoppingCriteriaList

device = torch.device("cuda:0") if torch.cuda.is_available() else torch.device("cpu")

model_name = "lambdalabs/pythia-2.8b-deduped-synthetic-instruct"
max_new_tokens = 2048
stop_token = "<|stop|>"


class KeywordsStoppingCriteria(StoppingCriteria):
    """Stop generation as soon as the last generated token is a stop token."""

    def __init__(self, keywords_ids: list):
        self.keywords = keywords_ids

    def __call__(
        self, input_ids: torch.LongTensor, scores: torch.FloatTensor, **kwargs
    ) -> bool:
        # Only the first sequence in the batch is checked.
        return input_ids[0][-1].item() in self.keywords


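tokenizer = AutoTokenizer.from_pretrained(
    model_name,
)
# Reuse the end-of-sequence token for padding.
tokenizer.pad_token = tokenizer.eos_token
# Register the stop token (add_tokens skips tokens already in the vocabulary).
tokenizer.add_tokens([stop_token])

# Look up the token id of the stop token and stop generation when it appears.
stop_ids = [tokenizer.encode(w)[0] for w in [stop_token]]
stop_criteria = KeywordsStoppingCriteria(stop_ids)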

generator = pipeline(
    "text-generation",
    model=model_name,
    tokenizer=tokenizer,  # use the tokenizer configured above
    device=device,
    max_new_tokens=max_new_tokens,
    torch_dtype=torch.float16,
    stopping_criteria=StoppingCriteriaList([stop_criteria]),
)

example = "How can I make an omelette."
text = "Question: {}\nAnswer:".format(example)

result = generator(
    text,
    num_return_sequences=1,
)

output = result[0]["generated_text"]

print(output)

Example output

Question: How can I make an omelette.
Answer:To make an omelette, start by cracking two eggs into a bowl and whisking them together. Add a splash of milk and a pinch of salt and pepper. Heat a non-stick pan over medium-high heat and add a tablespoon of butter. Once the butter has melted, pour in the egg mixture. As the eggs set, use a spatula to lift the edges and let the uncooked egg run underneath. When the eggs are cooked through and no visible liquid egg remains, top with your desired fillings and fold the omelette in half before sliding it onto a plate.<|stop|>
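The generated text contains the original prompt and ends with the stop token, so an application will usually want only the answer. The following is a minimal sketch of that post-processing in plain Python. The helper `extract_answer` and the constants are hypothetical names, and the shortened sample string stands in for a real model output.

```python
STOP_TOKEN = "<|stop|>"
ANSWER_MARKER = "Answer:"


def extract_answer(generated_text: str) -> str:
    """Return only the answer part of a 'Question ... Answer ...' completion."""
    # Keep what follows the first "Answer:" marker; fall back to the full text.
    _, marker, after = generated_text.partition(ANSWER_MARKER)
    answer = after if marker else generated_text
    # Drop the stop token and anything that follows it.
    answer = answer.split(STOP_TOKEN, 1)[0]
    return answer.strip()


# Shortened stand-in for a model output in the format shown above.
sample = (
    "Question: How can I make an omelette.\n"
    "Answer:To make an omelette, whisk two eggs.<|stop|>"
)
print(extract_answer(sample))
```

With the quick-start code, the same helper could be applied to `result[0]["generated_text"]`.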

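✨ Key features

Built on the Transformer architecture.
Fine-tuned on a synthetic instruction dataset so that it generates responses that follow the given instruction.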


📚 Documentation

Model details

Property	Details
Fine-tuned by	Lambda
Model type	Transformer-based language model
Language	English
Base model	EleutherAI/pythia-2.8b-deduped
Training dataset	Dahoas/synthetic-instruct-gptj-pairwise
Library	transformers
License	Apache 2.0
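The card names the training dataset but does not describe how its records were turned into training text. The sketch below shows one plausible formatting, inferred only from the inference prompt and the sample output ("Question: ...", "Answer:", then `<|stop|>`). The field names `prompt`, `chosen`, and `rejected`, the helper `format_training_example`, and the sample record are all assumptions for illustration, not the authors' documented procedure.

```python
STOP_TOKEN = "<|stop|>"


def format_training_example(prompt: str, chosen: str) -> str:
    """Hypothetical training text that mirrors the inference prompt format."""
    return f"Question: {prompt.strip()}\nAnswer:{chosen.strip()}{STOP_TOKEN}"


# Made-up record; the pairwise field names are an assumption about the dataset.
record = {
    "prompt": "How can I make an omelette.",
    "chosen": "Whisk two eggs and cook them in a buttered pan.",
    "rejected": "I don't know.",
}
print(format_training_example(record["prompt"], record["chosen"]))
```

In this sketch only the preferred (`chosen`) response would be used for supervised fine-tuning, and the trailing stop token is what would teach the model to emit `<|stop|>` at the end of an answer.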