🚀 Phi 2 Persona Chat Model
Phi 2 Persona Chat is a LoRA fine-tuned version of the base Phi 2 model, trained on the nazlicanto/persona-based-chat dataset. The dataset contains over 64,000 conversations between a Person A and a Person B, each accompanied by a list of persona facts.
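To get a feel for the data format, you can load the dataset directly with the datasets library. The sketch below is illustrative only; the exact split and column names depend on the dataset card and may differ from what is shown here.

from datasets import load_dataset

# Load the persona-based chat dataset from the Hugging Face Hub.
dataset = load_dataset("nazlicanto/persona-based-chat")
print(dataset)  # lists the available splits and column names

# Inspect a single conversation; the split name here is an assumption.
example = dataset["train"][0]
print(example)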
The model was trained with the Supervised Fine-Tuning (SFT) Trainer, using the reference responses as target outputs. For the training and inference code, along with the full list of dependencies, please refer to the GitHub repository.
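The actual training script lives in the repository. For orientation, a LoRA fine-tune with the SFT Trainer from the trl library generally looks like the sketch below; the hyperparameters, column names, and the format_sample helper are illustrative assumptions, not the values used for this model.

import torch
from datasets import load_dataset
from peft import LoraConfig
from transformers import AutoModelForCausalLM, AutoTokenizer, TrainingArguments
from trl import SFTTrainer

# Base model; Phi 2 requires trust_remote_code=True.
model = AutoModelForCausalLM.from_pretrained(
    "microsoft/phi-2", torch_dtype=torch.float16, trust_remote_code=True
)
tokenizer = AutoTokenizer.from_pretrained("microsoft/phi-2", trust_remote_code=True)
tokenizer.pad_token = tokenizer.eos_token

def format_sample(example):
    # Hypothetical column names: join the persona facts and dialogue turns,
    # ending with the reference reply as the target "Output".
    persona = "\n".join(f"Persona of Person B: {fact}" for fact in example["persona_b"])
    dialogue = "\n".join(example["dialogue"])
    return {"text": f"{persona}\n{dialogue}\nOutput: {example['reference']}"}

dataset = load_dataset("nazlicanto/persona-based-chat", split="train").map(format_sample)

# Illustrative LoRA settings; the actual rank/alpha are in the repository.
peft_config = LoraConfig(r=16, lora_alpha=32, lora_dropout=0.05, task_type="CAUSAL_LM")

trainer = SFTTrainer(
    model=model,
    train_dataset=dataset,
    dataset_text_field="text",
    max_seq_length=1024,
    peft_config=peft_config,
    tokenizer=tokenizer,
    args=TrainingArguments(
        output_dir="phi-2-persona-chat",
        per_device_train_batch_size=2,
        learning_rate=2e-4,
        num_train_epochs=1,
    ),
)
trainer.train()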
🚀 Quick Start
Running the Model
Note that running Phi 2 models currently requires setting trust_remote_code=True. For best results, use a prompt that includes the persona facts followed by at least one turn of dialogue.
import torch
from transformers import AutoTokenizer, AutoModelForCausalLM

# Prompt format: Person B's persona facts, an instruction, the dialogue so
# far, and a trailing "Output:" cue for the model to complete.
prompt = f"""
Person B has the following Persona information.
Persona of Person B: My name is David and I'm a 35 year old math teacher.
Persona of Person B: I like to hike and spend time in the nature.
Persona of Person B: I'm married with two kids.
Instruct: Person A and Person B are now having a conversation. Following the conversation below, write a response that Person B would say base on the above Persona information. Please carefully consider the flow and context of the conversation below, and use the Person B's Persona information appropriately to generate a response that you think are the most appropriate replying for Person B.
Persona A: Morning! I think I saw you at the parent meeting, what's your name?
Output:
"""

# Load the fine-tuned model and tokenizer; trust_remote_code=True is required.
model = AutoModelForCausalLM.from_pretrained("nazlicanto/phi-2-persona-chat", trust_remote_code=True)
model.to("cuda")
tokenizer = AutoTokenizer.from_pretrained("nazlicanto/phi-2-persona-chat", trust_remote_code=True)

# Tokenize the prompt and sample a response.
input_ids = tokenizer(prompt, return_tensors="pt", truncation=True).input_ids.cuda()
with torch.inference_mode():
    outputs = model.generate(
        input_ids=input_ids,
        max_new_tokens=50,
        do_sample=True,
        top_p=0.1,
        temperature=0.7,
    )

# Decode and strip the prompt to keep only the newly generated reply.
outputs = outputs.detach().cpu().numpy()
outputs = tokenizer.batch_decode(outputs, skip_special_tokens=True)
output = outputs[0][len(prompt):]
print(output)
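Since the model is conditioned on the full dialogue history, you can continue the conversation by appending the generated reply and the next Person A turn to the prompt, then generating again. Below is a minimal sketch reusing the objects from the snippet above; the new dialogue line is of course just an example.

# Extend the prompt with the model's reply and a follow-up turn, then
# generate Person B's next response (reuses model, tokenizer, prompt, output).
prompt += output + "\nPersona A: Nice to meet you, David! Do you also teach at this school?\nOutput:\n"
input_ids = tokenizer(prompt, return_tensors="pt", truncation=True).input_ids.cuda()
with torch.inference_mode():
    outputs = model.generate(input_ids=input_ids, max_new_tokens=50, do_sample=True, top_p=0.1, temperature=0.7)
decoded = tokenizer.batch_decode(outputs.detach().cpu().numpy(), skip_special_tokens=True)
print(decoded[0][len(prompt):])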
This model was trained by nazlicanto and adirik.
📄 License
This project is licensed under the MIT License.