# Phi 2 Persona-Chat
Phi 2 Persona-Chat is a LoRA fine-tune of the Phi 2 base model, designed for persona-grounded chat: it uses a set of persona facts to make responses more relevant and personalized.
## 🚀 Quick Start
Phi 2 Persona-Chat is a LoRA fine-tuned version of the base Phi 2 model, trained on the nazlicanto/persona-based-chat dataset. The dataset consists of over 64k conversations between Person A and Person B, each accompanied by a list of persona facts. The model is trained with the Supervised Fine-tuning Trainer (`SFTTrainer`), using the reference responses as target outputs. For the training and inference code and the full list of dependencies, refer to the GitHub repo.
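While the actual training script lives in the GitHub repo, a minimal sketch of this kind of setup might look as follows. Everything here is an assumption for illustration only: the dataset column names (`persona_b`, `dialogue`, `reference`), the LoRA hyperparameters, and the `SFTTrainer` argument names (which vary across TRL versions) are not taken from the repo.

```python
# Illustrative sketch of a LoRA SFT setup; not the repo's actual script.
from datasets import load_dataset
from peft import LoraConfig
from transformers import AutoModelForCausalLM, AutoTokenizer, TrainingArguments
from trl import SFTTrainer

# Persona facts plus dialogues; the column names used below are assumptions.
dataset = load_dataset("nazlicanto/persona-based-chat", split="train")

model = AutoModelForCausalLM.from_pretrained("microsoft/phi-2", trust_remote_code=True)
tokenizer = AutoTokenizer.from_pretrained("microsoft/phi-2", trust_remote_code=True)
tokenizer.pad_token = tokenizer.eos_token

def to_text(example):
    # Hypothetical formatter: fold the persona facts, the dialogue, and the
    # reference response into one training string per conversation, mirroring
    # the prompt layout shown under Usage Examples.
    persona = "\n".join(f"Persona of Person B: {fact}" for fact in example["persona_b"])
    dialogue = "\n".join(example["dialogue"])
    return {"text": f"{persona}\n{dialogue}\nOutput: {example['reference']}"}

dataset = dataset.map(to_text)

# Train LoRA adapters instead of the full model; rank/alpha are placeholders.
peft_config = LoraConfig(r=16, lora_alpha=32, lora_dropout=0.05, task_type="CAUSAL_LM")

trainer = SFTTrainer(
    model=model,
    tokenizer=tokenizer,
    train_dataset=dataset,
    dataset_text_field="text",
    peft_config=peft_config,
    max_seq_length=1024,
    args=TrainingArguments(output_dir="phi-2-persona-chat", num_train_epochs=1),
)
trainer.train()
```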
## ✨ Features
- Persona-grounded Chat: Leverage persona facts to generate more personalized and context-aware responses.
- Fine-tuned on Diverse Conversations: Trained on a dataset of over 64k conversations, covering a wide range of conversational scenarios.
## 📦 Installation
No specific installation steps are provided in the original README. To use this model, you can follow the general steps for using the `transformers` library:

```bash
pip install transformers datasets torch
```
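If you also plan to run the LoRA training code, you will likely need PEFT and TRL as well. This is an assumption based on the training setup described above; check the GitHub repo's requirements for the authoritative list:

```bash
pip install peft trl accelerate
```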
## 💻 Usage Examples
### Basic Usage
```python
import torch
from transformers import AutoTokenizer, AutoModelForCausalLM

prompt = """
Person B has the following Persona information.
Persona of Person B: My name is David and I'm a 35 year old math teacher.
Persona of Person B: I like to hike and spend time in nature.
Persona of Person B: I'm married with two kids.
Instruct: Person A and Person B are now having a conversation. Following the conversation below, write a response that Person B would say based on the above Persona information. Please carefully consider the flow and context of the conversation below, and use Person B's Persona information appropriately to generate the response that you think is the most appropriate reply for Person B.
Person A: Morning! I think I saw you at the parent meeting, what's your name?
Output:
"""

# Load the fine-tuned model and tokenizer (trust_remote_code is currently
# required for the Phi 2 architecture).
model = AutoModelForCausalLM.from_pretrained("nazlicanto/phi-2-persona-chat", trust_remote_code=True)
model.to("cuda")
tokenizer = AutoTokenizer.from_pretrained("nazlicanto/phi-2-persona-chat", trust_remote_code=True)

input_ids = tokenizer(prompt, return_tensors="pt", truncation=True).input_ids.cuda()

# Generate a response for Person B.
with torch.inference_mode():
    outputs = model.generate(
        input_ids=input_ids,
        max_new_tokens=50,
        do_sample=True,
        top_p=0.1,
        temperature=0.7,
    )

# Decode and strip the prompt so only the newly generated reply remains.
outputs = outputs.detach().cpu().numpy()
outputs = tokenizer.batch_decode(outputs, skip_special_tokens=True)
output = outputs[0][len(prompt):]
print(output)
```
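Note that `top_p=0.1` restricts sampling to a very small nucleus of high-probability tokens, so the output stays close to greedy decoding; raise `top_p` and `temperature` (see Advanced Usage) for more varied replies.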
### Advanced Usage
No advanced usage example is provided in the original README. You can further customize the `generate` parameters to your needs, for example adjusting `max_new_tokens`, `top_p`, and `temperature` to control the length and randomness of the generated text.
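For instance, a more exploratory sampling configuration might look like the following sketch. The values are illustrative only, and `model`, `tokenizer`, `prompt`, and `input_ids` are assumed to be defined as in the Basic Usage snippet:

```python
import torch

# Illustrative settings for more varied output (not recommended defaults).
with torch.inference_mode():
    outputs = model.generate(
        input_ids=input_ids,
        max_new_tokens=100,      # allow a longer reply
        do_sample=True,
        top_p=0.9,               # wider nucleus -> more varied wording
        temperature=0.9,         # higher temperature -> more randomness
        repetition_penalty=1.1,  # mildly discourage echoing the prompt
        pad_token_id=tokenizer.eos_token_id,
    )

print(tokenizer.batch_decode(outputs, skip_special_tokens=True)[0][len(prompt):])
```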
## 📚 Documentation
Please note that, at the moment, `trust_remote_code=True` is required for running the Phi 2 model. For best results, use a prompt that includes the persona facts, followed by at least one conversational turn.
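To follow that advice programmatically, a small hypothetical helper like `build_prompt` below (not part of this repo) can assemble such a prompt from persona facts and prior turns:

```python
# Hypothetical helper (not part of this repo): builds a prompt in the
# format shown under Usage Examples from persona facts and prior turns.
def build_prompt(persona_facts: list[str], turns: list[tuple[str, str]]) -> str:
    lines = ["Person B has the following Persona information."]
    lines += [f"Persona of Person B: {fact}" for fact in persona_facts]
    lines.append(
        "Instruct: Person A and Person B are now having a conversation. "
        "Following the conversation below, write a response that Person B "
        "would say based on the above Persona information."
    )
    lines += [f"{speaker}: {text}" for speaker, text in turns]
    lines.append("Output:")
    return "\n".join(lines)

prompt = build_prompt(
    ["My name is David and I'm a 35 year old math teacher."],
    [("Person A", "Morning! I think I saw you at the parent meeting, what's your name?")],
)
```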
## 📄 License
This project is licensed under the MIT License.