Pygmalion-2 7B
An instruction-tuned Llama-2 biased towards fiction writing and conversation.
Quick Start
This README provides detailed information about the Pygmalion-2 7B model, including its features, training data, prompting methods, limitations, and acknowledgments.
Features
- Instruction-Tuned: Based on Llama-2 7B, Pygmalion-2 7B is instruction-tuned for better performance in conversation, roleplaying, and storywriting.
- Fiction and Conversation Bias: The model is biased towards fiction writing and conversation, making it suitable for entertainment purposes.
- Flexible Prompting: It supports a prompting format with three roles (<|system|>, <|user|>, <|model|>) to guide the model's responses.
Usage Examples
Basic Usage
The model uses a specific prompting format with three roles: <|system|>, <|user|>, and <|model|>.
<|system|>Enter RP mode. Pretend to be {{char}} whose persona follows:
{{persona}}
You shall reply to the user while staying in character, and generate long responses.
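The `{{char}}` and `{{persona}}` placeholders in the template above have to be replaced with concrete text before the prompt is sent to the model. A minimal sketch of that substitution in Python follows. The helper name `fill_rp_template` and the example character are hypothetical and are not part of any Pygmalion tooling.

```python
# Template copied from the system prompt shown above.
RP_TEMPLATE = (
    "<|system|>Enter RP mode. Pretend to be {{char}} whose persona follows:\n"
    "{{persona}}\n"
    "You shall reply to the user while staying in character, "
    "and generate long responses."
)


def fill_rp_template(char: str, persona: str) -> str:
    """Replace the {{char}} and {{persona}} placeholders with real values.

    Hypothetical helper: plain string substitution, no escaping is attempted.
    """
    return RP_TEMPLATE.replace("{{char}}", char).replace("{{persona}}", persona)


if __name__ == "__main__":
    # Illustrative character name and persona, invented for this example.
    print(fill_rp_template("Aria", "Aria is a cheerful ship navigator."))
```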
Advanced Usage
The system prompt can be used to enter different modes and control the reply length, allowing for more customized interactions.
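As a rough illustration, the requested reply length can be made a parameter of the system prompt. The sketch below is an assumption-laden example: `system_block` is a hypothetical helper, and only the word "long" appears in this README. Any other length word (such as "short") is an assumption the model may or may not respond to.

```python
def system_block(instructions: str, reply_length: str = "long") -> str:
    """Build a <|system|> block whose final sentence requests a reply length.

    `instructions` carries the mode-setting text (for example, the RP mode
    instructions and persona). `reply_length` is inserted verbatim; values
    other than "long" are untested assumptions.
    """
    return (
        f"<|system|>{instructions.strip()}\n"
        "You shall reply to the user while staying in character, "
        f"and generate {reply_length} responses."
    )


if __name__ == "__main__":
    # Illustrative persona text, invented for this example.
    mode_text = (
        "Enter RP mode. Pretend to be Aria whose persona follows:\n"
        "Aria is a cheerful ship navigator."
    )
    print(system_block(mode_text, reply_length="short"))
```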
Documentation
Model Details
The long-awaited release of our new models based on Llama-2 is finally here. Pygmalion-2 7B (formerly known as Metharme) is based on Llama-2 7B released by Meta AI.
The Metharme models were an experiment to get a model that is usable for conversation, roleplaying and storywriting, but which can be guided using natural language like other instruct models. After much deliberation, we concluded that the Metharme prompting format is superior to (and easier to use than) the classic Pygmalion format.
This model was trained by doing supervised fine-tuning over a mixture of regular instruction data alongside roleplay, fictional stories and conversations with synthetically generated instructions attached.
This model is freely available for both commercial and non-commercial use, as per the Llama-2 license.
Prompting
The model has been trained on prompts using three different roles, which are denoted by the following tokens: <|system|>, <|user|> and <|model|>.
The <|system|> prompt can be used to inject out-of-channel information behind the scenes, while the <|user|> prompt should be used to indicate user input. The <|model|> token should then be used to indicate that the model should generate a response. These tokens can appear multiple times and be chained to form a conversation history.
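The chaining described above can be sketched as a small Python helper that concatenates role-tagged turns and ends with a bare <|model|> token to request the next reply. `build_prompt` and `ROLE_TOKENS` are hypothetical names, and joining turns with no separator between them is an assumption. This README does not specify whitespace between turns.

```python
from typing import Iterable, Tuple

# The three role tokens named in this README.
ROLE_TOKENS = {
    "system": "<|system|>",
    "user": "<|user|>",
    "model": "<|model|>",
}


def build_prompt(turns: Iterable[Tuple[str, str]]) -> str:
    """Concatenate (role, text) turns and append <|model|> to request a reply."""
    parts = []
    for role, text in turns:
        if role not in ROLE_TOKENS:
            raise ValueError(f"unknown role: {role!r}")
        parts.append(ROLE_TOKENS[role] + text)
    # A trailing <|model|> token signals that the model should respond next.
    parts.append(ROLE_TOKENS["model"])
    # Assumption: turns are joined directly, with no separator in between.
    return "".join(parts)


if __name__ == "__main__":
    history = [
        ("system", "Enter RP mode."),
        ("user", "Hello!"),
        ("model", "Hi there."),
        ("user", "How are you?"),
    ]
    print(build_prompt(history))
```

The resulting string can then be passed to whatever text-generation backend is in use. Earlier <|model|> turns carry the model's previous replies, so the whole history stays in one prompt.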
Dataset
The dataset used to fine-tune this model includes our own PIPPA, along with several other instruction datasets, and datasets acquired from various RP forums.
Limitations and biases
The intended use-case for this model is fictional writing for entertainment purposes. Any other sort of usage is out of scope.
As such, it was not fine-tuned to be safe and harmless: the base model and this fine-tune have been trained on data known to contain profanity and texts that are lewd or otherwise offensive. It may produce socially unacceptable or undesirable text, even if the prompt itself does not include anything explicitly offensive. Outputs might often be factually wrong or misleading.
Acknowledgements
We would like to thank SpicyChat for sponsoring the training for this model.

License
This model is licensed under the Llama-2 license, allowing for both commercial and non-commercial use.