Phi-Hermes-1.3B Open-Source Text Generation Model - Free Deployment to Assist with Diverse Text Creation

Home

Phi Hermes 1.3B

Developed by teknium

Phi-1.5 model fine-tuned on the Hermes dataset, primarily used for text generation tasks

Large Language Model

Transformers

EnglishOpen Source License:Other #Phi-1.5 fine-tuning #Alpaca instruction format #English text generation

Downloads 45

Release Time : 9/13/2023

Model Overview

This is a large language model based on Microsoft's Phi-1.5 architecture, fine-tuned using the OpenHermes dataset. The dataset contains over 240,000 synthetic data points primarily generated by GPT-4.

Model Features

High-quality fine-tuning data

Fine-tuned using over 240,000 synthetic data points generated by GPT-4

Efficient inference

Based on the Phi-1.5 architecture, improving inference efficiency while maintaining performance

Alpaca-style prompts

Supports standard Alpaca-style prompt formats

Model Capabilities

Text generation

Instruction following

Content creation

Use Cases

Content generation

Social media content creation

Generate comments or posts for social media platforms like Twitter

Can generate social media content that meets requirements

Creative writing

Generate creative content such as stories and poems

Writing assistance

Text rewriting

Help users rewrite or optimize existing text

🚀 Model Card for Puffin-Phi V2

Phi-1.5 fine tuned with Hermes Dataset, offering enhanced text generation capabilities.

🚀 Quick Start

Phi does not support device_map "auto", and does not seem to want to inference in fp16, so use bf16.

Here is working code to inference, though it can be improved:

import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model = AutoModelForCausalLM.from_pretrained("teknium/Puffin-Phi-v2", trust_remote_code=True, torch_dtype=torch.bfloat16).to("cuda")
tokenizer = AutoTokenizer.from_pretrained("teknium/Puffin-Phi-v2", trust_remote_code=True, torch_dtype=torch.bfloat16)
inputs = tokenizer(f"### Instruction:\nWrite a negative review for the website, Twitter.\n### Response:\n", return_tensors="pt", return_attention_mask=False)
outputs = model.generate(**inputs, max_length=128, do_sample=True, temperature=0.2, top_p=0.9, use_cache=True, repetition_penalty=1.2, eos_token_id=tokenizer.eos_token_id)
text = tokenizer.batch_decode(outputs)[0]
print(text)

The prompt format is Alpaca, then is prompted like so:

### Instruction:
<prompt>
### Response:

✨ Features

This model was fine - tuned from Phi - 1.5 with the Hermes Dataset, leveraging a large number of synthetic datapoints.

📦 Installation

No specific installation steps are provided in the original document.

💻 Usage Examples

Basic Usage

import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model = AutoModelForCausalLM.from_pretrained("teknium/Puffin-Phi-v2", trust_remote_code=True, torch_dtype=torch.bfloat16).to("cuda")
tokenizer = AutoTokenizer.from_pretrained("teknium/Puffin-Phi-v2", trust_remote_code=True, torch_dtype=torch.bfloat16)
inputs = tokenizer(f"### Instruction:\nWrite a negative review for the website, Twitter.\n### Response:\n", return_tensors="pt", return_attention_mask=False)
outputs = model.generate(**inputs, max_length=128, do_sample=True, temperature=0.2, top_p=0.9, use_cache=True, repetition_penalty=1.2, eos_token_id=tokenizer.eos_token_id)
text = tokenizer.batch_decode(outputs)[0]
print(text)

📚 Documentation

Model Details

Model Sources

This model was trained on the OpenHermes Dataset, made by me, which is over 240,000 mostly GPT - 4 generated synthetic datapoints

image/png

Uses

Let me know!

Training Details

Training Procedure

Trained with Axolotl. View the wandb runs for all my puffin runs (this is puffin - phi - 4 on wandb): https://wandb.ai/teknium1/hermes-phi/runs/hermes-phi-1

Evaluation

image/png

📄 License

License: other

Property	Details
Pipeline Tag	text-generation
Datasets	teknium/openhermes

Featured Recommended AI Models

Empowering the Future, Your AI Solution Knowledge Base

English 简体中文繁體中文にほんご