Pythia-31M-Chat-v1: Open-Source Dialogue Model for Multi-Turn Conversations and Various Task Scenarios

Pythia 31M Chat V1

Developed by Felladrin

A 31-million parameter conversational model fine-tuned from EleutherAI/pythia-31m, supporting multi-turn dialogues and various task scenarios

Large Language Model

Transformers

Language: English

Open Source License: Apache-2.0

#Lightweight conversational model #Multi-domain consultation #Career development guidance

Downloads 1,532

Release Time: 1/6/2024

Model Overview

This is a conversational model fine-tuned from Pythia-31m, focusing on text generation tasks and capable of handling diverse dialogue scenarios such as career consultation and knowledge Q&A.

Model Features

Multi-scenario dialogue support

Fine-tuned on various dialogue datasets, capable of handling diverse scenarios like career consultation and knowledge Q&A

Lightweight model

Only 31 million parameters, suitable for deployment in resource-limited environments

Multi-format support

Offers GGUF and ONNX formats for easy deployment across different platforms

Model Capabilities

Multi-turn dialogue

Career consultation

Knowledge Q&A

Health advice

Technical explanations

Use Cases

Career consultation

Software development career guidance

Provides career development advice for users interested in software development

Knowledge Q&A

Quantum computing applications explained

Answers questions about potential applications of quantum computing

Health advice

Healthy lifestyle suggestions

Provides step-by-step advice for becoming a healthier individual

🚀 A Pythia Chat Model of 31M Parameters

This is a text-generation model based on the EleutherAI/pythia-31m base model. It is also available in other ML formats and was trained on multiple datasets.

🚀 Quick Start

Model Information

Base model: EleutherAI/pythia-31m
Availability in other ML formats:
- GGUF: Felladrin/gguf-Pythia-31M-Chat-v1
- ONNX: Felladrin/onnx-Pythia-31M-Chat-v1

Recommended prompt format

<|im_start|>system
{system_message}<|im_end|>
<|im_start|>user
{user_message}<|im_end|>
<|im_start|>assistant
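
As a minimal sketch of how the format above could be assembled in code, the helper below renders a list of role/content messages and leaves the final assistant turn open for the model to complete. The function name `build_prompt` and the newline after the trailing `assistant` tag are assumptions for illustration, not part of the model card.

```python
from typing import Dict, List

IM_START = "<|im_start|>"
IM_END = "<|im_end|>"


def build_prompt(messages: List[Dict[str, str]]) -> str:
    """Render messages in the recommended format, ending with an open assistant turn."""
    parts = []
    for message in messages:
        # Each turn is: start tag + role, newline, content, end tag, newline.
        parts.append(f"{IM_START}{message['role']}\n{message['content']}{IM_END}\n")
    # Assumption: the model continues right after "assistant" plus a newline.
    parts.append(f"{IM_START}assistant\n")
    return "".join(parts)


prompt = build_prompt([
    {"role": "system", "content": "You are a helpful assistant."},
    {"role": "user", "content": "Hello!"},
])
print(prompt)
```

Because the helper accepts any number of messages, earlier user and assistant turns can be passed in the same list for multi-turn conversations.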

Recommended inference parameters

penalty_alpha: 0.5
top_k: 2
repetition_penalty: 1.0016
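
The parameters above can be passed to the Transformers `generate()` method, where a non-zero `penalty_alpha` combined with `top_k` greater than 1 selects contrastive search. The following is a sketch under assumptions: the repository ID `Felladrin/Pythia-31M-Chat-v1` is inferred from the names listed above, `max_new_tokens` is an illustrative value that the model card does not specify, and the helper names are hypothetical. Recent Transformers releases may handle this decoding mode differently, so check the documentation for the installed version.

```python
def contrastive_search_kwargs(max_new_tokens: int = 128) -> dict:
    """Collect the recommended inference parameters as keyword arguments for generate()."""
    return {
        "penalty_alpha": 0.5,
        "top_k": 2,
        "repetition_penalty": 1.0016,
        "max_new_tokens": max_new_tokens,  # assumption: not given in the model card
    }


def generate_reply(prompt: str, model_id: str = "Felladrin/Pythia-31M-Chat-v1") -> str:
    """Generate a completion for an already formatted prompt (downloads the model)."""
    # Imported lazily so the helper above works without transformers installed.
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(model_id)
    model = AutoModelForCausalLM.from_pretrained(model_id)
    inputs = tokenizer(prompt, return_tensors="pt")
    output_ids = model.generate(**inputs, **contrastive_search_kwargs())
    # Decode only the newly generated tokens, not the prompt itself.
    new_tokens = output_ids[0][inputs["input_ids"].shape[1]:]
    return tokenizer.decode(new_tokens, skip_special_tokens=True)
```

Calling `generate_reply` with a prompt in the recommended format returns only the assistant's continuation.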

✨ Features

This model can be used for text generation tasks. The widget example shows its application in different scenarios such as career counseling, answering questions about quantum computing applications, and providing advice on health improvement.

💻 Usage Examples

Basic Usage

The widget examples demonstrate basic usage scenarios:

# The widget example shows different message interactions
widget = {
  "messages": [
    {
      "role": "system",
      "content": "You are a career counselor. The user will provide you with an individual looking for guidance in their professional life, and your task is to assist them in determining what careers they are most suited for based on their skills, interests, and experience. You should also conduct research into the various options available, explain the job market trends in different industries, and advice on which qualifications would be beneficial for pursuing particular fields."
    },
    {
      "role": "user",
      "content": "Heya!"
    },
    {
      "role": "assistant",
      "content": "Hi! How may I help you?"
    },
    {
      "role": "user",
      "content": "I am interested in developing a career in software engineering. What would you recommend me to do?"
    },
    # Other message examples...
  ]
}

Advanced Usage

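The following snippets show the trainer configurations and hyperparameters used to train the model, one for supervised fine-tuning (SFT) and one for direct preference optimization (DPO):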

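# Imports needed by the trainer snippets; model, tokenizer, and the datasets
# are assumed to be defined elsewhere.
from transformers import EarlyStoppingCallback, TrainingArguments
from trl import DPOTrainer, SFTTrainer

# SFT Training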
SFTTrainer(
    model,
    train_dataset=train_dataset,
    dataset_text_field="text",
    eval_dataset=eval_dataset,
    max_seq_length=2048,
    packing=True,
    args=TrainingArguments(
        learning_rate=2e-6,
        per_device_train_batch_size=1,
        per_device_eval_batch_size=1,
        gradient_accumulation_steps=16,
        lr_scheduler_type="cosine",
        num_train_epochs=1,
        logging_strategy="steps",
        save_strategy="steps",
        evaluation_strategy="steps",
        logging_steps=10,
        eval_steps=10,
        save_steps=10,
        warmup_steps=50,
        load_best_model_at_end=True,
        metric_for_best_model="eval_loss",
        greater_is_better=False,
        weight_decay=0.01,
        save_total_limit=10,
        neftune_noise_alpha=5,
    ),
    callbacks=[
        EarlyStoppingCallback(
            early_stopping_patience=3,
            early_stopping_threshold=0.005
        ),
    ],
)

# DPO Training
DPOTrainer(
    model,
    beta=0.1,
    train_dataset=dataset,
    tokenizer=tokenizer,
    eval_dataset=eval_dataset,
    max_length=1536,
    max_prompt_length=1024,
    args=TrainingArguments(
        learning_rate=2e-6,
        per_device_train_batch_size=1,
        per_device_eval_batch_size=1,
        gradient_accumulation_steps=1,
        lr_scheduler_type="cosine",
        num_train_epochs=1,
        logging_strategy="steps",
        save_strategy="steps",
        evaluation_strategy="steps",
        logging_steps=1,
        eval_steps=1,
        save_steps=1,
        warmup_steps=0,
        load_best_model_at_end=True,
        metric_for_best_model="eval_loss",
        greater_is_better=False,
        weight_decay=0.0,
        neftune_noise_alpha=5,
        remove_unused_columns=False,
    ),
    callbacks=[
        EarlyStoppingCallback(
            early_stopping_patience=3,
            early_stopping_threshold=0.005
        ),
    ],
)
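
Both configurations stop early when `eval_loss` fails to improve by more than 0.005 for three consecutive evaluations. The sketch below illustrates that rule in plain Python. It is a simplified approximation of the idea, not the Transformers implementation, and the function name `evals_until_stop` and the sample loss values are made up for illustration.

```python
from typing import List, Optional


def evals_until_stop(
    eval_losses: List[float],
    patience: int = 3,
    threshold: float = 0.005,
) -> Optional[int]:
    """Return the 1-based evaluation index at which training would stop, or None."""
    best: Optional[float] = None
    evals_without_improvement = 0
    for index, loss in enumerate(eval_losses, start=1):
        # Lower is better: count an improvement only if it exceeds the threshold.
        if best is None or (best - loss) > threshold:
            evals_without_improvement = 0
        else:
            evals_without_improvement += 1
        # Track the best loss seen so far, even for tiny improvements.
        if best is None or loss < best:
            best = loss
        if evals_without_improvement >= patience:
            return index
    return None


# Hypothetical loss curve that plateaus after the second evaluation.
stop_at = evals_until_stop([2.0, 1.5, 1.499, 1.498, 1.497])
print(stop_at)  # 5
```

With `eval_steps=10`, as in the SFT configuration, stopping at the fifth evaluation would correspond to roughly step 50.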

📚 Documentation

Datasets and parameters used for training

Property	Details
Model Type	Text Generation
Training Data	totally-not-an-llm/EverythingLM-data-V3 (mit), databricks/databricks-dolly-15k (cc-by-sa-3.0), THUDM/webglm-qa (apache-2.0), starfishmedical/webGPT_x_dolly (cc-by-sa-3.0), Amod/mental_health_counseling_conversations (openrail), sablo/oasst2_curated (apache-2.0), cognitivecomputations/wizard_vicuna_70k_unfiltered (apache-2.0), mlabonne/chatml_dpo_pairs (apache-2.0)

Open LLM Leaderboard Evaluation Results

Detailed results can be found here

Metric	Value
Avg.	19.92
AI2 Reasoning Challenge (25-shot)	22.70
HellaSwag (10-shot)	25.60
MMLU (5-shot)	23.24
TruthfulQA (0-shot)	0.00
Winogrande (5-shot)	47.99
GSM8k (5-shot)	0.00
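
The reported average is the unweighted mean of the six benchmark scores, which can be checked with a few lines of Python. The dictionary keys are shortened labels chosen here for readability.

```python
# Scores copied from the table above.
scores = {
    "ARC": 22.70,
    "HellaSwag": 25.60,
    "MMLU": 23.24,
    "TruthfulQA": 0.00,
    "Winogrande": 47.99,
    "GSM8k": 0.00,
}

# Unweighted mean across all six benchmarks.
average = sum(scores.values()) / len(scores)
print(f"{average:.2f}")  # 19.92
```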

📄 License

The model is licensed under the Apache-2.0 license.
