ConvoSenseGenerator Open-source Model - Free Generation of 10 Common Social Common Sense Inferences for Dialogue Contexts

Convosensegenerator

Developed by sefinch

ConvoSenseGenerator is a generative model capable of producing commonsense reasoning for dialogue contexts, covering 10 common types of social commonsense.

Dialogue System

Transformers

EnglishOpen Source License:Apache-2.0 #Multi-type commonsense reasoning #Dialogue context understanding #High-diversity generation

Downloads 32

Release Time : 1/24/2024

Model Overview

Based on the T5-3b architecture, this model can generate high-quality commonsense reasoning for dialogue history, including emotional reactions, motivations, causes, subsequent events, and more.

Model Features

Multi-type commonsense reasoning

Supports 10 different types of commonsense reasoning, including emotional reactions, motivations, cause analysis, etc.

High-quality generation

Generated reasoning is rated by humans as highly plausible, with high novelty information rate and high detail level.

Diverse generation

Supports diverse outputs through beam search and diversity penalty parameters.

Model Capabilities

Dialogue context understanding

Commonsense reasoning generation

Multi-type reasoning output

Diverse result generation

Use Cases

Dialogue system enhancement

Chatbot commonsense enhancement

Adds commonsense reasoning capabilities to chatbots, making their responses more reasonable.

Improves the naturalness and plausibility of dialogues.

Social analysis

Dialogue emotion analysis

Analyzes implicit emotions and motivations in dialogues.

Deeper understanding of the psychological states of dialogue participants.

🚀 ConvoSenseGenerator: A Commonsense Inference Model

ConvoSenseGenerator is a generative model designed to generate commonsense inferences for dialogue contexts. It covers 10 common social commonsense types, including emotional reactions, motivations, causes, subsequent events, and more. This model is trained on the large - scale dataset ConvoSense, which is synthetically collected using ChatGPT3.5. The inferences produced by ConvoSenseGenerator are highly reasonable, rich in novel information for corresponding dialogue contexts, and detailed, outperforming models trained on previous human - written datasets.

🚀 Quick Start

ConvoSenseGenerator is a powerful tool for generating commonsense inferences in dialogue contexts. Here's how you can get started with it.

✨ Features

Diverse Commonsense Coverage: Covers 10 common social commonsense types.
High - Quality Inferences: Produces inferences with high reasonability, novel information, and detail.
Trained on Large - Scale Dataset: Trained on the ConvoSense dataset collected synthetically using ChatGPT3.5.

📦 Installation

No specific installation steps are provided in the original document.

💻 Usage Examples

Basic Usage

ConvoSenseGenerator covers the following commonsense types, using the provided questions:

commonsense_questions = {
    "cause": 'What could have caused the last thing said to happen?', 
    "prerequisities": 'What prerequisites are required for the last thing said to occur?', 
    "motivation": 'What is an emotion or human drive that motivates Speaker based on what they just said?', 
    "subsequent": 'What might happen after what Speaker just said?', 
    "desire": 'What does Speaker want to do next?',
    "desire_o": 'What will Listener want to do next based on what Speaker just said?',
    "react": 'How is Speaker feeling after what they just said?',
    "react_o": 'How does Listener feel because of what Speaker just said?',
    "attribute": 'What is a likely characteristic of Speaker based on what they just said?',
    "constituents": 'What is a breakdown of the last thing said into a series of required subevents?' 
}

The best - performing configuration of ConvoSenseGenerator according to the experiments in the paper uses the following generation hyperparameters:

generation_config = {
    "repetition_penalty": 1.0,
    "num_beams": 10,
    "num_beam_groups": 10,
    "diversity_penalty": 0.5
}

Below is a simple code snippet to get ConvoSenseGenerator running:

import torch
from transformers import AutoTokenizer, T5ForConditionalGeneration

device = torch.device("cuda" if torch.cuda.is_available() else "cpu")
tokenizer = AutoTokenizer.from_pretrained("sefinch/ConvoSenseGenerator")
model = T5ForConditionalGeneration.from_pretrained("sefinch/ConvoSenseGenerator").to(device)

# ConvoSenseGenerator covers these commonsense types, using the provided questions
commonsense_questions = {
    "cause": 'What could have caused the last thing said to happen?', 
    "prerequisities": 'What prerequisites are required for the last thing said to occur?', 
    "motivation": 'What is an emotion or human drive that motivates Speaker based on what they just said?', 
    "subsequent": 'What might happen after what Speaker just said?', 
    "desire": 'What does Speaker want to do next?',
    "desire_o": 'What will Listener want to do next based on what Speaker just said?',
    "react": 'How is Speaker feeling after what they just said?',
    "react_o": 'How does Listener feel because of what Speaker just said?',
    "attribute": 'What is a likely characteristic of Speaker based on what they just said?',
    "constituents": 'What is a breakdown of the last thing said into a series of required subevents?' 
}

def format_input(conversation_history, commonsense_type):

    # prefix last turn with Speaker, and alternately prefix each previous turn with either Listener or Speaker
    prefixed_turns = list(
        reversed(
            [
                f"{'Speaker' if i % 2 == 0 else 'Listener'}: {u}"
                for i, u in enumerate(reversed(conversation_history))
            ]
        )
    )

    # model expects a maximum of 7 total conversation turns to be given
    truncated_turns = prefixed_turns[-7:]

    # conversation representation separates the turns with newlines
    conversation_string = '\n'.join(truncated_turns)

    # format the full input including the commonsense question
    input_text = f"provide a reasonable answer to the question based on the dialogue:\n{conversation_string}\n\n[Question] {commonsense_questions[commonsense_type]}\n[Answer]"

    return input_text

def generate(conversation_history, commonsense_type):
    # convert the input into the expected format to run the model
    input_text = format_input(conversation_history, commonsense_type) 

    # tokenize the input_text
    inputs = tokenizer([input_text], return_tensors="pt").to(device)

    # get multiple model generations using the best-performing generation configuration (based on experiments detailed in paper)
    outputs = model.generate(
        inputs["input_ids"],
        repetition_penalty=1.0,
        num_beams=10,
        num_beam_groups=10,
        diversity_penalty=0.5,
        num_return_sequences=5,
        max_new_tokens=400
    )

    # decode the generated inferences
    inferences = tokenizer.batch_decode(outputs, skip_special_tokens=True, clean_up_tokenization_spaces=False)

    return inferences

conversation = [
    "Hey, I'm trying to convince my parents to get a dog, but they say it's too much work.",
    "Well, you could offer to do everything for taking care of it. Have you tried that?",
    "But I don't want to have to take the dog out for walks when it is the winter!"
]

inferences = generate(conversation, "cause")
print('\n'.join(inferences))

# Outputs:
# the speaker's fear of the cold and the inconvenience of having to take the dog out in the winter.
# the speaker's preference for indoor activities during winter, such as watching movies or playing video games.
# the speaker's fear of getting sick from taking the dog out in the cold.
# a previous negative experience with taking dogs for walks in the winter.
# the listener's suggestion to offer to help with taking care of the dog, which the speaker may have considered but was not willing to do.

📚 Documentation

Model Description

Repository: Code
Paper: ConvoSense: Overcoming Monotonous Commonsense Inferences for Conversational AI
Point of Contact: Sarah E. Finch

Model Training

ConvoSenseGenerator is trained on our recent dataset: ConvoSense. The backbone model of ConvoSenseGenerator is T5 - 3b.

🔧 Technical Details

The model is trained on the ConvoSense dataset, which is synthetically collected using ChatGPT3.5. The backbone model is T5 - 3b. The best - performing configuration uses specific generation hyperparameters such as repetition_penalty = 1.0, num_beams = 10, num_beam_groups = 10, and diversity_penalty = 0.5.

📄 License

This project is licensed under the Apache - 2.0 license.

Citation

Please cite our work if you find the resources in this repository useful:

@article{convosense_finch:24,
    author = {Finch, Sarah E. and Choi, Jinho D.},
    title = "{ConvoSense: Overcoming Monotonous Commonsense Inferences for Conversational AI}",
    journal = {Transactions of the Association for Computational Linguistics},
    volume = {12},
    pages = {467-483},
    year = {2024},
    month = {05},
    issn = {2307-387X},
    doi = {10.1162/tacl_a_00659},
    url = {https://doi.org/10.1162/tacl\_a\_00659},
    eprint = {https://direct.mit.edu/tacl/article-pdf/doi/10.1162/tacl\_a\_00659/2369521/tacl\_a\_00659.pdf},
}

Featured Recommended AI Models

Empowering the Future, Your AI Solution Knowledge Base

English 简体中文繁體中文にほんご