Rugpt_chitchat Open Source Model - Free Support for Russian Chats and Common Sense Reasoning Dialogues

Home

Rugpt Chitchat

Developed by inkoziev

A generative model supporting Russian casual conversation and common sense reasoning, based on GPT-2 architecture

Large Language Model

Transformers

Other#Russian casual conversation #Common sense reasoning #Mathematical calculation

Downloads 70

Release Time : 9/15/2022

Model Overview

This model serves as the core component of dialogue systems, featuring two main functions: generating Russian casual conversations and performing common sense reasoning based on given facts. It can solve simple arithmetic problems and logic questions

Model Features

Russian casual conversation

Capable of generating natural and fluent Russian dialogue responses, supporting multi-turn context understanding

Common sense reasoning

Performs logical reasoning and problem-solving based on given facts, supporting syllogism and other reasoning methods

Arithmetic capability

Can solve elementary-level (grades 1-2) arithmetic problems with a test accuracy of 91%

Information filtering

Automatically filters key facts from redundant information for responses

Model Capabilities

Russian text generation

Multi-turn dialogue

Common sense reasoning

Simple arithmetic calculation

Logical reasoning

Use Cases

Dialogue systems

Chatbot

Used to build Russian-language chatbots

Generates natural and fluent dialogue responses

Question answering systems

Fact-based Q&A

Answers questions based on given facts

91% accuracy in solving arithmetic problems

Logical reasoning

Performs simple syllogistic reasoning

Capable of handling reasoning problems with implicit premises

🚀 Russian Chit-chat, Deductive and Common Sense reasoning model

This model serves as the core of a prototype dialogue system with two main functions.

🚀 Quick Start

The model has two main functions:

✨ Features

Chat Replication Generation: It takes the dialogue history (1 - 10 previous utterances) as input to generate chat responses.
```
- Hi, how are you?
- Hi, not so good.
- <<< This is the response we expect from the model >>>
```
Answer Deduction: It can deduce answers to given questions based on additional facts or common sense. Relevant facts are assumed to be retrieved from an external knowledge base using another model, such as sbert_pq. The model will construct a grammatical and concise answer using the provided facts and question text.
```
- Today is September 15th. What month is it now?
- September
```
The model can also perform syllogistic reasoning and solve simple arithmetic problems.

📦 Installation

No specific installation steps are provided in the original document, so this section is skipped.

💻 Usage Examples

Basic Usage

import torch
from transformers import AutoTokenizer, AutoModelForCausalLM


device = "cuda" if torch.cuda.is_available() else "cpu"
model_name = "inkoziev/rugpt_chitchat"
tokenizer = AutoTokenizer.from_pretrained(model_name)
tokenizer.add_special_tokens({'bos_token': '<s>', 'eos_token': '</s>', 'pad_token': '<pad>'})
model = AutoModelForCausalLM.from_pretrained(model_name)
model.to(device)
model.eval()

# Input the last 2 - 3 dialogue utterances. Each utterance starts with "-" on a separate line
input_text = """<s>- Hi! What are you doing?
- Hi :) I'm in a taxi
-"""

encoded_prompt = tokenizer.encode(input_text, add_special_tokens=False, return_tensors="pt").to(device)

output_sequences = model.generate(input_ids=encoded_prompt, max_length=100, num_return_sequences=1, pad_token_id=tokenizer.pad_token_id)

text = tokenizer.decode(output_sequences[0].tolist(), clean_up_tokenization_spaces=True)[len(input_text)+1:]
text = text[: text.find('</s>')]
print(text)

📚 Documentation

Model Variants and Metrics

The currently released model has 760 million parameters, similar to sberbank-ai/rugpt3large_based_on_gpt2. The following table shows the accuracy of solving arithmetic problems on a held - out test set:

Property	Details
Model Type	The model has two main functions: chat response generation and answer deduction based on facts or common sense. It can also perform syllogistic reasoning and solve simple arithmetic problems.
Training Data	Not provided in the original document.
Arithmetic Accuracy
base model	arith. accuracy
---------------------------------------	---------------
sberbank-ai/rugpt3large_based_on_gpt2	0.91
sberbank-ai/rugpt3medium_based_on_gpt2	0.70
sberbank-ai/rugpt3small_based_on_gpt2	0.58
tinkoff-ai/ruDialoGPT-small	0.44
tinkoff-ai/ruDialoGPT-medium	0.69

📄 License

The model is under the unlicense license.

Contacts

If you have any questions about using this model or suggestions for its improvement, please email mentalcomputing@gmail.com

Citation:

@MISC{rugpt_chitchat,
    author  = {Ilya Koziev},
    title   = {Russian Chit-chat with Common sence Reasoning},
    url     = {https://huggingface.co/inkoziev/rugpt_chitchat},
    year    = 2022
}

Featured Recommended AI Models

Empowering the Future, Your AI Solution Knowledge Base

English 简体中文繁體中文にほんご