chatgpt_paraphraser_on_T5_baseオープンソーステキスト言い換えモデル - 高品質の言い換えテキストを無料で生成

ホーム

Chatgpt Paraphraser On T5 Base

humarinによって開発

T5-baseアーキテクチャで訓練されたテキスト言い換えモデル。高品質な言い換えテキストを生成可能で、Hugging Faceプラットフォームで最高の言い換えモデルの一つと評されている

テキスト生成

Transformers

英語オープンソースライセンス:Openrail #多文脈言い換え #高多様性生成 #T5アーキテクチャ最適化

ダウンロード数 115.08k

リリース時間 : 3/17/2023

モデル概要

このモデルは転移学習技術を用いてChatGPTの言い換え能力を模倣し、Quora、SQUAD 2.0、CNNニュースデータセットを統合して訓練されている。主にテキスト改作や言い換えタスクに使用される

モデル特徴

複数データセット訓練

Quora言い換え質問、SQUAD 2.0、CNNニュースの3つの高品質データセットを統合

高度な生成制御

ビームサーチ、多様性ペナルティなどの高度なテキスト生成パラメータ制御をサポート

高品質言い換え

転移学習でChatGPTの言い換え能力を模倣し、意味を保持した多様な表現を生成

モデル能力

テキスト言い換え

意味保持改作

多様表現生成

使用事例

コンテンツ作成

旅行ガイド改作

観光地説明を多様な表現で書き換え

5種類の異なる表現方法の観光地紹介を生成

ニュース要約改作

ニュース内容を非重複的に言い換え

元の意味を保持した複数の表現バージョン

教育支援

学習教材多様化

同一知識点に対して異なる表現バージョンを生成

学生が多角的に概念を理解するのを支援

🚀 ChatGPTパラフレーズモデル

このモデルは、ChatGPTと同様のパラフレーズ生成を行うことができ、Hugging Face上でも優れたパラフレーズモデルの一つです。T5-baseモデルをベースに、転移学習を用いて訓練されています。

🚀 クイックスタート

このモデルは、ChatGPTパラフレーズデータセットを使用して訓練されています。このデータセットは、Quoraパラフレーズ質問、SQUAD 2.0、CNNニュースデータセットに基づいています。

このモデルはT5-baseモデルをベースにしており、「転移学習」を用いてChatGPTと同様にパラフレーズを生成できるようにしました。現在、これはHugging Faceで最良のパラフレーズモデルの一つです。

Kaggleのリンク

著者1のLinkedInのリンク著者2のLinkedInのリンク

✨ 主な機能

転移学習を用いて、ChatGPTと同様のパラフレーズ生成が可能。
T5-baseモデルをベースにしており、高性能なパラフレーズ生成が可能。

📦 インストール

このモデルを使用するには、transformersライブラリが必要です。以下のコマンドでインストールできます。

pip install transformers

💻 使用例

基本的な使用法

from transformers import AutoTokenizer, AutoModelForSeq2SeqLM

device = "cuda"

tokenizer = AutoTokenizer.from_pretrained("humarin/chatgpt_paraphraser_on_T5_base")

model = AutoModelForSeq2SeqLM.from_pretrained("humarin/chatgpt_paraphraser_on_T5_base").to(device)

def paraphrase(
    question,
    num_beams=5,
    num_beam_groups=5,
    num_return_sequences=5,
    repetition_penalty=10.0,
    diversity_penalty=3.0,
    no_repeat_ngram_size=2,
    temperature=0.7,
    max_length=128
):
    input_ids = tokenizer(
        f'paraphrase: {question}',
        return_tensors="pt", padding="longest",
        max_length=max_length,
        truncation=True,
    ).input_ids.to(device)
    
    outputs = model.generate(
        input_ids, temperature=temperature, repetition_penalty=repetition_penalty,
        num_return_sequences=num_return_sequences, no_repeat_ngram_size=no_repeat_ngram_size,
        num_beams=num_beams, num_beam_groups=num_beam_groups,
        max_length=max_length, diversity_penalty=diversity_penalty
    )

    res = tokenizer.batch_decode(outputs, skip_special_tokens=True)

    return res

具体的な入出力例

入力:

text = 'What are the best places to see in New York?'
paraphrase(text)

出力:

['What are some must-see places in New York?',
 'Can you suggest some must-see spots in New York?',
 'Where should one go to experience the best NYC has to offer?',
 'Which places should I visit in New York?',
 'What are the top destinations to explore in New York?']

入力:

text = "Rammstein's album Mutter was recorded in the south of France in May and June 2000, and mixed in Stockholm in October of that year."
paraphrase(text)

出力:

['In May and June 2000, Rammstein travelled to the south of France to record his album Mutter, which was mixed in Stockholm in October of that year.',
 'The album Mutter by Rammstein was recorded in the south of France during May and June 2000, with mixing taking place in Stockholm in October of that year.',
 'The album Mutter by Rammstein was recorded in the south of France during May and June 2000, with mixing taking place in Stockholm in October of that year. It',
 'Mutter, the album released by Rammstein, was recorded in southern France during May and June 2000, with mixing taking place between October and September.',
 'In May and June 2000, Rammstein recorded his album Mutter in the south of France, with the mix being made at Stockholm during October.']

🔧 技術詳細

推論パラメータ

パラメータ	値
num_beams	5
num_beam_groups	5
num_return_sequences	5
repetition_penalty	10.01
diversity_penalty	3.01
no_repeat_ngram_size	2
temperature	0.7
max_length	128

訓練パラメータ

epochs = 5
batch_size = 64
max_length = 128
lr = 5e-5
batches_qty = 196465
betas = (0.9, 0.999)
eps = 1e-08

📄 ライセンス

このモデルはOpenRailライセンスの下で公開されています。

BibTeXエントリと引用情報

@inproceedings{chatgpt_paraphraser,
  author={Vladimir Vorobev, Maxim Kuznetsov},
  title={A paraphrasing model based on ChatGPT paraphrases},
  year={2023}
}