SummLlama3-8B: an open-source text summarization model trained on feedback to produce faithful, complete, and concise summaries

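Developed by DISLab

SummLlama3-8B is a text summarization model initialized from Llama3-8B-Instruct and further trained with DPO on large-scale summarization feedback. It performs strongly on faithfulness, completeness, and conciseness.

Text generation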

Safetensors

#DPO-optimized summarization #Cross-domain summarization #Human preference alignment

Downloads: 15

Release date: 10/11/2024

Model Overview

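Specializes in generating cross-domain text summaries aligned with human preferences, supporting 7 scenarios including news, medical, and meetings. In automated evaluation its average score (0.735) is higher than that of Llama3-70B-Instruct (0.671) and GPT-4o (0.678).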

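Model Features

Cross-domain optimization

Covers 7 domains including news, medical, and meetings, and handles both dialogue and non-dialogue text

Balance across three metrics

In human evaluation, leads the compared Llama3 baselines on faithfulness (0.980), completeness (0.697), and conciseness (0.959)

Efficient inference

At 8B parameters, scores higher on average than Llama3-70B-Instruct while offering fast inference

LLM feedback training

DPO training on LLM-generated feedback for more than 100,000 summaries avoids the cost of human annotation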

Model Capabilities

Multi-domain text summarization

Dialogue condensation

Key information extraction

Structured compression of long documents

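Use Cases

Media industry

News brief generation

Automatically extracts the core facts of a news story

Reduces text volume by 70% while preserving the flow of events

Healthcare

Clinical record summarization

Extracts the key information from clinical encounters

12% accuracy improvement over the baseline

Enterprise office

Meeting minutes generation

Automatically summarizes discussion points and decisions

Captures all action items with no redundant information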

🚀 SummLlama3-8B

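Looking for a summarizer that produces more human-preferred summaries across multiple domains? Our SummLlama3-8B may be what you need!

🚀 Quick Start

SummLlama3-8B is a summarization model based on Llama3-8B-Instruct, further trained with Direct Preference Optimization (DPO) on large-scale summarization feedback (more than 100,000 examples).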
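
For readers unfamiliar with DPO, the following is a minimal sketch of the standard per-example DPO objective, computed from sequence log-probabilities. It is an illustration only: the function name `dpo_loss`, the `beta` value, and the example numbers are assumptions, and this card does not state the hyperparameters or training code actually used for SummLlama3-8B.

```python
import math


def dpo_loss(policy_chosen_logp, policy_rejected_logp,
             ref_chosen_logp, ref_rejected_logp, beta=0.1):
    """Standard DPO loss for one (chosen, rejected) summary pair.

    Inputs are log-probabilities of each full summary under the policy
    being trained and under the frozen reference model.
    beta=0.1 is an illustrative value, not taken from this card.
    """
    chosen_logratio = policy_chosen_logp - ref_chosen_logp
    rejected_logratio = policy_rejected_logp - ref_rejected_logp
    margin = beta * (chosen_logratio - rejected_logratio)
    # -log(sigmoid(margin)), written in a numerically stable form.
    return max(-margin, 0.0) + math.log1p(math.exp(-abs(margin)))


# Hypothetical log-probabilities: the policy already prefers the chosen summary.
print(dpo_loss(-10.0, -20.0, -12.0, -15.0))
```

The loss falls below log 2 once the policy favors the preferred summary more strongly than the reference model does, which is the direction training pushes it.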

Basic Model Information

Property	Details
Base Model	meta-llama/Meta-Llama-3-8B-Instruct
Pipeline Tag	summarization

Evaluation Results

Automated evaluation results

Config.	Faithfulness	Completeness	Conciseness	Average
Llama3-8B-Instruct	0.864	0.583	0.450	0.632
Llama3-70B-Instruct	0.931	0.596	0.487	0.671
GPT-4o	0.940	0.657	0.437	0.678
SummLlama3-8B	0.931	0.614	0.659	0.735
SummLlama3-70B	0.950	0.632	0.754	0.779

Human evaluation results

Config.	Faithfulness	Completeness	Conciseness	Average
Llama3-8B-Instruct	0.902	0.636	0.784	0.774
Llama3-70B-Instruct	0.953	0.659	0.792	0.801
SummLlama3-8B	0.980	0.697	0.959	0.879
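
The Average column in both tables is consistent with a plain arithmetic mean of the three metric scores, rounded to three decimals. The short check below reproduces that from the numbers in the tables; the dictionary and function names are illustrative, and the card itself does not state how the column was computed.

```python
# Rows copied from the tables above:
# (faithfulness, completeness, conciseness, reported average)
AUTOMATED = {
    "Llama3-8B-Instruct": (0.864, 0.583, 0.450, 0.632),
    "Llama3-70B-Instruct": (0.931, 0.596, 0.487, 0.671),
    "GPT-4o": (0.940, 0.657, 0.437, 0.678),
    "SummLlama3-8B": (0.931, 0.614, 0.659, 0.735),
    "SummLlama3-70B": (0.950, 0.632, 0.754, 0.779),
}
HUMAN = {
    "Llama3-8B-Instruct": (0.902, 0.636, 0.784, 0.774),
    "Llama3-70B-Instruct": (0.953, 0.659, 0.792, 0.801),
    "SummLlama3-8B": (0.980, 0.697, 0.959, 0.879),
}


def mean_of_metrics(faithfulness, completeness, conciseness):
    """Arithmetic mean of the three scores, rounded like the tables."""
    return round((faithfulness + completeness + conciseness) / 3, 3)


def check(table):
    """Map each model name to whether its reported average matches the mean."""
    return {name: mean_of_metrics(*row[:3]) == row[3] for name, row in table.items()}


print(check(AUTOMATED))
print(check(HUMAN))
```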

Other Versions

SummLlama3-70B
- https://huggingface.co/DISLab/SummLlama3-70B
SummLlama3.1-Series
- https://huggingface.co/DISLab/SummLlama3.1-8B
- https://huggingface.co/DISLab/SummLlama3.1-70B
SummLlama3.2-Series
- https://huggingface.co/DISLab/SummLlama3.2-3B

Recommended Prompt

We recommend using the prompt below, which is the prompt the model was trained with.

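from transformers import AutoTokenizer
tokenizer = AutoTokenizer.from_pretrained("DISLab/SummLlama3-8B")  # assumed repo id, not stated in this card

def format_chat_template(document):
    # The typo "documnet" is kept on purpose: this string is the prompt used in training.
    instruction = "Please summarize the input documnet."
    row_json = [{"role": "user", "content": f"Below is an instruction that describes a task. Write a response that appropriately completes the request.\n\n### Instruction:\n{instruction}\n\n### Input:\n{document}\n\n### Response:\n"}]
    return tokenizer.apply_chat_template(row_json, tokenize=False)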
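
As a rough sketch of how the recommended prompt could be used for inference with the Hugging Face `transformers` library: the repo id `DISLab/SummLlama3-8B`, the helper names `build_messages` and `summarize`, `add_generation_prompt=True`, greedy decoding, `max_new_tokens=256`, and bfloat16 loading are all assumptions for illustration, not settings documented in this card. `device_map="auto"` additionally requires the `accelerate` package. The instruction string, including its typo, is kept verbatim because the card says this is the prompt used in training.

```python
INSTRUCTION = "Please summarize the input documnet."  # verbatim training prompt, typo included


def build_messages(document):
    """Wrap a document in the recommended single-turn user message."""
    content = (
        "Below is an instruction that describes a task. "
        "Write a response that appropriately completes the request.\n\n"
        f"### Instruction:\n{INSTRUCTION}\n\n"
        f"### Input:\n{document}\n\n"
        "### Response:\n"
    )
    return [{"role": "user", "content": content}]


def summarize(document, model_id="DISLab/SummLlama3-8B", max_new_tokens=256):
    """Generate a summary. Downloads the model on first call (assumed repo id)."""
    import torch
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(model_id)
    model = AutoModelForCausalLM.from_pretrained(
        model_id, torch_dtype=torch.bfloat16, device_map="auto"
    )
    # Assumption: append the assistant header so the model starts its reply.
    prompt = tokenizer.apply_chat_template(
        build_messages(document), tokenize=False, add_generation_prompt=True
    )
    # The chat template already emits the BOS token, so do not add it again.
    inputs = tokenizer(prompt, return_tensors="pt", add_special_tokens=False).to(model.device)
    output_ids = model.generate(**inputs, max_new_tokens=max_new_tokens, do_sample=False)
    # Decode only the newly generated tokens, not the echoed prompt.
    new_tokens = output_ids[0][inputs["input_ids"].shape[-1]:]
    return tokenizer.decode(new_tokens, skip_special_tokens=True)
```

Calling `summarize(text)` returns the generated summary as a string. `build_messages` has no dependencies, so the prompt format can be inspected without loading the model.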

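Summarizer Overview

Rather than relying on expensive human feedback, we use high-quality, multi-dimensional, fine-grained feedback generated by large language models (LLMs). The model excels at Faithfulness, Completeness, and Conciseness, the three aspects humans use to judge a good summarizer.

Faithfulness: The summarizer does not manipulate the information in the input text or add information that cannot be directly inferred from it.
Completeness: The summarizer includes all key information from the input text in the output summary.
Conciseness: The summarizer does not include information beyond the key information, keeping the summary succinct and focused.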
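
One way to turn the three definitions above into numbers is sketched below. This is an assumption for illustration, not necessarily how the scores reported in this card were computed: it supposes each summary sentence has been labeled as faithful or not, and tagged with the source key facts it covers. The function name `summary_scores` and the example labels are hypothetical.

```python
def summary_scores(sentence_is_faithful, sentence_keyfacts, num_keyfacts):
    """Toy scoring of one summary from per-sentence annotations.

    sentence_is_faithful: list of bool, one per summary sentence.
    sentence_keyfacts: list of sets, ids of the source key facts each
        summary sentence covers (empty set = no key fact).
    num_keyfacts: total number of key facts in the source text.
    """
    n = len(sentence_is_faithful)
    if n == 0 or n != len(sentence_keyfacts) or num_keyfacts <= 0:
        raise ValueError("need matching, non-empty annotations")
    # Faithfulness: share of summary sentences without factual errors.
    faithfulness = sum(sentence_is_faithful) / n
    # Completeness: share of source key facts that appear in the summary.
    covered = set().union(*sentence_keyfacts)
    completeness = len(covered) / num_keyfacts
    # Conciseness: share of summary sentences that carry at least one key fact.
    conciseness = sum(1 for facts in sentence_keyfacts if facts) / n
    return faithfulness, completeness, conciseness


# Hypothetical annotations for a 4-sentence summary of a source with 5 key facts.
print(summary_scores([True, True, True, False], [{0}, {1}, set(), {1, 2}], 5))
```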

Concrete Example

Speaker	Dialogue
Person 1	Hey, Paul, you're still having Thanksgiving dinner at my house on Thursday, right?
Person 2	Yeah, thanks again for the invitation. I was worried I'd have to spend it alone after my parents announced they'd be in Finland.
Person 1	I think it's strange that you're not going with them. Is it because of the cold weather in Northern Europe?
Person 2	Not exactly. I have a lot of work to do next week.
Person 1	Well, I'm glad you won't be alone on a holiday. It'll be great to have you celebrate with us. The more the merrier!
Person 2	Exactly, how many people will be there?
Person 1	Let's see, my parents, my grandpa, my sister, my brother-in-law, my niece and my two cousins, plus us. So, ten people altogether.
Person 2	Should I bring anything?
Person 1	Nope, we've got it covered.
Person 2	Come on, I hate to show up empty-handed. You are being so generous to have me. I know! I'll bring pie. I'll make a great pumpkin pie.
Person 1	Sounds delicious, but my sister has already volunteered for dessert. Look, don't worry about it. There will be tons of food. Turkey, stuffing, green beans, sweet potatoes.
Person 2	Wow, I'm getting hungry just thinking about it. How about this, I'll bring a nice bottle of wine. You can never have too much of that.
Person 1	Well, isn't that the truth? Sounds good to me.

Model	Summary
Llama3-8B-Instruct	Person1 invites Person2 to join them for Thanksgiving dinner at their house. Person2 accepts the invitation, citing that they were worried they would have to spend the holiday alone after their parents announced they would be in Finland. Person1 reassures Person2 that they are welcome and that the more the merrier. Person2 asks how many people will be attending and is told that there will be 10 people, including their family and cousins. Person2 offers to bring a gift, but Person1 declines, saying that they have everything covered. Person2 persists and offers to bring a bottle of wine or a pie, but Person1 suggests that their sister has already volunteered to bring dessert.
Llama3-70B-Instruct	Person 1 invites Person 2 to their house for Thanksgiving dinner on Thursday. Person 2 is grateful for the invitation and explains they can't go to Finland with their parents due to work commitments. Person 1 is happy to have them celebrate with their family, which will consist of 10 people. When Person 2 offers to bring something, Person 1 declines, but eventually agrees to let them bring a bottle of wine.
SummLlama3-8B	The conversation is about Thanksgiving dinner arrangements, with Person2 (Paul) confirming his attendance at Person1's (host's) house on Thursday, discussing what to bring and what's already been taken care of, and finalizing the offer to bring wine instead of pie since someone else is handling desserts.

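Why SummLlama3-8B's summary is preferred by humans

Core focus: The summary accurately captures the main theme of the conversation, the Thanksgiving dinner arrangements. It highlights how the two speakers confirm the plan, discuss what to bring, and settle on Person2 bringing wine instead of pie, which preserves the core context.
Inclusion of key facts: The summary covers the important details of the conversation, including that Person2 first offered to bring dessert (pumpkin pie) and then switched to wine because another family member was already handling dessert. The other summaries tend to overlook or simplify this progression, whereas SummLlama3-8B captures the key events of the conversation in full.
Clarity and conciseness: The summary is structured in a simple, concise way and conveys the conversation effectively without unnecessary detail. It presents the flow and outcome of the conversation clearly, making it easy for the reader to follow. The logical order of events is maintained, which keeps the narrative smooth.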