SummLlama3-70B: Open-Source Text Summarization Model - Deploy for Free for Efficient, Accurate, and Concise Summaries

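Developed by DISLab

SummLlama3-70B is a text summarization model initialized from Llama3-70B-Instruct and optimized with DPO training on large-scale summarization feedback. It performs strongly on faithfulness, completeness, and conciseness.

Large language model

Safetensors

#MultiDomainSummarization #DPOOptimized #HighFaithfulness

Downloads: 15

Release date: 10/11/2024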

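Model Overview

An advanced model specialized in multi-domain text summarization. It supports both dialogue and non-dialogue text, covers seven domains including news and medical, and generates summaries aligned with human preferences.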

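Model Features

Multi-domain coverage

Covers seven domains, including news, lifestyle, and medical, and supports summary generation for both dialogue and non-dialogue text.

DPO optimization

Applies Direct Preference Optimization (DPO) to more than 100,000 pieces of LLM-generated summarization feedback, which substantially improves summary quality.

Strength on three dimensions

Scores 0.950 on faithfulness, 0.632 on completeness, and 0.754 on conciseness in automated evaluation, ahead of the base Llama3-70B-Instruct on all three.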

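Model Capabilities

Long-document summarization

Dialogue transcript summarization

Cross-domain summary generation

Key information extraction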

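Use Cases

Content aggregation

News brief generation

Extracts the core events from multiple news articles to produce a daily brief.

Faithfulness reaches 0.95 in automated evaluation, reducing manual editing time by 80%.

Meeting records

Automatic meeting minutes

Condenses multi-speaker dialogue transcripts into a structured summary of action items.

Completeness is 6% higher than Llama3-70B-Instruct on the test set.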

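🚀 SummLlama3-70B

Are you looking for a summarizer that generates more human-preferred summaries across multiple domains?

Our SummLlama3-70B could be exactly what you need!

SummLlama3-70B is initialized from Llama3-70B-Instruct and further trained with Direct Preference Optimization (DPO) on large-scale (over 100K) summarization feedback.

The feedback covers a wide range of input documents, from short to lengthy texts, in both dialogue and non-dialogue formats, and spans seven distinct domains:

Four non-dialogue domains: News, Lifestyle, Report, Medical
Three dialogue domains: Daily Life, Interview, Meeting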
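
For readers unfamiliar with DPO, the following is a minimal sketch of the standard DPO loss for a single preference pair, where the "chosen" summary is the one the feedback prefers. The source does not report the beta value or any other training hyperparameter, so `beta=0.1` and the log-probabilities in the example are placeholders, and the function name is hypothetical.

```python
import math

def dpo_loss(policy_chosen_logp, policy_rejected_logp,
             ref_chosen_logp, ref_rejected_logp, beta=0.1):
    """Standard DPO loss for one (chosen, rejected) summary pair.

    Each argument is the total log-probability of a summary given the input
    document, under the policy being trained or the frozen reference model.
    beta=0.1 is an illustrative value, not one reported by the source.
    """
    # Implicit rewards: how much more likely each summary is under the
    # policy than under the reference model, scaled by beta.
    chosen_reward = beta * (policy_chosen_logp - ref_chosen_logp)
    rejected_reward = beta * (policy_rejected_logp - ref_rejected_logp)
    margin = chosen_reward - rejected_reward
    # -log(sigmoid(margin)) computed as a numerically stable softplus(-margin).
    x = -margin
    return max(x, 0.0) + math.log1p(math.exp(-abs(x)))

# Hypothetical log-probabilities: the policy has shifted toward the chosen summary.
print(dpo_loss(-8.0, -14.0, -10.0, -12.0))
```

When the policy and the reference model assign identical log-probabilities, the margin is zero and the loss equals log 2. The loss falls as the policy favors the chosen summary more strongly than the reference does.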

These are the automated evaluation results:

Config.	Faithfulness	Completeness	Conciseness	Average
Llama3-8B-Instruct	0.864	0.583	0.450	0.632
Llama3-70B-Instruct	0.931	0.596	0.487	0.671
GPT-4o	0.940	0.657	0.437	0.678
SummLlama3-70B	0.950	0.632	0.754	0.779
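
The Average column appears to be the unweighted mean of the three dimension scores. That is an inference from the numbers rather than something the source states, and the short check below recomputes it from the table values. The helper name `mean_score` is hypothetical.

```python
# Scores copied from the automated-evaluation table above:
# (faithfulness, completeness, conciseness, reported average)
rows = {
    "Llama3-8B-Instruct": (0.864, 0.583, 0.450, 0.632),
    "Llama3-70B-Instruct": (0.931, 0.596, 0.487, 0.671),
    "GPT-4o": (0.940, 0.657, 0.437, 0.678),
    "SummLlama3-70B": (0.950, 0.632, 0.754, 0.779),
}

def mean_score(faithfulness, completeness, conciseness):
    """Unweighted mean of the three dimension scores."""
    return (faithfulness + completeness + conciseness) / 3

for name, (f, c, s, reported) in rows.items():
    computed = mean_score(f, c, s)
    print(f"{name}: computed {computed:.3f}, reported {reported:.3f}")
```

Every recomputed value agrees with the reported average to three decimal places.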

For details on how LLM-generated feedback is used in text summarization, please refer to our paper.

SummLlama3-Series https://huggingface.co/DISLab/SummLlama3-8B https://huggingface.co/DISLab/SummLlama3-70B

SummLlama3.1-Series https://huggingface.co/DISLab/SummLlama3.1-8B https://huggingface.co/DISLab/SummLlama3.1-70B

SummLlama3.2-Series https://huggingface.co/DISLab/SummLlama3.2-3B

💡 Recommended Prompt for Text Summarization

The model was trained with the prompt below, so we recommend using the same prompt when requesting summaries.

from transformers import AutoTokenizer
tokenizer = AutoTokenizer.from_pretrained("DISLab/SummLlama3-70B")

def format_chat_template(document):
    # The instruction string is kept exactly as given in the source, spelling included.
    instruction = "Please summarize the input documnet."
    row_json = [{"role": "user", "content": f"Below is an instruction that describes a task. Write a response that appropriately completes the request.\n\n### Instruction:\n{instruction}\n\n### Input:\n{document}\n\n### Response:\n"}]
    return tokenizer.apply_chat_template(row_json, tokenize=False)
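
It can help to inspect the prompt text before a tokenizer is involved. The sketch below builds the same single-turn message list as the function above using only the standard library. The names `PROMPT_TEMPLATE`, `INSTRUCTION`, and `build_messages` are hypothetical, and the instruction string is copied verbatim from the snippet above.

```python
PROMPT_TEMPLATE = (
    "Below is an instruction that describes a task. "
    "Write a response that appropriately completes the request.\n\n"
    "### Instruction:\n{instruction}\n\n"
    "### Input:\n{document}\n\n"
    "### Response:\n"
)

# Copied verbatim from the recommended prompt, spelling included.
INSTRUCTION = "Please summarize the input documnet."

def build_messages(document):
    """Return the single-turn chat message list for one input document."""
    content = PROMPT_TEMPLATE.format(instruction=INSTRUCTION, document=document)
    return [{"role": "user", "content": content}]

messages = build_messages("Example input text to be summarized.")
print(messages[0]["content"])
```

The returned list can then be passed to `tokenizer.apply_chat_template(messages, tokenize=False)`, as in the function above.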

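✨ Key Features

Rather than relying on expensive human feedback, we use high-quality, multi-dimensional, fine-grained feedback generated by large language models (LLMs).

The model excels at faithfulness, completeness, and conciseness, the three human-preferred aspects for judging a good summarizer.

Faithfulness: the summarizer does not manipulate the information in the input text or add information that cannot be directly inferred from it.
Completeness: the summarizer includes all key information from the input text in the output summary.
Conciseness: the summarizer leaves out information beyond the key information, keeping the summary succinct and focused.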
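
One way to turn the three definitions above into scores between 0 and 1 is to count fine-grained labels: per summary sentence for faithfulness and conciseness, and per key fact for completeness. The sketch below assumes such labels already exist, for example from an annotator or an LLM judge. The function name, the argument names, and the example labels are hypothetical, and the actual evaluation pipeline may compute its scores differently.

```python
def summary_scores(sentence_is_faithful, keyfact_is_covered, sentence_matches_keyfact):
    """Fraction-based scores for one summary, from fine-grained boolean labels.

    sentence_is_faithful: one bool per summary sentence, True if the sentence
        is supported by the input text.
    keyfact_is_covered: one bool per key fact of the input text, True if the
        summary contains that fact.
    sentence_matches_keyfact: one bool per summary sentence, True if the
        sentence conveys at least one key fact.
    """
    def fraction(flags):
        # An empty label list scores 0.0 rather than raising ZeroDivisionError.
        return sum(flags) / len(flags) if flags else 0.0

    return {
        "faithfulness": fraction(sentence_is_faithful),
        "completeness": fraction(keyfact_is_covered),
        "conciseness": fraction(sentence_matches_keyfact),
    }

# Hypothetical labels for a three-sentence summary of a text with four key facts.
scores = summary_scores(
    sentence_is_faithful=[True, True, False],
    keyfact_is_covered=[True, False, True, True],
    sentence_matches_keyfact=[True, True, False],
)
print(scores)
```

In this example, one unsupported sentence lowers faithfulness, one missing key fact lowers completeness, and one sentence that carries no key fact lowers conciseness.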

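Based on a comprehensive evaluation of summary quality, covering both human and automated evaluation, SummLlama3 showed substantial improvement over the original Llama3 series.

The results are as follows:

Human Evaluation

Config.	Faithfulness	Completeness	Conciseness	Average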
Llama3-8B-Instruct	0.902	0.636	0.784	0.774
Llama3-70B-Instruct	0.953	0.659	0.792	0.801
SummLlama3-8B	0.980	0.697	0.959	0.879

Automated Evaluation using FineSurE

Config.	Faithfulness	Completeness	Conciseness	Average
Llama3-8B-Instruct	0.864	0.583	0.450	0.632
Llama3-70B-Instruct	0.931	0.596	0.487	0.671
SummLlama3-8B	0.931	0.614	0.659	0.735
SummLlama3-70B	0.950	0.632	0.754	0.779
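
To put the table in relative terms, the sketch below computes the percentage gain of SummLlama3-70B over Llama3-70B-Instruct on each dimension, using the FineSurE scores above. The helper name `relative_gain` is hypothetical.

```python
# FineSurE scores copied from the table above.
base = {"faithfulness": 0.931, "completeness": 0.596, "conciseness": 0.487}   # Llama3-70B-Instruct
tuned = {"faithfulness": 0.950, "completeness": 0.632, "conciseness": 0.754}  # SummLlama3-70B

def relative_gain(new, old):
    """Relative improvement of new over old, in percent."""
    return (new - old) / old * 100

gains = {name: relative_gain(tuned[name], base[name]) for name in base}
for name, gain in gains.items():
    print(f"{name}: {gain:+.1f}%")
```

The completeness gain comes out at roughly 6%, which matches the figure quoted in the use cases. The largest relative gain is in conciseness.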

💻 Usage Examples

Basic Usage

Let's look at an example of how SummLlama3-8B improves on the summaries from Llama3-8B-Instruct and Llama3-70B-Instruct for the document below.

Speaker	Dialogue
Person 1	Hey, Paul, you're still having Thanksgiving dinner at my house on Thursday, right?
Person 2	Yeah, thanks again for the invitation. I was worried I'd have to spend it alone after my parents announced they'd be in Finland.
Person 1	I think it's strange that you're not going with them. Is it because of the cold weather in Northern Europe?
Person 2	Not exactly. I have a lot of work to do next week.
Person 1	Well, I'm glad you won't be alone on a holiday. It'll be great to have you celebrate with us. The more the merrier!
Person 2	Exactly, how many people will be there?
Person 1	Let's see, my parents, my grandpa, my sister, my brother-in-law, my niece and my two cousins, plus us. So, ten people altogether.
Person 2	Should I bring anything?
Person 1	Nope, we've got it covered.
Person 2	Come on, I hate to show up empty-handed. You are being so generous to have me. I know! I'll bring pie. I'll make a great pumpkin pie.
Person 1	Sounds delicious, but my sister has already volunteered for dessert. Look, don't worry about it. There will be tons of food. Turkey, stuffing, green beans, sweet potatoes.
Person 2	Wow, I'm getting hungry just thinking about it. How about this, I'll bring a nice bottle of wine. You can never have too much of that.
Person 1	Well, isn't that the truth? Sounds good to me.

Model	Summary
Llama3-8B-Instruct	Person1 invites Person2 to join them for Thanksgiving dinner at their house. Person2 accepts the invitation, citing that they were worried they would have to spend the holiday alone after their parents announced they would be in Finland. Person1 reassures Person2 that they are welcome and that the more the merrier. Person2 asks how many people will be attending and is told that there will be 10 people, including their family and cousins. Person2 offers to bring a gift, but Person1 declines, saying that they have everything covered. Person2 persists and offers to bring a bottle of wine or a pie, but Person1 suggests that their sister has already volunteered to bring dessert.
Llama3-70B-Instruct	Person 1 invites Person 2 to their house for Thanksgiving dinner on Thursday. Person 2 is grateful for the invitation and explains they can't go to Finland with their parents due to work commitments. Person 1 is happy to have them celebrate with their family, which will consist of 10 people. When Person 2 offers to bring something, Person 1 declines, but eventually agrees to let them bring a bottle of wine.
SummLlama3-8B	The conversation is about Thanksgiving dinner arrangements, with Person2 (Paul) confirming his attendance at Person1's (host's) house on Thursday, discussing what to bring and what's already been taken care of, and finalizing the offer to bring wine instead of pie since someone else is handling desserts.

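The summary from SummLlama3-8B can be considered the more human-preferred one for the following reasons:

Core focus: The summary accurately captures the main theme of the conversation, which is the Thanksgiving dinner arrangements. It highlights how the two people confirm the plan, discuss what to bring, and settle on Person 2 bringing wine instead of pie. This preserves the core context.
Inclusion of key facts: The summary covers the important details of the conversation, including Person 2's initial offer to bring dessert (pumpkin pie) and the shift to wine because another family member is already handling dessert. The other summaries tend to overlook or simplify this progression, whereas SummLlama3-8B fully captures the key events of the interaction.
Clarity and conciseness: The summary is direct and succinct, and it summarizes the conversation effectively without unnecessary details. It presents the flow and outcome of the discussion clearly, which makes it easy to follow. The logical order of events is preserved, so the narrative reads smoothly.
Accurate role depiction: The summary clearly identifies Person 1 as the host and Paul (Person 2) as the guest, which clarifies their relationship and the nature of the conversation. This distinction is more explicit in SummLlama3-8B than in the other summaries.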
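
A dialogue like the one above has to be flattened into a single document string before it can be placed in the prompt. The source does not say how dialogues were serialized for training, so the "Speaker: utterance" line format below is an assumption, and `dialogue_to_document` is a hypothetical helper.

```python
def dialogue_to_document(turns):
    """Join (speaker, utterance) pairs into one 'Speaker: utterance' line per turn.

    The line format is an assumption; the source does not specify one.
    """
    return "\n".join(f"{speaker}: {utterance.strip()}" for speaker, utterance in turns)

# Two turns taken from the example dialogue above.
turns = [
    ("Person 2", "Should I bring anything?"),
    ("Person 1", "Nope, we've got it covered."),
]
document = dialogue_to_document(turns)
print(document)
```

The resulting string is what would be passed as the `document` argument of the recommended prompt function.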

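📚 Detailed Documentation

Model Information

Attribute	Details
Model type	Summarization model
Training data	Large-scale (over 100K) summarization feedback. The input documents range from short to lengthy texts, include both dialogue and non-dialogue formats, and span seven distinct domains.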