flan - t5 - base - samsum開源模型 - 免費部署助力對話摘要快速生成

首頁

Flan T5 Base Samsum

由sharmax-vikas開發

該模型是基於google/flan-t5-base在samsum對話摘要數據集上微調的版本，專門用於生成對話摘要任務。

文本生成

Transformers

開源協議:Apache-2.0 #對話摘要 #微調模型 #Rouge高分

下載量 26

發布時間 : 7/23/2024

模型概述

flan-t5-base-samsum是基於FLAN-T5基礎模型微調的對話摘要模型，在samsum測試集上取得了47.355的Rouge1分數。

模型特點

高效微調

在flan-t5-base基礎上針對對話摘要任務進行優化

多輪對話處理

專門針對對話場景設計，能有效捕捉對話關鍵信息

輕量級部署

基於T5架構，相比更大模型更易於部署

模型能力

對話摘要生成

文本壓縮

關鍵信息提取

使用案例

對話處理

會議紀要生成

將會議對話自動生成簡潔的會議紀要

Rouge1分數47.355

客服對話摘要

自動總結客服對話中的關鍵問題和解決方案

🚀 flan-t5-base-samsum

這個模型是 google/flan-t5-base 在 samsum 數據集上的微調版本。它在評估集上取得了以下結果：

損失：1.3736
Rouge1：47.355
Rouge2：23.7601
Rougel：39.8403
Rougelsum：43.4718
生成長度：17.1575

🚀 快速開始

模型信息

屬性	詳情
庫名稱	transformers
許可證	apache-2.0
基礎模型	google/flan-t5-base
標籤	generated_from_trainer
數據集	samsum
評估指標	rouge
模型索引名稱	flan-t5-base-samsum
任務類型	序列到序列語言建模（文本生成）
管道標籤	文本摘要

如何使用

from transformers import pipeline

pipe = pipeline("summarization", model="sharmax-vikas/flan-t5-base-samsum")

res = pipe('''dialogue: 
Margaret: Hi, in December I'd like to meet on 4th and 11th around 10:00 or 11:00. 
Evans: Hi, 4th - we can meet at 10:00.
Evans: And 11th - at 11:00. 
Margaret: Okey. And what about 18th?
Evans: I'm not sure about 18th. 
Evans: I might not be in town. 
Margaret: Okey, so we'll see. 
Evans: Yes. And I'll let you know next week. 
Margaret: If it's not 18th, maybe we could meet on 17th?
Evans: If I go away, I won't also be 17th.
Margaret: Okey, I get it. 
Evans: But we could meet 14th, if you like?
Margaret: Hm, I'm not sure whether I'm avaliable. 
Evans: So let's set these dates later, ok?
Margaret: Okey and we see each other 4th 10:00. 
Evans: Yes!''')

print(f"flan-t5-base summary:\n{res[0]['summary_text']}")

# 輸出: flan-t5-base summary:
# Margaret and Evans will meet on the 4th and 11th of December. They will meet at 10:00 on the 18th and at 11:00 on the 17th. If it's not 18th, they can meet on 17th or 14th.

🔧 技術細節

訓練超參數

訓練過程中使用了以下超參數：

學習率：5e-05
訓練批次大小：16
評估批次大小：16
隨機種子：42
優化器：Adam（β1=0.9，β2=0.999，ε=1e-08）
學習率調度器類型：線性
訓練輪數：5

訓練結果

訓練損失	輪數	步數	驗證損失	Rouge1	Rouge2	Rougel	Rougelsum	生成長度
1.3641	1.0	921	1.3780	47.4054	23.6308	39.8273	43.3697	17.3004
1.3074	2.0	1842	1.3736	47.355	23.7601	39.8403	43.4718	17.1575
1.2592	3.0	2763	1.3740	47.2208	23.4972	39.7293	43.2546	17.2320
1.2232	4.0	3684	1.3794	47.9156	24.2451	40.2628	43.9122	17.4017
1.2042	5.0	4605	1.3780	47.8982	24.1707	40.2955	43.8939	17.3712