bart-finetuned-samsum開源對話摘要模型 - 優化對話摘要效果，免費可用！

首頁

Bart Finetuned Samsum

由luisotorres開發

基於BART-large-xsum微調的對話摘要模型，專門針對SamSum對話數據集優化

文本生成

Transformers

英語#對話摘要 #高ROUGE得分 #英文會話處理

下載量 177

發布時間 : 10/30/2023

模型概述

該模型專注於生成對話文本的摘要，能夠從對話中提取關鍵信息並生成簡潔的總結。

模型特點

對話摘要優化

專門針對對話數據進行微調，能夠有效理解對話上下文並生成準確摘要

基於BART架構

利用BART強大的序列到序列學習能力，實現高質量的文本生成

SamSum數據集微調

使用專業對話摘要數據集SamSum進行訓練，提升對話場景下的表現

模型能力

對話文本摘要

關鍵信息提取

自然語言生成

使用案例

對話分析

客服對話總結

自動生成客服對話的關鍵問題和解決方案摘要

提高客服效率，便於後續分析

會議記錄精簡

將冗長的會議對話轉化為簡潔的要點總結

節省閱讀時間，快速掌握會議核心內容

🚀 bart-finetuned-samsum 模型

本模型是專門對 facebook/bart-large-xsum 進行適配的版本，通過使用 SamSum 數據集進行微調，以提升其在對話摘要任務上的性能。

🚀 快速開始

本模型可用於對話摘要任務。以下是使用示例：

from transformers import pipeline

model = pipeline("summarization", model="luisotorres/bart-finetuned-samsum")

conversation = '''Sarah: Do you think it's a good idea to invest in Bitcoin?
    Emily: I'm skeptical. The market is very volatile, and you could lose money.
    Sarah: True. But there's also a high upside, right?                                     
'''
model(conversation)

✨ 主要特性

基於 facebook/bart-large-xsum 模型進行微調。
針對 SamSum 數據集優化，在對話摘要任務上表現出色。

📦 安裝指南

文檔未提及安裝步驟，可參考 transformers 庫的官方安裝指南進行安裝。

💻 使用示例

基礎用法

from transformers import pipeline

model = pipeline("summarization", model="luisotorres/bart-finetuned-samsum")

conversation = '''Sarah: Do you think it's a good idea to invest in Bitcoin?
    Emily: I'm skeptical. The market is very volatile, and you could lose money.
    Sarah: True. But there's also a high upside, right?                                     
'''
model(conversation)

📚 詳細文檔

開發相關

Kaggle Notebook：Text Summarization with Large Language Models

訓練參數

evaluation_strategy = "epoch",
save_strategy = 'epoch',
load_best_model_at_end = True,
metric_for_best_model = 'eval_loss',
seed = 42,
learning_rate=2e-5,
per_device_train_batch_size=4,
per_device_eval_batch_size=4,
gradient_accumulation_steps=2,
weight_decay=0.01,
save_total_limit=2,
num_train_epochs=4,
predict_with_generate=True,
fp16=True,
report_to="none"

參考資料

本模型基於原始的 BART 架構，詳情可參考： Lewis et al. (2019). BART: Denoising Sequence-to-Sequence Pre-training for Natural Language Generation, Translation, and Comprehension. arXiv:1910.13461