bart-finetuned-samsum开源对话摘要模型 - 优化对话摘要效果，免费可用！

首页

Bart Finetuned Samsum

由 luisotorres 开发

基于BART-large-xsum微调的对话摘要模型，专门针对SamSum对话数据集优化

文本生成

Transformers

英语#对话摘要 #高ROUGE得分 #英文会话处理

下载量 177

发布时间 : 10/30/2023

模型简介

该模型专注于生成对话文本的摘要，能够从对话中提取关键信息并生成简洁的总结。

模型特点

对话摘要优化

专门针对对话数据进行微调，能够有效理解对话上下文并生成准确摘要

基于BART架构

利用BART强大的序列到序列学习能力，实现高质量的文本生成

SamSum数据集微调

使用专业对话摘要数据集SamSum进行训练，提升对话场景下的表现

模型能力

对话文本摘要

关键信息提取

自然语言生成

使用案例

对话分析

客服对话总结

自动生成客服对话的关键问题和解决方案摘要

提高客服效率，便于后续分析

会议记录精简

将冗长的会议对话转化为简洁的要点总结

节省阅读时间，快速掌握会议核心内容

🚀 bart-finetuned-samsum 模型

本模型是专门对 facebook/bart-large-xsum 进行适配的版本，通过使用 SamSum 数据集进行微调，以提升其在对话摘要任务上的性能。

🚀 快速开始

本模型可用于对话摘要任务。以下是使用示例：

from transformers import pipeline

model = pipeline("summarization", model="luisotorres/bart-finetuned-samsum")

conversation = '''Sarah: Do you think it's a good idea to invest in Bitcoin?
    Emily: I'm skeptical. The market is very volatile, and you could lose money.
    Sarah: True. But there's also a high upside, right?                                     
'''
model(conversation)

✨ 主要特性

基于 facebook/bart-large-xsum 模型进行微调。
针对 SamSum 数据集优化，在对话摘要任务上表现出色。

📦 安装指南

文档未提及安装步骤，可参考 transformers 库的官方安装指南进行安装。

💻 使用示例

基础用法

from transformers import pipeline

model = pipeline("summarization", model="luisotorres/bart-finetuned-samsum")

conversation = '''Sarah: Do you think it's a good idea to invest in Bitcoin?
    Emily: I'm skeptical. The market is very volatile, and you could lose money.
    Sarah: True. But there's also a high upside, right?                                     
'''
model(conversation)

📚 详细文档

开发相关

Kaggle Notebook：Text Summarization with Large Language Models

训练参数

evaluation_strategy = "epoch",
save_strategy = 'epoch',
load_best_model_at_end = True,
metric_for_best_model = 'eval_loss',
seed = 42,
learning_rate=2e-5,
per_device_train_batch_size=4,
per_device_eval_batch_size=4,
gradient_accumulation_steps=2,
weight_decay=0.01,
save_total_limit=2,
num_train_epochs=4,
predict_with_generate=True,
fp16=True,
report_to="none"

参考资料

本模型基于原始的 BART 架构，详情可参考： Lewis et al. (2019). BART: Denoising Sequence-to-Sequence Pre-training for Natural Language Generation, Translation, and Comprehension. arXiv:1910.13461