🚀 mbart-summarization-fanpage
This model is a fine-tuned version of facebook/mbart-large-cc25 on the Fanpage dataset for abstractive text summarization. It generates concise summaries that retain the key information of a given text, providing an effective solution for abstractive summarization in Italian.
🚀 Quick Start
Code example
from transformers import MBartTokenizer, MBartForConditionalGeneration
tokenizer = MBartTokenizer.from_pretrained("ARTeLab/mbart-summarization-fanpage")
model = MBartForConditionalGeneration.from_pretrained("ARTeLab/mbart-summarization-fanpage")
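The model can also be loaded through the high-level `pipeline` API. The snippet below is a minimal sketch; the input text and the `max_length`/`min_length` values are illustrative assumptions, not settings documented for this model.

```python
from transformers import pipeline

# Load the fine-tuned checkpoint as a summarization pipeline
summarizer = pipeline("summarization", model="ARTeLab/mbart-summarization-fanpage")

# Placeholder Italian text; the length limits are assumed values, tune them for your inputs
article = "Inserire qui il testo dell'articolo in italiano da riassumere."
print(summarizer(article, max_length=100, min_length=20)[0]["summary_text"])
```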
✨ Key Features
After fine-tuning on the Fanpage dataset, the model achieves the following results on the abstractive summarization task (a sketch of how such ROUGE scores are computed follows the list):
- Loss: 2.1833
- Rouge1: 36.5027
- Rouge2: 17.4428
- RougeL: 26.1734
- RougeLsum: 30.2636
- Gen Len: 75.2413
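For reference, ROUGE scores of this kind are typically computed with the `evaluate` library. The snippet below is a sketch of the standard procedure, not the exact evaluation script behind this card; the prediction and reference lists are placeholders, and the `rouge_score` package is assumed to be installed.

```python
import evaluate

# Placeholder model outputs and gold Fanpage summaries
predictions = ["riassunto generato dal modello"]
references = ["riassunto di riferimento"]

# Requires the rouge_score package
rouge = evaluate.load("rouge")
scores = rouge.compute(predictions=predictions, references=references)
print(scores)  # keys: rouge1, rouge2, rougeL, rougeLsum
```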
💻 Usage
Basic usage
from transformers import MBartTokenizer, MBartForConditionalGeneration
tokenizer = MBartTokenizer.from_pretrained("ARTeLab/mbart-summarization-fanpage")
model = MBartForConditionalGeneration.from_pretrained("ARTeLab/mbart-summarization-fanpage")
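Continuing from the loading snippet above, a minimal end-to-end summarization call might look like the following sketch; the input text and the generation parameters (beam count, length limits) are assumptions rather than values documented for this model.

```python
# Placeholder Italian article to summarize
text = "Inserire qui il testo dell'articolo in italiano da riassumere."

# Tokenize and generate; the beam search settings below are illustrative assumptions
inputs = tokenizer(text, return_tensors="pt", truncation=True, max_length=1024)
summary_ids = model.generate(**inputs, num_beams=4, max_length=130, early_stopping=True)
print(tokenizer.decode(summary_ids[0], skip_special_tokens=True))
```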
📚 Documentation
Training hyperparameters
The following hyperparameters were used during training (a configuration sketch follows the table):
| Hyperparameter | Value |
| --- | --- |
| learning_rate | 5e-05 |
| train_batch_size | 1 |
| eval_batch_size | 1 |
| seed | 42 |
| optimizer | Adam (β1=0.9, β2=0.999, ε=1e-08) |
| lr_scheduler_type | linear |
| num_epochs | 4.0 |
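For readers who want to set up a comparable run, the table above maps onto a `Seq2SeqTrainingArguments` configuration roughly as sketched below; the output directory and any argument not listed in the table are assumptions.

```python
from transformers import Seq2SeqTrainingArguments

# Hypothetical configuration mirroring the hyperparameter table; output_dir is a placeholder
training_args = Seq2SeqTrainingArguments(
    output_dir="./mbart-summarization-fanpage",  # assumed path
    learning_rate=5e-05,
    per_device_train_batch_size=1,
    per_device_eval_batch_size=1,
    seed=42,
    num_train_epochs=4.0,
    lr_scheduler_type="linear",
    adam_beta1=0.9,
    adam_beta2=0.999,
    adam_epsilon=1e-08,
)
```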
Framework versions
- Transformers 4.15.0.dev0
- Pytorch 1.10.0+cu102
- Datasets 1.15.1
- Tokenizers 0.10.3
📄 Citation
For more details and results, please refer to the published paper:
@Article{info13050228,
AUTHOR = {Landro, Nicola and Gallo, Ignazio and La Grassa, Riccardo and Federici, Edoardo},
TITLE = {Two New Datasets for Italian-Language Abstractive Text Summarization},
JOURNAL = {Information},
VOLUME = {13},
YEAR = {2022},
NUMBER = {5},
ARTICLE-NUMBER = {228},
URL = {https://www.mdpi.com/2078-2489/13/5/228},
ISSN = {2078-2489},
ABSTRACT = {Text summarization aims to produce a short summary containing relevant parts from a given text. Due to the lack of data for abstractive summarization on low-resource languages such as Italian, we propose two new original datasets collected from two Italian news websites with multi-sentence summaries and corresponding articles, and from a dataset obtained by machine translation of a Spanish summarization dataset. These two datasets are currently the only two available in Italian for this task. To evaluate the quality of these two datasets, we used them to train a T5-base model and an mBART model, obtaining good results with both. To better evaluate the results obtained, we also compared the same models trained on automatically translated datasets, and the resulting summaries in the same training language, with the automatically translated summaries, which demonstrated the superiority of the models obtained from the proposed datasets.},
DOI = {10.3390/info13050228}
}