vit5-base-vietnews-summarization开源模型 - 免费支持越南语文本自动摘要生成

首页

Vit5 Base Vietnews Summarization

由 VietAI 开发

基于ViT5-Base模型在越南新闻数据集上微调的文本摘要生成模型，支持越南语文本的自动摘要生成。

文本生成其他开源协议:MIT #越南语摘要生成 #文本到文本转换 #预训练模型

下载量 1,145

发布时间 : 9/7/2022

模型简介

该模型是一个预训练的Transformer编码器-解码器模型，专门针对越南语文本摘要任务进行了优化，能够生成高质量、简洁的摘要。

模型特点

越南语优化

专门针对越南语文本处理和摘要生成进行了优化

高性能

在VietNews数据集上达到了最先进的性能水平

易于使用

提供简单的Hugging Face接口，便于集成到现有系统中

模型能力

越南语文本理解

自动摘要生成

长文本压缩

使用案例

新闻媒体

新闻文章摘要

自动生成新闻文章的简短摘要

帮助读者快速了解新闻要点

内容分析

文档内容提取

从长文档中提取关键信息

提高信息处理效率

🚀 ViT5-Base在`vietnews`摘要式文本摘要任务上微调（无需前缀）

这是一个基于Transformer架构的预训练编码器 - 解码器模型，在越南语处理方面达到了当前最优水平。它能有效解决越南语文本摘要的问题，为越南语相关的自然语言处理任务提供强大支持。

🚀 快速开始

如需更多详细信息，请查看我们的Github仓库和评估脚本。

💻 使用示例

基础用法

from transformers import AutoTokenizer, AutoModelForSeq2SeqLM

tokenizer = AutoTokenizer.from_pretrained("VietAI/vit5-base-vietnews-summarization")  
model = AutoModelForSeq2SeqLM.from_pretrained("VietAI/vit5-base-vietnews-summarization")
model.cuda()

sentence = "VietAI là tổ chức phi lợi nhuận với sứ mệnh ươm mầm tài năng về trí tuệ nhân tạo và xây dựng một cộng đồng các chuyên gia trong lĩnh vực trí tuệ nhân tạo đẳng cấp quốc tế tại Việt Nam."
sentence = sentence + "</s>"
encoding = tokenizer(sentence, return_tensors="pt")
input_ids, attention_masks = encoding["input_ids"].to("cuda"), encoding["attention_mask"].to("cuda")
outputs = model.generate(
    input_ids=input_ids, attention_mask=attention_masks,
    max_length=256,
    early_stopping=True
)
for output in outputs:
    line = tokenizer.decode(output, skip_special_tokens=True, clean_up_tokenization_spaces=True)
    print(line)

📚 详细文档

属性	详情
数据集	cc100
标签	文本摘要

📄 许可证

本项目采用MIT许可证。

📖 引用

@inproceedings{phan-etal-2022-vit5,
    title = "{V}i{T}5: Pretrained Text-to-Text Transformer for {V}ietnamese Language Generation",
    author = "Phan, Long and Tran, Hieu and Nguyen, Hieu and Trinh, Trieu H.",
    booktitle = "Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies: Student Research Workshop",
    year = "2022",
    publisher = "Association for Computational Linguistics",
    url = "https://aclanthology.org/2022.naacl-srw.18",
    pages = "136--142",
}