vit5-large-vietnews-summarization开源模型 - 免费部署实现越南语新闻精准摘要

首页

Vit5 Large Vietnews Summarization

由 VietAI 开发

越南语最先进的预训练Transformer编码器-解码器模型，专门针对越南语新闻摘要任务进行了微调。

文本生成

Transformers

其他开源协议:MIT #越南语摘要生成 #文本到文本转换 #预训练Transformer

下载量 1,210

发布时间 : 5/12/2022

模型简介

该模型是基于ViT5-large架构的文本到文本转换模型，专门用于越南语新闻摘要生成任务。

模型特点

越南语优化

专门针对越南语文本处理和生成进行了优化

摘要生成能力

能够从越南语新闻文本中生成高质量的摘要

预训练模型

基于大规模预训练的Transformer架构，具有强大的语言理解能力

模型能力

越南语文本理解

新闻摘要生成

文本压缩

使用案例

新闻媒体

新闻摘要自动生成

自动从越南语新闻文章中生成简明扼要的摘要

在VietNews数据集上表现优异

内容管理

文档摘要

为长文档生成关键信息摘要

🚀 ViT5-large在`vietnews`摘要生成任务上微调模型

这是一个基于Transformer架构的预训练编码器 - 解码器模型，在越南语摘要生成任务上达到了当前最优水平。它能高效地处理越南语文本，为越南语的摘要生成任务提供强大支持。

🚀 快速开始

若需更多详细信息，请查看我们的GitHub仓库和评估脚本。

💻 使用示例

基础用法

from transformers import AutoTokenizer, AutoModelForSeq2SeqLM

tokenizer = AutoTokenizer.from_pretrained("VietAI/vit5-large-vietnews-summarization")  
model = AutoModelForSeq2SeqLM.from_pretrained("VietAI/vit5-large-vietnews-summarization")
model.cuda()

sentence = "VietAI là tổ chức phi lợi nhuận với sứ mệnh ươm mầm tài năng về trí tuệ nhân tạo và xây dựng một cộng đồng các chuyên gia trong lĩnh vực trí tuệ nhân tạo đẳng cấp quốc tế tại Việt Nam."
text =  "vietnews: " + sentence + " </s>"
encoding = tokenizer(text, return_tensors="pt")
input_ids, attention_masks = encoding["input_ids"].to("cuda"), encoding["attention_mask"].to("cuda")
outputs = model.generate(
    input_ids=input_ids, attention_mask=attention_masks,
    max_length=256,
    early_stopping=True
)
for output in outputs:
    line = tokenizer.decode(output, skip_special_tokens=True, clean_up_tokenization_spaces=True)
    print(line)

📄 许可证

本项目采用MIT许可证。

📚 引用信息

@inproceedings{phan-etal-2022-vit5,
    title = "{V}i{T}5: Pretrained Text-to-Text Transformer for {V}ietnamese Language Generation",
    author = "Phan, Long and Tran, Hieu and Nguyen, Hieu and Trinh, Trieu H.",
    booktitle = "Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies: Student Research Workshop",
    year = "2022",
    publisher = "Association for Computational Linguistics",
    url = "https://aclanthology.org/2022.naacl-srw.18",
    pages = "136--142",
}