vit5-large-vietnews-summarization開源模型 - 免費部署實現越南語新聞精準摘要

首頁

Vit5 Large Vietnews Summarization

由VietAI開發

越南語最先進的預訓練Transformer編碼器-解碼器模型，專門針對越南語新聞摘要任務進行了微調。

文本生成

Transformers

其他開源協議:MIT #越南語摘要生成 #文本到文本轉換 #預訓練Transformer

下載量 1,210

發布時間 : 5/12/2022

模型概述

該模型是基於ViT5-large架構的文本到文本轉換模型，專門用於越南語新聞摘要生成任務。

模型特點

越南語優化

專門針對越南語文本處理和生成進行了優化

摘要生成能力

能夠從越南語新聞文本中生成高質量的摘要

預訓練模型

基於大規模預訓練的Transformer架構，具有強大的語言理解能力

模型能力

越南語文本理解

新聞摘要生成

文本壓縮

使用案例

新聞媒體

新聞摘要自動生成

自動從越南語新聞文章中生成簡明扼要的摘要

在VietNews數據集上表現優異

內容管理

文檔摘要

為長文檔生成關鍵信息摘要

🚀 ViT5-large在`vietnews`摘要生成任務上微調模型

這是一個基於Transformer架構的預訓練編碼器 - 解碼器模型，在越南語摘要生成任務上達到了當前最優水平。它能高效地處理越南語文本，為越南語的摘要生成任務提供強大支持。

🚀 快速開始

若需更多詳細信息，請查看我們的GitHub倉庫和評估腳本。

💻 使用示例

基礎用法

from transformers import AutoTokenizer, AutoModelForSeq2SeqLM

tokenizer = AutoTokenizer.from_pretrained("VietAI/vit5-large-vietnews-summarization")  
model = AutoModelForSeq2SeqLM.from_pretrained("VietAI/vit5-large-vietnews-summarization")
model.cuda()

sentence = "VietAI là tổ chức phi lợi nhuận với sứ mệnh ươm mầm tài năng về trí tuệ nhân tạo và xây dựng một cộng đồng các chuyên gia trong lĩnh vực trí tuệ nhân tạo đẳng cấp quốc tế tại Việt Nam."
text =  "vietnews: " + sentence + " </s>"
encoding = tokenizer(text, return_tensors="pt")
input_ids, attention_masks = encoding["input_ids"].to("cuda"), encoding["attention_mask"].to("cuda")
outputs = model.generate(
    input_ids=input_ids, attention_mask=attention_masks,
    max_length=256,
    early_stopping=True
)
for output in outputs:
    line = tokenizer.decode(output, skip_special_tokens=True, clean_up_tokenization_spaces=True)
    print(line)

📄 許可證

本項目採用MIT許可證。

📚 引用信息

@inproceedings{phan-etal-2022-vit5,
    title = "{V}i{T}5: Pretrained Text-to-Text Transformer for {V}ietnamese Language Generation",
    author = "Phan, Long and Tran, Hieu and Nguyen, Hieu and Trinh, Trieu H.",
    booktitle = "Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies: Student Research Workshop",
    year = "2022",
    publisher = "Association for Computational Linguistics",
    url = "https://aclanthology.org/2022.naacl-srw.18",
    pages = "136--142",
}