vit5-base-vietnews-summarization開源模型 - 免費支持越南語文本自動摘要生成

首頁

Vit5 Base Vietnews Summarization

由VietAI開發

基於ViT5-Base模型在越南新聞數據集上微調的文本摘要生成模型，支持越南語文本的自動摘要生成。

文本生成其他開源協議:MIT #越南語摘要生成 #文本到文本轉換 #預訓練模型

下載量 1,145

發布時間 : 9/7/2022

模型概述

該模型是一個預訓練的Transformer編碼器-解碼器模型，專門針對越南語文本摘要任務進行了優化，能夠生成高質量、簡潔的摘要。

模型特點

越南語優化

專門針對越南語文本處理和摘要生成進行了優化

高性能

在VietNews數據集上達到了最先進的性能水平

易於使用

提供簡單的Hugging Face接口，便於集成到現有系統中

模型能力

越南語文本理解

自動摘要生成

長文本壓縮

使用案例

新聞媒體

新聞文章摘要

自動生成新聞文章的簡短摘要

幫助讀者快速瞭解新聞要點

內容分析

文檔內容提取

從長文檔中提取關鍵信息

提高信息處理效率

🚀 ViT5-Base在`vietnews`摘要式文本摘要任務上微調（無需前綴）

這是一個基於Transformer架構的預訓練編碼器 - 解碼器模型，在越南語處理方面達到了當前最優水平。它能有效解決越南語文本摘要的問題，為越南語相關的自然語言處理任務提供強大支持。

🚀 快速開始

如需更多詳細信息，請查看我們的Github倉庫和評估腳本。

💻 使用示例

基礎用法

from transformers import AutoTokenizer, AutoModelForSeq2SeqLM

tokenizer = AutoTokenizer.from_pretrained("VietAI/vit5-base-vietnews-summarization")  
model = AutoModelForSeq2SeqLM.from_pretrained("VietAI/vit5-base-vietnews-summarization")
model.cuda()

sentence = "VietAI là tổ chức phi lợi nhuận với sứ mệnh ươm mầm tài năng về trí tuệ nhân tạo và xây dựng một cộng đồng các chuyên gia trong lĩnh vực trí tuệ nhân tạo đẳng cấp quốc tế tại Việt Nam."
sentence = sentence + "</s>"
encoding = tokenizer(sentence, return_tensors="pt")
input_ids, attention_masks = encoding["input_ids"].to("cuda"), encoding["attention_mask"].to("cuda")
outputs = model.generate(
    input_ids=input_ids, attention_mask=attention_masks,
    max_length=256,
    early_stopping=True
)
for output in outputs:
    line = tokenizer.decode(output, skip_special_tokens=True, clean_up_tokenization_spaces=True)
    print(line)

📚 詳細文檔

屬性	詳情
數據集	cc100
標籤	文本摘要

📄 許可證

本項目採用MIT許可證。

📖 引用

@inproceedings{phan-etal-2022-vit5,
    title = "{V}i{T}5: Pretrained Text-to-Text Transformer for {V}ietnamese Language Generation",
    author = "Phan, Long and Tran, Hieu and Nguyen, Hieu and Trinh, Trieu H.",
    booktitle = "Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies: Student Research Workshop",
    year = "2022",
    publisher = "Association for Computational Linguistics",
    url = "https://aclanthology.org/2022.naacl-srw.18",
    pages = "136--142",
}