🚀 ptt5-v2-base
The ptt5-v2 models are pretrained T5 models tailored to the Portuguese language, obtained by continuing training from Google's original checkpoints, with sizes ranging from t5-small to t5-3B. These checkpoints were used to train Portuguese MonoT5 rerankers, which can be found in their HuggingFace collection. For more information about the pretraining process, please refer to our paper, ptt5-v2: A Closer Look at Continued Pretraining of T5 Models for the Portuguese Language.
🚀 Quick Start
Model Information

| Property | Details |
| --- | --- |
| Datasets | allenai/c4, legacy-datasets/mc4 |
| Language | Portuguese (pt) |
| Task Type | Text-to-Text Generation |
| Base Model | google-t5/t5-base |
| License | apache-2.0 |
💻 Usage Examples
Basic Usage
from transformers import T5Tokenizer, T5ForConditionalGeneration

# Load the ptt5-v2-base tokenizer and model from the Hugging Face Hub
tokenizer = T5Tokenizer.from_pretrained("unicamp-dl/ptt5-v2-base")
model = T5ForConditionalGeneration.from_pretrained("unicamp-dl/ptt5-v2-base")
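Since ptt5-v2-base is a pretrained checkpoint rather than a fine-tuned one, a simple sanity check is to have it fill a masked span using T5's sentinel tokens. The snippet below is an illustrative sketch, not part of the official card; the example sentence is made up.

# Illustrative sketch (assumption, not from the model card): fill a masked span
# in a Portuguese sentence using the <extra_id_0> sentinel token.
input_text = "A capital do Brasil é <extra_id_0>."
input_ids = tokenizer(input_text, return_tensors="pt").input_ids
outputs = model.generate(input_ids, max_new_tokens=10)
print(tokenizer.decode(outputs[0], skip_special_tokens=False))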
📄 License
This project is released under the apache-2.0 license.
📚 Detailed Documentation
Citation Information
If you use our models, please cite as follows:
@article{piau2024ptt5v2,
title={ptt5-v2: A Closer Look at Continued Pretraining of T5 Models for the Portuguese Language},
author={Marcos Piau and Roberto Lotufo and Rodrigo Nogueira},
year={2024},
eprint={2406.10806},
archivePrefix={arXiv},
    primaryClass={cs.CL}
}