ke-t5-large-ko open-source cross-lingual model - Korean-English knowledge-grounded response generation

Ke T5 Large Ko

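Developed by KETI-AIR

A T5 model pretrained on Korean and English that supports cross-lingual knowledge-grounded response generation

Large language model | Korean | License: Apache-2.0 | #Korean-English bilingual pretraining #open-domain dialogue generation #cross-lingual knowledge transfer

Downloads: 17

Released: 3/2/2022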

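Model overview

This is a pretrained model based on the T5 architecture and trained on Korean and English. Its main use is response generation in open-domain dialogue systems.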

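Model features

Cross-lingual knowledge use

Korean dialogue performance can improve even when only English knowledge is provided

Bilingual pretraining

Pretrained on Korean and English corpora together

Knowledge-grounded dialogue

Uses external knowledge to generate more accurate dialogue responses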

Model capabilities

Text generation

Cross-lingual knowledge transfer

Open-domain dialogue

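Use cases

Dialogue systems

Korean open-domain chatbot

Build chatbots that hold natural conversations in Korean

Korean dialogue quality can improve even when only English knowledge is used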
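The cross-lingual setup described above (English knowledge, Korean dialogue) can be pictured as one text-to-text input. The following is a minimal sketch under stated assumptions: the `knowledge:` and `dialogue:` field labels, the ` | ` turn separator, and the function name `build_grounded_input` are all hypothetical and are not taken from the KE-T5 paper or repository. A fine-tuned model would expect whatever template it was trained with.

```python
from typing import Sequence


def build_grounded_input(knowledge: str, history: Sequence[str], max_turns: int = 3) -> str:
    """Join an English knowledge passage with the most recent Korean dialogue turns.

    The template is hypothetical and only illustrates the idea of pairing
    knowledge in one language with dialogue in another.
    """
    if max_turns < 1:
        raise ValueError("max_turns must be at least 1")
    # Keep only the last `max_turns` turns and drop empty ones.
    recent = [turn.strip() for turn in history[-max_turns:] if turn.strip()]
    return f"knowledge: {knowledge.strip()} dialogue: {' | '.join(recent)}"


# Illustrative data: English knowledge, Korean dialogue history.
english_knowledge = "Kimchi is a traditional Korean side dish of salted and fermented vegetables."
korean_history = ["김치 좋아하세요?", "네, 아주 좋아해요."]
print(build_grounded_input(english_knowledge, korean_history))
```

Keeping the knowledge and the dialogue in one string matches the text-to-text style of T5 models; truncating the history to a few turns is one simple way to stay within the input length limit.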

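🚀 ke-t5-large-ko

ke-t5-large-ko is a T5 model pretrained on Korean and English. It is intended as a starting point for natural language processing tasks, particularly cross-lingual Korean-English ones. For more details, see the project's GitHub repository and the paper (a Korean-language paper is also available).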

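🚀 Quick start

Loading the model and tokenizer

from transformers import AutoModel, AutoTokenizer

# Weights are downloaded from the Hugging Face Hub on first use.
# AutoModel loads the bare encoder-decoder, without a language-modeling head.
model = AutoModel.from_pretrained("KETI-AIR/ke-t5-large-ko")
tokenizer = AutoTokenizer.from_pretrained("KETI-AIR/ke-t5-large-ko")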
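To generate text, the model needs a language-modeling head, which `AutoModel` does not attach. The sketch below assumes the checkpoint loads with `AutoModelForSeq2SeqLM` (the usual transformers class for T5 checkpoints) and that PyTorch is installed; the T5 tokenizer may also require the `sentencepiece` package. The helper names `load_seq2seq` and `generate_text` are hypothetical. Because this is a pretrained checkpoint, its raw output may not be a sensible dialogue reply until the model is fine-tuned.

```python
MODEL_ID = "KETI-AIR/ke-t5-large-ko"


def load_seq2seq(model_id: str = MODEL_ID):
    """Load the tokenizer and a T5 model with a generation head (downloads on first call)."""
    # Imported lazily so generate_text can be used without importing transformers up front.
    from transformers import AutoModelForSeq2SeqLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(model_id)
    model = AutoModelForSeq2SeqLM.from_pretrained(model_id)
    model.eval()  # inference mode: disables dropout
    return model, tokenizer


def generate_text(model, tokenizer, text: str, max_new_tokens: int = 32) -> str:
    """Encode `text`, run generation, and decode the first returned sequence."""
    inputs = tokenizer(text, return_tensors="pt")
    output_ids = model.generate(**inputs, max_new_tokens=max_new_tokens)
    return tokenizer.decode(output_ids[0], skip_special_tokens=True)
```

Calling `model, tokenizer = load_seq2seq()` and then `generate_text(model, tokenizer, "아버지가 방에 들어가신다.")` would run a single generation pass. Loading is kept in its own function so the large download happens only when explicitly requested.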

📄 License

This project is licensed under Apache-2.0.

📚 Documentation

BibTeX entry and citation information

@inproceedings{kim-etal-2021-model-cross,
    title = "A Model of Cross-Lingual Knowledge-Grounded Response Generation for Open-Domain Dialogue Systems",
    author = "Kim, San  and
      Jang, Jin Yea  and
      Jung, Minyoung  and
      Shin, Saim",
    booktitle = "Findings of the Association for Computational Linguistics: EMNLP 2021",
    month = nov,
    year = "2021",
    address = "Punta Cana, Dominican Republic",
    publisher = "Association for Computational Linguistics",
    url = "https://aclanthology.org/2021.findings-emnlp.33",
    doi = "10.18653/v1/2021.findings-emnlp.33",
    pages = "352--365",
    abstract = "Research on open-domain dialogue systems that allow free topics is challenging in the field of natural language processing (NLP). The performance of the dialogue system has been improved recently by the method utilizing dialogue-related knowledge; however, non-English dialogue systems suffer from reproducing the performance of English dialogue systems because securing knowledge in the same language with the dialogue system is relatively difficult. Through experiments with a Korean dialogue system, this paper proves that the performance of a non-English dialogue system can be improved by utilizing English knowledge, highlighting the system uses cross-lingual knowledge. For the experiments, we 1) constructed a Korean version of the Wizard of Wikipedia dataset, 2) built Korean-English T5 (KE-T5), a language model pre-trained with Korean and English corpus, and 3) developed a knowledge-grounded Korean dialogue model based on KE-T5. We observed the performance improvement in the open-domain Korean dialogue model even only English knowledge was given. The experimental results showed that the knowledge inherent in cross-lingual language models can be helpful for generating responses in open dialogue systems.",
}

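Model information

Property	Details
Model type	Pretrained T5 model
Training data	Korean and English corpora
EOS token	"</s>"
Tags	t5

Widget example

- Input text: 아버지가 방에 들어가신다.</s> ("Father goes into the room.")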
