ke-t5-large开源模型 - 支持韩语英语，跨语言知识驱动响应轻松生成

首页

Ke T5 Large

由 KETI-AIR 开发

基于韩语和英语预训练的T5模型，适用于跨语言知识驱动的响应生成任务

大型语言模型支持多种语言开源协议:Apache-2.0 #韩英双语模型 #开放域对话生成 #跨语言知识迁移

下载量 147

发布时间 : 3/2/2022

模型简介

该模型是一个基于T5架构的双语（韩语和英语）预训练模型，主要用于开放域对话系统中的跨语言响应生成任务。通过利用英语知识提升非英语对话系统的性能。

模型特点

跨语言知识迁移

能够利用英语知识提升韩语对话系统的性能

双语预训练

同时在韩语和英语语料上进行预训练

开放域对话优化

专门针对开放域对话场景进行优化

模型能力

文本生成

跨语言知识迁移

开放域对话响应生成

使用案例

对话系统

韩语开放域对话

用于韩语开放域对话系统中的响应生成

即使仅提供英语知识，也能提升韩语对话模型的性能

跨语言知识应用

将英语知识应用于非英语对话系统

证明跨语言模型内嵌的知识有助于开放对话系统中的响应生成

🚀 ke - t5 base

ke - t5 base是一个在韩语和英语上进行预训练的T5模型。该模型可用于处理涉及韩语和英语的自然语言处理任务，为跨语言的文本处理提供了强大的支持。如需了解更多详细信息，请查看 Github 和论文韩语论文。

🚀 快速开始

模型使用示例

以下是使用该模型的示例代码：

from transformers import AutoModel, AutoTokenizer

model = AutoModel.from_pretrained("KETI-AIR/ke-t5-large")
tokenizer = AutoTokenizer.from_pretrained("KETI-AIR/ke-t5-large")

📚 详细文档

BibTeX引用和引用信息

如果您在研究中使用了该模型，可以使用以下BibTeX条目进行引用：

@inproceedings{kim-etal-2021-model-cross,
    title = "A Model of Cross-Lingual Knowledge-Grounded Response Generation for Open-Domain Dialogue Systems",
    author = "Kim, San  and
      Jang, Jin Yea  and
      Jung, Minyoung  and
      Shin, Saim",
    booktitle = "Findings of the Association for Computational Linguistics: EMNLP 2021",
    month = nov,
    year = "2021",
    address = "Punta Cana, Dominican Republic",
    publisher = "Association for Computational Linguistics",
    url = "https://aclanthology.org/2021.findings-emnlp.33",
    doi = "10.18653/v1/2021.findings-emnlp.33",
    pages = "352--365",
    abstract = "Research on open-domain dialogue systems that allow free topics is challenging in the field of natural language processing (NLP). The performance of the dialogue system has been improved recently by the method utilizing dialogue-related knowledge; however, non-English dialogue systems suffer from reproducing the performance of English dialogue systems because securing knowledge in the same language with the dialogue system is relatively difficult. Through experiments with a Korean dialogue system, this paper proves that the performance of a non-English dialogue system can be improved by utilizing English knowledge, highlighting the system uses cross-lingual knowledge. For the experiments, we 1) constructed a Korean version of the Wizard of Wikipedia dataset, 2) built Korean-English T5 (KE-T5), a language model pre-trained with Korean and English corpus, and 3) developed a knowledge-grounded Korean dialogue model based on KE-T5. We observed the performance improvement in the open-domain Korean dialogue model even only English knowledge was given. The experimental results showed that the knowledge inherent in cross-lingual language models can be helpful for generating responses in open dialogue systems.",
}