🚀 mT5-large Query Generation Model
This project provides a query generation model based on mT5-large, trained on XOR QA data. Given a passage, the model generates questions in different languages, which is valuable for applications such as cross-lingual dense retrieval.
🚀 Quick Start
This model is an mT5-large query generation model trained on XOR QA data. It was used in the two papers listed in the Citation section at the end of this README.
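To run the example below you need 🤗 Transformers with a PyTorch backend, plus the SentencePiece tokenizer that mT5 relies on (a typical setup; this project does not pin exact versions):

pip install transformers torch sentencepiece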
💻 Usage Example
Basic Usage
from transformers import pipeline

# Map XOR-TyDi language codes to the language names used in the prompt
lang2mT5 = dict(
    ar='Arabic',
    bn='Bengali',
    fi='Finnish',
    ja='Japanese',
    ko='Korean',
    ru='Russian',
    te='Telugu',
)
PROMPT = 'Generate a {lang} question for this passage: {title} {passage}'

title = 'Transformer (machine learning model)'
passage = 'A transformer is a deep learning model that adopts the mechanism of self-attention, differentially ' \
          'weighting the significance of each part of the input (which includes the recursive output) data.'

model_name_or_path = 'ielabgroup/xor-tydi-docTquery-mt5-large'

# Fill the prompt template for the target language (Japanese here)
input_text = PROMPT.format_map({'lang': lang2mT5['ja'],
                                'title': title,
                                'passage': passage})

generator = pipeline(model=model_name_or_path,
                     task='text2text-generation',
                     device="cuda:0",
                     )

# Sample 10 candidate queries for the passage
results = generator(input_text,
                    do_sample=True,
                    max_length=64,
                    num_return_sequences=10,
                    )

for i, result in enumerate(results):
    print(f'{i + 1}. {result["generated_text"]}')
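Because the prompt template takes the language name as a plain string, the same pipeline can generate queries for any of the seven supported languages. A minimal sketch that reuses the generator, PROMPT, title, and passage defined above, sampling one query per language:

# Generate one sampled query per supported XOR-TyDi language
for code, lang in lang2mT5.items():
    text = PROMPT.format_map({'lang': lang, 'title': title, 'passage': passage})
    result = generator(text, do_sample=True, max_length=64, num_return_sequences=1)
    print(f'{code}: {result[0]["generated_text"]}')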
📄 License
This project is licensed under the Apache-2.0 license.
📚 Citation
If you use this model in your research, please cite it with the following BibTeX entries:
@article{zhuang2022bridging,
title={Bridging the gap between indexing and retrieval for differentiable search index with query generation},
author={Zhuang, Shengyao and Ren, Houxing and Shou, Linjun and Pei, Jian and Gong, Ming and Zuccon, Guido and Jiang, Daxin},
journal={arXiv preprint arXiv:2206.10128},
year={2022}
}
@inproceedings{zhuang2023augmenting,
title={Augmenting Passage Representations with Query Generation for Enhanced Cross-Lingual Dense Retrieval},
author={Zhuang, Shengyao and Shou, Linjun and Zuccon, Guido},
booktitle={Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval},
year={2023}
}