re2g-qry-encoder-nq開源模型 - 用於知識任務的端到端問題編碼工具

首頁

Re2g Qry Encoder Nq

由ibm-research開發

Re2G是一個結合神經檢索、重排序和生成的端到端系統，用於知識密集型任務。該模型是其NQ（Natural Questions）問題編碼器組件。

問答系統

Transformers

開源協議:Apache-2.0 #檢索增強生成 #多源重排序 #知識蒸餾

下載量 14

發布時間 : 7/29/2022

模型概述

該模型是一個查詢/段落重排序器，用於將問題編碼為向量以進行信息檢索。它是Re2G系統的關鍵組件，支持BM25和神經檢索的集成。

模型特點

神經檢索與重排序集成

將神經初始檢索和重排序結合到序列生成中，支持不同檢索方法的得分整合

端到端知識蒸餾

僅使用目標序列輸出的真實情況來訓練檢索、重排序和生成組件

多任務適配性

在零樣本槽填充、問答、事實核查和對話等多種任務中表現優異

模型能力

問題編碼

信息檢索

段落重排序

知識密集型任務處理

使用案例

知識獲取

零樣本槽填充

在沒有特定訓練數據的情況下填充信息槽位

相比之前技術提升9%-34%

開放域問答

回答需要外部知識的事實性問題

內容驗證

事實核查

驗證陳述的事實準確性

🚀 Re2G中NQ問題編碼器的模型卡片

本模型主要用於將問題編碼為向量，可作為近似最近鄰索引的查詢向量，結合上下文編碼器對段落進行編碼和索引，在信息檢索、問答等任務中表現出色。

🚀 快速開始

訓練、評估和推理的代碼可在我們的GitHub的 re2g分支中找到。使用該模型的最佳方式是調整 dpr_apply.py。

✨ 主要特性

RAG、Multi - DPR和KGI的方法是訓練一個神經信息檢索（IR）組件，並通過其在生成正確輸出中的影響進行端到端的進一步訓練。

模型架構

📚 詳細文檔

模型詳情

RAG、Multi - DPR和KGI的方法是訓練一個神經信息檢索（IR）組件，並通過其在生成正確輸出中的影響進行端到端的進一步訓練。

訓練、評估和推理

訓練、評估和推理的代碼在我們GitHub的 re2g分支中。

使用方法

使用該模型的最佳方式是調整 dpr_apply.py。

用途

直接使用

該模型可用於將問題編碼為向量，作為近似最近鄰索引的查詢向量。它必須與將段落編碼為向量並進行索引的上下文編碼器結合使用。

模型描述

模型創建者在相關論文中指出：正如GPT - 3和T5所展示的，隨著參數空間越來越大，Transformer的能力也在不斷增強。然而，對於需要大量知識的任務，非參數內存允許模型以亞線性的計算成本和GPU內存需求大幅增長。最近的模型如RAG和REALM已將檢索引入條件生成。這些模型包含從段落語料庫進行的神經初始檢索。我們在此研究基礎上，提出了Re2G，它將神經初始檢索和重排序結合到基於BART的序列到序列生成中。我們的重排序方法還允許合併來自分數不可比來源的檢索結果，從而實現BM25和神經初始檢索的集成。為了端到端地訓練我們的系統，我們引入了一種新穎的知識蒸餾變體，僅使用目標序列輸出的真實標籤來訓練初始檢索、重排序器和生成器。我們在四個不同的任務中取得了顯著的收益：零樣本插槽填充、問答、事實核查和對話，在KILT排行榜上相對於之前的最先進技術有9%到34%的相對提升。我們將代碼開源。

屬性	詳情
開發者	IBM
共享者	IBM
模型類型	查詢/段落重排序器
語言（NLP）	英語
許可證	Apache 2.0
父模型	dpr - question_encoder - multiset - base
更多信息資源	GitHub倉庫、相關論文

引用

@inproceedings{glass-etal-2022-re2g,
    title = "{R}e2{G}: Retrieve, Rerank, Generate",
    author = "Glass, Michael  and
      Rossiello, Gaetano  and
      Chowdhury, Md Faisal Mahbub  and
      Naik, Ankita  and
      Cai, Pengshan  and
      Gliozzo, Alfio",
    booktitle = "Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies",
    month = jul,
    year = "2022",
    address = "Seattle, United States",
    publisher = "Association for Computational Linguistics",
    url = "https://aclanthology.org/2022.naacl-main.194",
    doi = "10.18653/v1/2022.naacl-main.194",
    pages = "2701--2715",
    abstract = "As demonstrated by GPT-3 and T5, transformers grow in capability as parameter spaces become larger and larger. However, for tasks that require a large amount of knowledge, non-parametric memory allows models to grow dramatically with a sub-linear increase in computational cost and GPU memory requirements. Recent models such as RAG and REALM have introduced retrieval into conditional generation. These models incorporate neural initial retrieval from a corpus of passages. We build on this line of research, proposing Re2G, which combines both neural initial retrieval and reranking into a BART-based sequence-to-sequence generation. Our reranking approach also permits merging retrieval results from sources with incomparable scores, enabling an ensemble of BM25 and neural initial retrieval. To train our system end-to-end, we introduce a novel variation of knowledge distillation to train the initial retrieval, reranker and generation using only ground truth on the target sequence output. We find large gains in four diverse tasks: zero-shot slot filling, question answering, fact checking and dialog, with relative gains of 9{\%} to 34{\%} over the previous state-of-the-art on the KILT leaderboard. We make our code available as open source.",
}