オープンソースのre2g - qry - encoder - feverモデル - エンコードされた問題ベクトルで知識密集型タスクの検索を支援

ホーム

Re2g Qry Encoder Fever

ibm-researchによって開発

Re2Gは、知識集約型タスクのためのニューラル初期検索と再ランキングを組み合わせた生成モデルです。この質問エンコーダーは、検索のために質問をベクトルにエンコードするRe2Gシステムの構成要素です。

テキスト埋め込み

Transformers

オープンソースライセンス:Apache-2.0 #検索拡張生成 #マルチソース再ランキング #知識蒸留

ダウンロード数 17

リリース時間 : 8/1/2022

モデル概要

このモデルはRe2Gシステムのクエリエンコーディングコンポーネントで、DPRアーキテクチャに基づいており、自然言語の質問をベクトル表現にエンコードし、パッセージエンコーダーと連携して情報検索と再ランキングを行います。

モデル特徴

エンドツーエンドトレーニング

知識蒸留法により初期検索、再ランキング、ジェネレーターの共同トレーニングを実現

マルチソース検索統合

BM25やニューラル初期検索など異なるソースの検索結果を統合可能

知識集約型タスク最適化

QA、ファクトチェックなど大量の知識を必要とするタスク向けに特別設計

モデル能力

質問エンコーディング

情報検索

検索結果再ランキング

使用事例

知識集約型タスク

ゼロショットスロットフィリング

特定のトレーニングデータなしで構造化スロットを埋める

以前のSOTA比9%-34%向上

ファクトチェック

主張の真偽を検証

質問応答システム

外部知識が必要な複雑な質問に回答

🚀 Re2GにおけるFEVER質問エンコーダのモデルカード

このモデルは、質問をベクトルにエンコードし、近似最近傍探索インデックスへのクエリとして使用するタスクに役立ちます。RAG、Multi - DPR、KGIなどのアプローチと同様に、ニューラルIR（情報検索）コンポーネントをトレーニングし、正しい出力を生成する際の影響を通じてエンドツーエンドでさらにトレーニングします。

🚀 クイックスタート

このモデルの使用に関するコードや詳細な情報は、以下のGitHubリポジトリのre2gブランチにあります。 re2gブランチ

✨ 主な機能

RAG、Multi - DPR、KGIなどのアプローチに基づき、ニューラルIRコンポーネントをトレーニングします。
質問をベクトルにエンコードし、近似最近傍探索インデックスへのクエリとして使用できます。
再ランキングアプローチにより、スコアが比較できないソースからの検索結果を統合できます。

📦 インストール

インストールに関する具体的な手順は、GitHubリポジトリのre2gブランチを参照してください。 re2gブランチ

💻 使用例

基本的な使用法

このモデルを使用する最良の方法は、dpr_apply.pyを適応させることです。

📚 ドキュメント

モデル詳細

RAG、Multi - DPR、およびKGIのアプローチは、ニューラルIR（情報検索）コンポーネントをトレーニングし、正しい出力を生成する際の影響を通じてエンドツーエンドでさらにトレーニングするものです。

トレーニング、評価、推論

トレーニング、評価、および推論のコードは、GitHubのre2gブランチにあります。

モデルの使用

このモデルは、質問をベクトルにエンコードし、近似最近傍探索インデックスへのクエリとして使用するタスクに使用できます。ただし、パッセージをベクトルにエンコードしてインデックス化するコンテキストエンコーダと組み合わせて使用する必要があります。

引用

@inproceedings{glass-etal-2022-re2g,
    title = "{R}e2{G}: Retrieve, Rerank, Generate",
    author = "Glass, Michael  and
      Rossiello, Gaetano  and
      Chowdhury, Md Faisal Mahbub  and
      Naik, Ankita  and
      Cai, Pengshan  and
      Gliozzo, Alfio",
    booktitle = "Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies",
    month = jul,
    year = "2022",
    address = "Seattle, United States",
    publisher = "Association for Computational Linguistics",
    url = "https://aclanthology.org/2022.naacl-main.194",
    doi = "10.18653/v1/2022.naacl-main.194",
    pages = "2701--2715",
    abstract = "As demonstrated by GPT - 3 and T5, transformers grow in capability as parameter spaces become larger and larger. However, for tasks that require a large amount of knowledge, non - parametric memory allows models to grow dramatically with a sub - linear increase in computational cost and GPU memory requirements. Recent models such as RAG and REALM have introduced retrieval into conditional generation. These models incorporate neural initial retrieval from a corpus of passages. We build on this line of research, proposing Re2G, which combines both neural initial retrieval and reranking into a BART - based sequence - to - sequence generation. Our reranking approach also permits merging retrieval results from sources with incomparable scores, enabling an ensemble of BM25 and neural initial retrieval. To train our system end - to - end, we introduce a novel variation of knowledge distillation to train the initial retrieval, reranker and generation using only ground truth on the target sequence output. We find large gains in four diverse tasks: zero - shot slot filling, question answering, fact checking and dialog, with relative gains of 9{\%} to 34{\%} over the previous state - of - the - art on the KILT leaderboard. We make our code available as open source.",
}

📄 ライセンス

このモデルは、Apache 2.0ライセンスの下で提供されています。

📚 詳細情報

属性	详情
開発者	IBM
モデルタイプ	クエリ/パッセージ再ランキング
言語	英語
ライセンス	Apache 2.0
親モデル	[dpr - question_encoder - multiset - base](https://huggingface.co/facebook/dpr - question_encoder - multiset - base)
詳細情報リソース	[GitHubリポジトリ](https://github.com/IBM/kgi - slot - filling)、[関連論文](https://aclanthology.org/2022.naacl - main.194.pdf)