re2g-qry-encoder-nq open-source model - end-to-end question encoder for knowledge tasks

Re2g Qry Encoder Nq

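Developed by ibm-research

Re2G is an end-to-end system that integrates neural retrieval, reranking, and generation for knowledge-intensive tasks. This model is its NQ (Natural Questions) question encoder component.

Question answering system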

Transformers

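Open-source license: Apache-2.0 #retrieval-augmented generation #multi-source reranking #knowledge distillation

Downloads: 14

Release date: 7/29/2022

Model overview

This model encodes questions into vectors for information retrieval; its model card lists the type as query/passage reranking. It is a key component of the Re2G system, which supports combining BM25 with neural retrieval.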

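Model features

Integrated neural retrieval and reranking

Combines neural initial retrieval and reranking with sequence generation, and supports merging results from retrieval methods whose scores are not directly comparable

End-to-end knowledge distillation

Trains the retrieval, reranking, and generation components using only ground truth on the target sequence output

Multi-task adaptability

Performs well across diverse tasks, including zero-shot slot filling, question answering, fact checking, and dialog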
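The idea of merging results from retrievers with incomparable scores can be sketched in a few lines. This is a minimal illustration, not the Re2G implementation: the helper names (`merge_candidates`, `rerank`, `overlap_score`) are hypothetical, the passages and scores are made up, and the token-overlap scorer is only a stand-in for a trained neural reranker. The point is that the original BM25 and neural scores are discarded, and a single scorer ranks the union of candidates.

```python
def merge_candidates(bm25_hits, neural_hits):
    """Union the passage ids from both retrievers, keeping first-seen order.

    The retrieval scores are dropped on purpose: BM25 scores and
    inner-product scores live on different scales and cannot be compared.
    """
    merged, seen = [], set()
    for pid, _score in bm25_hits + neural_hits:
        if pid not in seen:
            seen.add(pid)
            merged.append(pid)
    return merged


def rerank(question, passage_ids, passages, score_fn):
    """Score every (question, passage) pair with one scorer and sort descending."""
    scored = [(score_fn(question, passages[pid]), pid) for pid in passage_ids]
    scored.sort(key=lambda item: (-item[0], item[1]))
    return [pid for _score, pid in scored]


def overlap_score(question, passage):
    """Hypothetical stand-in for a neural reranker: count shared lowercase tokens."""
    return len(set(question.lower().split()) & set(passage.lower().split()))


# Made-up passages and retrieval results, for illustration only.
passages = {
    "a": "Paris is the capital of France",
    "b": "Berlin is the capital of Germany",
    "c": "France borders Spain and Italy",
}
bm25_hits = [("b", 12.4), ("a", 11.9)]    # unbounded BM25 scores
neural_hits = [("a", 0.83), ("c", 0.79)]  # inner-product scores

candidates = merge_candidates(bm25_hits, neural_hits)
ranked = rerank("what is the capital of France", candidates, passages, overlap_score)
print(candidates, ranked)
```

In a real pipeline, `overlap_score` would be replaced by a model that scores each question and passage pair jointly.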

Model capabilities

Question encoding

Information retrieval

Passage reranking

Knowledge-intensive task processing

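Use cases

Knowledge retrieval

Zero-shot slot filling

Fills information slots without task-specific training data

The paper reports relative gains of 9% to 34% over the previous state of the art on the KILT leaderboard across its four tasks

Open-domain QA

Answers factual questions that require external knowledge

Content verification

Fact checking

Verifies the factual accuracy of statements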

🚀 Model card for the NQ question encoder in Re2G

The approach of RAG, Multi-DPR, and KGI is to train a neural IR (information retrieval) component and then train it further end-to-end through its impact on generating the correct output.
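As a rough illustration of training a retriever "through its impact on generating the correct output", the RAG-Sequence formulation marginalizes the generator's likelihood over the top-k retrieved passages, so the generation loss also sends gradient to the query encoder. The notation below (query encoder $\mathbf{q}$, passage encoder $\mathbf{d}$, retriever $p_\eta$, generator $p_\theta$) follows the RAG paper rather than this model card, and the training procedure used for Re2G differs in its details.

```latex
p_\eta(z \mid x) \propto \exp\!\big(\mathbf{d}(z)^{\top}\mathbf{q}(x)\big),
\qquad
p(y \mid x) \approx \sum_{z \,\in\, \text{top-}k\left(p_\eta(\cdot \mid x)\right)}
  p_\eta(z \mid x)\; p_\theta(y \mid x, z)
```

Here $x$ is the input question, $z$ a retrieved passage, and $y$ the target output sequence.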

🚀 Quick start

The best way to use this model is to adapt dpr_apply.py.

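✨ Key features

This model can be used to encode a question into a vector that serves as a query to an approximate nearest neighbor index. It must be used together with a context encoder, which encodes passages into vectors for indexing.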
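To make the query and index relationship concrete, here is a minimal sketch of the search step, assuming the question and passage vectors already exist. The toy three-dimensional vectors and the `top_k` helper are hypothetical. In practice the query vector would come from this question encoder and the passage vectors from the paired context encoder, and an approximate nearest neighbor index would approximate the exact inner-product ranking computed here.

```python
def dot(u, v):
    """Inner product of two equal-length vectors."""
    return sum(a * b for a, b in zip(u, v))


def top_k(query_vec, passage_vecs, k=2):
    """Exact maximum inner-product search over a small in-memory 'index'.

    An approximate nearest neighbor index trades some accuracy for speed
    but aims to return the same highest-scoring passages.
    """
    scored = [(dot(query_vec, vec), pid) for pid, vec in passage_vecs.items()]
    scored.sort(key=lambda item: (-item[0], item[1]))
    return [(pid, score) for score, pid in scored[:k]]


# Toy vectors standing in for encoder outputs (assumption, not real embeddings).
passage_vecs = {
    "p1": [0.9, 0.1, 0.0],
    "p2": [0.1, 0.8, 0.1],
    "p3": [0.7, 0.2, 0.1],
}
query_vec = [1.0, 0.0, 0.0]

results = top_k(query_vec, passage_vecs, k=2)
print(results)
```

The question and context encoders must be trained together, since the inner product is only meaningful when both map into the same vector space.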

📦 Installation

The code for training, evaluation, and inference is in the re2g branch of the GitHub repository.

📚 Documentation

Model details

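Model information

Attribute	Details
Developed by	IBM
Shared by	IBM
Model type	Query/passage reranking
Language (NLP)	English
License	Apache 2.0
Parent model	dpr-question_encoder-multiset-base
Resources for more information	GitHub repository, associated paper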

Citation

@inproceedings{glass-etal-2022-re2g,
    title = "{R}e2{G}: Retrieve, Rerank, Generate",
    author = "Glass, Michael  and
      Rossiello, Gaetano  and
      Chowdhury, Md Faisal Mahbub  and
      Naik, Ankita  and
      Cai, Pengshan  and
      Gliozzo, Alfio",
    booktitle = "Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies",
    month = jul,
    year = "2022",
    address = "Seattle, United States",
    publisher = "Association for Computational Linguistics",
    url = "https://aclanthology.org/2022.naacl-main.194",
    doi = "10.18653/v1/2022.naacl-main.194",
    pages = "2701--2715",
    abstract = "As demonstrated by GPT-3 and T5, transformers grow in capability as parameter spaces become larger and larger. However, for tasks that require a large amount of knowledge, non-parametric memory allows models to grow dramatically with a sub-linear increase in computational cost and GPU memory requirements. Recent models such as RAG and REALM have introduced retrieval into conditional generation. These models incorporate neural initial retrieval from a corpus of passages. We build on this line of research, proposing Re2G, which combines both neural initial retrieval and reranking into a BART-based sequence-to-sequence generation. Our reranking approach also permits merging retrieval results from sources with incomparable scores, enabling an ensemble of BM25 and neural initial retrieval. To train our system end-to-end, we introduce a novel variation of knowledge distillation to train the initial retrieval, reranker and generation using only ground truth on the target sequence output. We find large gains in four diverse tasks: zero-shot slot filling, question answering, fact checking and dialog, with relative gains of 9{\%} to 34{\%} over the previous state-of-the-art on the KILT leaderboard. We make our code available as open source.",
}