nli-entailment-verifier-xxl開源模型 - 免費驗證前提是否支持假設，多句場景更優化

首頁

Nli Entailment Verifier Xxl

由soumyasanyal開發

基於flan-t5-xxl微調的NLI模型，用於驗證前提是否支持假設，特別優化多句前提場景

大型語言模型

Transformers

英語#自然語言推理 #蘊涵驗證 #思維鏈推理

下載量 164

發布時間 : 1/11/2024

模型概述

該模型專門用於自然語言推理(NLI)任務，可驗證給定前提是否支持某個假設，適用於思維鏈推理等複雜場景。針對多句前提進行了優化訓練。

模型特點

多句前提優化

專門針對多句前提場景進行訓練，適合複雜推理任務

量化支持

支持4位/8位量化以減少GPU內存使用

排序目標微調

採用排序目標進行微調，能從假設對中選出最受支持的假設

模型能力

自然語言推理

蘊涵驗證

邏輯關係判斷

思維鏈推理支持

使用案例

自然語言處理

學術論文推理驗證

驗證研究論文中的結論是否被前提支持

可提供邏輯支持程度的量化評分

法律文書分析

分析法律條文與案件事實之間的邏輯關係

判斷法律條文是否支持特定案件結論

教育評估

學生答案評分

評估學生答案是否被問題前提充分支持

提供答案邏輯正確性的量化評分

🚀 nli-entailment-verifier-xxl

nli-entailment-verifier-xxl 是一個用於驗證給定前提是否支持假設的模型。它基於 flan-t5-xxl 模型進行微調，以排序為目標（從給定前提的一對假設中對最受支持的假設進行排序）。該模型適用於自然語言推理（NLI）風格的數據集和思維鏈（CoT）推理，尤其經過專門訓練以處理多句子前提，這與現代大語言模型（LLM）應用場景中的需求相符。

🚀 快速開始

模型描述

nli-entailment-verifier-xxl 基於 flan-t5-xxl 模型，並通過排序目標進行微調（針對給定前提，從給定的一對假設中對最受支持的假設進行排序）。更多詳細信息請參考我們的論文 Are Machines Better at Complex Reasoning? Unveiling Human-Machine Inference Gaps in Entailment Verification。

該模型旨在驗證給定前提是否支持某個假設，適用於自然語言推理（NLI）風格的數據集和思維鏈（CoT）推理。此模型經過專門訓練，能夠處理多句子前提（類似於我們在思維鏈推理和其他現代大語言模型用例中所期望的情況）。

⚠️ 重要提示

你可以使用 4 位/8 位量化來減少 GPU 內存使用。

💻 使用示例

基礎用法

from transformers import AutoTokenizer, AutoModelForSeq2SeqLM
import torch

def get_score(model, tokenizer, input_ids):
    pos_ids = tokenizer('Yes').input_ids
    neg_ids = tokenizer('No').input_ids
    pos_id = pos_ids[0]
    neg_id = neg_ids[0]
    
    logits = model(input_ids, decoder_input_ids=torch.zeros((input_ids.size(0), 1), dtype=torch.long)).logits
    pos_logits = logits[:, 0, pos_id]
    neg_logits = logits[:, 0, neg_id]
    posneg_logits = torch.cat([pos_logits.unsqueeze(-1), neg_logits.unsqueeze(-1)], dim=1)
    scores = torch.nn.functional.softmax(posneg_logits, dim=1)[:, 0]
    return scores

tokenizer = AutoTokenizer.from_pretrained('google/flan-t5-xxl')
model = AutoModelForSeq2SeqLM.from_pretrained('soumyasanyal/nli-entailment-verifier-xxl')

premise = "A fossil fuel is a kind of natural resource. Coal is a kind of fossil fuel."
hypothesis = "Coal is a kind of natural resource."
prompt = f"Premise: {premise}\nHypothesis: {hypothesis}\nGiven the premise, is the hypothesis correct?\nAnswer:"

input_ids = tokenizer(prompt, return_tensors='pt').input_ids

scores = get_score(model, tokenizer, input_ids)
print(f'Hypothesis entails the premise: {bool(scores >= 0.5)}')