nli-entailment-verifier-xxl开源模型 - 免费验证前提是否支持假设，多句场景更优化

首页

Nli Entailment Verifier Xxl

由 soumyasanyal 开发

基于flan-t5-xxl微调的NLI模型，用于验证前提是否支持假设，特别优化多句前提场景

大型语言模型

Transformers

英语#自然语言推理 #蕴涵验证 #思维链推理

下载量 164

发布时间 : 1/11/2024

模型简介

该模型专门用于自然语言推理(NLI)任务，可验证给定前提是否支持某个假设，适用于思维链推理等复杂场景。针对多句前提进行了优化训练。

模型特点

多句前提优化

专门针对多句前提场景进行训练，适合复杂推理任务

量化支持

支持4位/8位量化以减少GPU内存使用

排序目标微调

采用排序目标进行微调，能从假设对中选出最受支持的假设

模型能力

自然语言推理

蕴涵验证

逻辑关系判断

思维链推理支持

使用案例

自然语言处理

学术论文推理验证

验证研究论文中的结论是否被前提支持

可提供逻辑支持程度的量化评分

法律文书分析

分析法律条文与案件事实之间的逻辑关系

判断法律条文是否支持特定案件结论

教育评估

学生答案评分

评估学生答案是否被问题前提充分支持

提供答案逻辑正确性的量化评分

🚀 nli-entailment-verifier-xxl

nli-entailment-verifier-xxl 是一个用于验证给定前提是否支持假设的模型。它基于 flan-t5-xxl 模型进行微调，以排序为目标（从给定前提的一对假设中对最受支持的假设进行排序）。该模型适用于自然语言推理（NLI）风格的数据集和思维链（CoT）推理，尤其经过专门训练以处理多句子前提，这与现代大语言模型（LLM）应用场景中的需求相符。

🚀 快速开始

模型描述

nli-entailment-verifier-xxl 基于 flan-t5-xxl 模型，并通过排序目标进行微调（针对给定前提，从给定的一对假设中对最受支持的假设进行排序）。更多详细信息请参考我们的论文 Are Machines Better at Complex Reasoning? Unveiling Human-Machine Inference Gaps in Entailment Verification。

该模型旨在验证给定前提是否支持某个假设，适用于自然语言推理（NLI）风格的数据集和思维链（CoT）推理。此模型经过专门训练，能够处理多句子前提（类似于我们在思维链推理和其他现代大语言模型用例中所期望的情况）。

⚠️ 重要提示

你可以使用 4 位/8 位量化来减少 GPU 内存使用。

💻 使用示例

基础用法

from transformers import AutoTokenizer, AutoModelForSeq2SeqLM
import torch

def get_score(model, tokenizer, input_ids):
    pos_ids = tokenizer('Yes').input_ids
    neg_ids = tokenizer('No').input_ids
    pos_id = pos_ids[0]
    neg_id = neg_ids[0]
    
    logits = model(input_ids, decoder_input_ids=torch.zeros((input_ids.size(0), 1), dtype=torch.long)).logits
    pos_logits = logits[:, 0, pos_id]
    neg_logits = logits[:, 0, neg_id]
    posneg_logits = torch.cat([pos_logits.unsqueeze(-1), neg_logits.unsqueeze(-1)], dim=1)
    scores = torch.nn.functional.softmax(posneg_logits, dim=1)[:, 0]
    return scores

tokenizer = AutoTokenizer.from_pretrained('google/flan-t5-xxl')
model = AutoModelForSeq2SeqLM.from_pretrained('soumyasanyal/nli-entailment-verifier-xxl')

premise = "A fossil fuel is a kind of natural resource. Coal is a kind of fossil fuel."
hypothesis = "Coal is a kind of natural resource."
prompt = f"Premise: {premise}\nHypothesis: {hypothesis}\nGiven the premise, is the hypothesis correct?\nAnswer:"

input_ids = tokenizer(prompt, return_tensors='pt').input_ids

scores = get_score(model, tokenizer, input_ids)
print(f'Hypothesis entails the premise: {bool(scores >= 0.5)}')