flan-t5-base-squad2開源問答模型 - 免費部署處理含無答案的問答對

首頁

Flan T5 Base Squad2

由sjrhuschlee開發

基於flan-t5-base模型，使用SQuAD2.0數據集微調的抽取式問答模型，可處理包含無答案問題的問答對。

問答系統

Transformers

英語開源協議:MIT #抽取式問答 #無答案檢測 #SQuAD2.0微調

下載量 2,425

發布時間 : 6/14/2023

模型概述

該模型專門用於英語抽取式問答任務，特別擅長處理SQuAD2.0數據集中的問題，包括判斷問題是否無答案的情況。

模型特點

無答案問題處理

通過特殊<cls>標記識別無答案情況，專門針對SQuAD2.0數據集優化

多數據集適配

在SQuAD、SQuAD2.0及多個變體數據集上表現良好

高效推理

在單個NVIDIA 3070顯卡上即可運行

模型能力

抽取式問答

無答案檢測

英語文本理解

使用案例

智能客服

常見問題解答

從知識庫中提取精確答案回答用戶問題

在SQuAD驗證集上達到86.37%的精確匹配率

教育輔助

閱讀理解評估

評估學生對文章內容的理解程度

在SQuAD2.0驗證集上F1分數達85.28

🚀 flan-t5-base用於抽取式問答

本項目採用經 SQuAD2.0 數據集微調的 flan-t5-base 模型，針對抽取式問答任務，在包含不可回答問題的問答對上進行訓練。

更新說明：在 transformers 4.31.0 版本之後，不再需要 use_remote_code=True。

注意事項：為使模型正常工作，必須手動在問題開頭添加 <cls> 標記。該模型使用 <cls> 標記來進行“無答案”預測。由於 t5 分詞器不會自動添加此特殊標記，因此需要手動添加。

🚀 快速開始

模型概述

屬性	詳情
語言模型	flan-t5-base
語言	英語
下游任務	抽取式問答
訓練數據	SQuAD 2.0
評估數據	SQuAD 2.0
基礎設施	1x NVIDIA 3070

模型使用示例

import torch
from transformers import(
  AutoModelForQuestionAnswering,
  AutoTokenizer,
  pipeline
)
model_name = "sjrhuschlee/flan-t5-base-squad2"

# a) 使用管道
nlp = pipeline(
  'question-answering',
  model=model_name,
  tokenizer=model_name,
  # trust_remote_code=True, # 如果 transformers 版本 >= 4.31.0 則無需使用
)
qa_input = {
'question': f'{nlp.tokenizer.cls_token}Where do I live?',  # '<cls>Where do I live?'
'context': 'My name is Sarah and I live in London'
}
res = nlp(qa_input)
# {'score': 0.980, 'start': 30, 'end': 37, 'answer': ' London'}

# b) 加載模型和分詞器
model = AutoModelForQuestionAnswering.from_pretrained(
  model_name,
  # trust_remote_code=True # 如果 transformers 版本 >= 4.31.0 則無需使用
)
tokenizer = AutoTokenizer.from_pretrained(model_name)

question = f'{tokenizer.cls_token}Where do I live?'  # '<cls>Where do I live?'
context = 'My name is Sarah and I live in London'
encoding = tokenizer(question, context, return_tensors="pt")
output = model(
  encoding["input_ids"],
  attention_mask=encoding["attention_mask"]
)

all_tokens = tokenizer.convert_ids_to_tokens(encoding["input_ids"][0].tolist())
answer_tokens = all_tokens[torch.argmax(output["start_logits"]):torch.argmax(output["end_logits"]) + 1]
answer = tokenizer.decode(tokenizer.convert_tokens_to_ids(answer_tokens))
# 'London'

評估指標

# Squad v2
{
    "eval_HasAns_exact": 79.97638326585695,
    "eval_HasAns_f1": 86.1444296592862,
    "eval_HasAns_total": 5928,
    "eval_NoAns_exact": 84.42388561816652,
    "eval_NoAns_f1": 84.42388561816652,
    "eval_NoAns_total": 5945,
    "eval_best_exact": 82.2033184536343,
    "eval_best_exact_thresh": 0.0,
    "eval_best_f1": 85.28292588395921,
    "eval_best_f1_thresh": 0.0,
    "eval_exact": 82.2033184536343,
    "eval_f1": 85.28292588395928,
    "eval_runtime": 522.0299,
    "eval_samples": 12001,
    "eval_samples_per_second": 22.989,
    "eval_steps_per_second": 0.96,
    "eval_total": 11873
}

# Squad
{
    "eval_exact_match": 86.3197729422895,
    "eval_f1": 92.94686836210295,
    "eval_runtime": 442.1088,
    "eval_samples": 10657,
    "eval_samples_per_second": 24.105,
    "eval_steps_per_second": 1.007
}