longformer-base-4096-finetuned-squadv1開源問答模型

首頁

Longformer Base 4096 Finetuned Squadv1

由valhalla開發

基於LONGFORMER-BASE-4096模型在SQuAD v1問答數據集上的微調版本，適用於處理長文檔問答任務

問答系統開源協議:MIT #長文本問答 #全局注意力機制 #SQuAD微調

下載量 806

發布時間 : 3/2/2022

模型概述

該模型是Longformer在SQuAD v1數據集上微調後的版本，專門用於問答任務，能夠處理最長4096個標記的序列。

模型特點

長文檔處理能力

能夠處理最長4096個標記的序列，適合長文檔問答任務。

全局注意力機制

在問答任務中，自動為問題標記設置全局注意力，提升問答準確性。

高效訓練

採用滑動窗口局部注意力機制，降低計算複雜度，提升訓練效率。

模型能力

長文檔問答

文本理解

答案提取

使用案例

問答系統

閱讀理解

從長文檔中提取問題的答案

精確匹配率85.1466，F1分數91.5415

🚀 LONGFORMER-BASE-4096在SQuAD v1上微調

這是一個在SQuAD v1數據集上針對問答任務進行微調的longformer-base-4096模型。該模型能夠有效處理問答場景，為長文本問答提供了強大的支持。

數據集

squad_v1

許可證

🚀 快速開始

本模型是基於Transformer架構的問答模型，可用於處理長文本的問答任務。以下是使用該模型的示例代碼：

import torch
from transformers import AutoTokenizer, AutoModelForQuestionAnswering

tokenizer = AutoTokenizer.from_pretrained("valhalla/longformer-base-4096-finetuned-squadv1")
model = AutoModelForQuestionAnswering.from_pretrained("valhalla/longformer-base-4096-finetuned-squadv1")

text = "Huggingface has democratized NLP. Huge thanks to Huggingface for this."
question = "What has Huggingface done ?"
encoding = tokenizer(question, text, return_tensors="pt")
input_ids = encoding["input_ids"]

# default is local attention everywhere
# the forward method will automatically set global attention on question tokens
attention_mask = encoding["attention_mask"]

start_scores, end_scores = model(input_ids, attention_mask=attention_mask)
all_tokens = tokenizer.convert_ids_to_tokens(input_ids[0].tolist())

answer_tokens = all_tokens[torch.argmax(start_scores) :torch.argmax(end_scores)+1]
answer = tokenizer.decode(tokenizer.convert_tokens_to_ids(answer_tokens))
# output => democratized NLP

目前，LongformerForQuestionAnswering 還不支持 pipeline。待支持添加後，我會更新此文檔。

✨ 主要特性

長文本處理能力：預訓練模型可以處理長達4096個標記的序列，適合處理長文檔問答。
自動處理全局注意力：LongformerForQuestionAnswering 模型會自動為問題標記設置全局注意力。

📦 安裝指南

文檔未提及安裝步驟，故跳過此章節。

💻 使用示例

基礎用法

import torch
from transformers import AutoTokenizer, AutoModelForQuestionAnswering

tokenizer = AutoTokenizer.from_pretrained("valhalla/longformer-base-4096-finetuned-squadv1")
model = AutoModelForQuestionAnswering.from_pretrained("valhalla/longformer-base-4096-finetuned-squadv1")

text = "Huggingface has democratized NLP. Huge thanks to Huggingface for this."
question = "What has Huggingface done ?"
encoding = tokenizer(question, text, return_tensors="pt")
input_ids = encoding["input_ids"]

# default is local attention everywhere
# the forward method will automatically set global attention on question tokens
attention_mask = encoding["attention_mask"]

start_scores, end_scores = model(input_ids, attention_mask=attention_mask)
all_tokens = tokenizer.convert_ids_to_tokens(input_ids[0].tolist())

answer_tokens = all_tokens[torch.argmax(start_scores) :torch.argmax(end_scores)+1]
answer = tokenizer.decode(tokenizer.convert_tokens_to_ids(answer_tokens))
# output => democratized NLP