roberta-base-squad2 Open-source English Q&A Model - Supports Framework Conversion, Accurately Extracts Answers

Roberta Base Squad2

Developed by optimum

English extractive QA model based on RoBERTa-base, trained on SQuAD 2.0 dataset, supports FARM and Transformers framework conversion

Question Answering System

Transformers

English#SQuAD2.0 QA #High-precision extractive QA #English NLP

Downloads 99

Release Time : 3/24/2022

Model Overview

This model is an English QA model based on the RoBERTa-base architecture, specifically designed for extracting answers from given texts. Supports unanswerable question detection and is a high-performance model in the SQuAD 2.0 benchmark.

Model Features

Dual-framework support

Supports FARM and Transformers framework conversion for flexible usage

Unanswerable detection

Can determine if a question has no answer in the given text

High-performance

Achieves F1 score of 82.91 on SQuAD 2.0 benchmark

Distilled version available

Offers tinyroberta-squad2 distilled version with 2x speed boost while maintaining similar quality

Model Capabilities

Text understanding

Answer extraction

Unanswerable detection

English QA

Use Cases

Customer service

FAQ auto-response

Automatically extracts answers from knowledge base documents

Reduces manual customer service workload

Document analysis

Contract information extraction

Quickly locates key clause information from legal documents

Improves legal document review efficiency

🚀 ONNX convert roberta-base for QA

This project focuses on converting the deepset/roberta-base-squad2 model, aiming to provide an efficient solution for extractive question - answering tasks in English.

🚀 Quick Start

Conversion of deepset/roberta-base-squad2

⚠️ Important Note This is version 2 of the model. See this github issue from the FARM repository for an explanation of why we updated. If you'd like to use version 1, specify revision="v1.0" when loading the model in Transformers 3.5. For example:

model_name = "deepset/roberta-base-squad2"
pipeline(model=model_name, tokenizer=model_name, revision="v1.0", task="question-answering")

✨ Features

Language model: roberta-base
Language: English
Downstream-task: Extractive QA
Training data: SQuAD 2.0
Eval data: SQuAD 2.0
Code: See example in FARM
Infrastructure: 4x Tesla v100

Property	Details
Model Type	roberta-base
Training Data	SQuAD 2.0

📦 Installation

No specific installation steps are provided in the original README.

💻 Usage Examples

In Transformers

from transformers import AutoModelForQuestionAnswering, AutoTokenizer, pipeline

model_name = "deepset/roberta-base-squad2"

# a) Get predictions
nlp = pipeline('question-answering', model=model_name, tokenizer=model_name)
QA_input = {
    'question': 'Why is model conversion important?',
    'context': 'The option to convert models between FARM and transformers gives freedom to the user and let people easily switch between frameworks.'
}
res = nlp(QA_input)

# b) Load model & tokenizer
model = AutoModelForQuestionAnswering.from_pretrained(model_name)
tokenizer = AutoTokenizer.from_pretrained(model_name)

In FARM

from farm.modeling.adaptive_model import AdaptiveModel
from farm.modeling.tokenization import Tokenizer
from farm.infer import Inferencer

model_name = "deepset/roberta-base-squad2"

# a) Get predictions
nlp = Inferencer.load(model_name, task_type="question_answering")
QA_input = [{"questions": ["Why is model conversion important?"],
             "text": "The option to convert models between FARM and transformers gives freedom to the user and let people easily switch between frameworks."}]
res = nlp.inference_from_dicts(dicts=QA_input, rest_api_schema=True)

# b) Load model & tokenizer
model = AdaptiveModel.convert_from_transformers(model_name, device="cpu", task_type="question_answering")
tokenizer = Tokenizer.load(model_name)

In haystack

For doing QA at scale (i.e. many docs instead of single paragraph), you can load the model also in haystack:

reader = FARMReader(model_name_or_path="deepset/roberta-base-squad2")
# or 
reader = TransformersReader(model_name_or_path="deepset/roberta-base-squad2",tokenizer="deepset/roberta-base-squad2")

🔧 Technical Details

Hyperparameters

batch_size = 96
n_epochs = 2
base_LM_model = "roberta-base"
max_seq_len = 386
learning_rate = 3e-5
lr_schedule = LinearWarmup
warmup_proportion = 0.2
doc_stride=128
max_query_length=64

Using a distilled model instead

💡 Usage Tip Please note that we have also released a distilled version of this model called deepset/tinyroberta-squad2. The distilled model has a comparable prediction quality and runs at twice the speed of the base model.

Performance

Evaluated on the SQuAD 2.0 dev set with the official eval script.

"exact": 79.87029394424324,
"f1": 82.91251169582613,

"total": 11873,
"HasAns_exact": 77.93522267206478,
"HasAns_f1": 84.02838248389763,
"HasAns_total": 5928,
"NoAns_exact": 81.79983179142137,
"NoAns_f1": 81.79983179142137,
"NoAns_total": 5945

📄 License

This project is licensed under the cc-by-4.0 license.

👥 Authors

Branden Chan: branden.chan [at] deepset.ai
Timo Möller: timo.moeller [at] deepset.ai
Malte Pietsch: malte.pietsch [at] deepset.ai
Tanay Soni: tanay.soni [at] deepset.ai

👋 About us

We bring NLP to the industry via open source!
Our focus: Industry specific language models & large scale QA systems.

Some of our work:

Get in touch: Twitter | LinkedIn | Slack | GitHub Discussions | Website

By the way: we're hiring!

Featured Recommended AI Models

Empowering the Future, Your AI Solution Knowledge Base

English 简体中文繁體中文にほんご