Electra-base-squad2 Open-source Question Answering Model - Suitable for English Q&A tasks, facilitating information acquisition

Electra Base Squad2

Developed by bhadresh-savani

This is an English extractive question answering model based on the ELECTRA-base architecture, trained on the SQuAD 2.0 dataset, suitable for question answering tasks.

Question Answering System #Extractive Question Answering #High-precision Reading Comprehension #ELECTRA Architecture

Downloads 102

Release Time : 4/13/2022

Model Overview

This model uses the ELECTRA-base architecture and is specifically designed for English extractive question answering tasks. It can extract answers from given text or determine if a question is unanswerable.

Model Features

High-performance Question Answering

Achieves an F1 score of 81.35 on the SQuAD 2.0 development set, demonstrating excellent performance.

Supports Unanswerable Questions

Capable of handling unanswerable questions and performs well on such cases.

Multi-framework Support

Compatible with various frameworks including Transformers, FARM, and Haystack.

Model Capabilities

Text Understanding

Question Answering

Unanswerable Question Detection

Use Cases

Intelligent Customer Service

Automated Customer Query Responses

Automatically extracts answers to customer questions from knowledge base documents

Improves customer service efficiency and reduces manual intervention

Educational Assistance

Learning Material Q&A

Helps students quickly find answers to questions from textbook content

Enhances learning efficiency

🚀 Electra-base for QA

This is an Electra-base model fine-tuned for extractive question answering on the SQuAD 2.0 dataset.

🚀 Quick Start

Prerequisites

The model is based on the English language and trained on SQuAD 2.0 for extractive question answering.
The infrastructure used for training is 1x Tesla v100.

Code Example

You can refer to the example in FARM.

✨ Features

Language Model: electra-base
Language: English
Downstream-task: Extractive QA
Training data: SQuAD 2.0
Eval data: SQuAD 2.0
Infrastructure: 1x Tesla v100

📦 Installation

No specific installation steps are provided in the original document.

💻 Usage Examples

In Transformers

from transformers import AutoModelForQuestionAnswering, AutoTokenizer, pipeline

model_name = "deepset/electra-base-squad2"

# a) Get predictions
nlp = pipeline('question-answering', model=model_name, tokenizer=model_name)
QA_input = {
    'question': 'Why is model conversion important?',
    'context': 'The option to convert models between FARM and transformers gives freedom to the user and let people easily switch between frameworks.'
}
res = nlp(QA_input)

# b) Load model & tokenizer
model = AutoModelForQuestionAnswering.from_pretrained(model_name)
tokenizer = AutoTokenizer.from_pretrained(model_name)

In FARM

from farm.modeling.adaptive_model import AdaptiveModel
from farm.modeling.tokenization import Tokenizer
from farm.infer import Inferencer

model_name = "deepset/electra-base-squad2"

# a) Get predictions
nlp = Inferencer.load(model_name, task_type="question_answering")
QA_input = [{"questions": ["Why is model conversion important?"],
             "text": "The option to convert models between FARM and transformers gives freedom to the user and let people easily switch between frameworks."}]
res = nlp.inference_from_dicts(dicts=QA_input)

# b) Load model & tokenizer
model = AdaptiveModel.convert_from_transformers(model_name, device="cpu", task_type="question_answering")
tokenizer = Tokenizer.load(model_name)

In haystack

reader = FARMReader(model_name_or_path="deepset/electra-base-squad2")
# or
reader = TransformersReader(model="deepset/electra-base-squad2",tokenizer="deepset/electra-base-squad2")

📚 Documentation

Hyperparameters

seed=42
batch_size = 32
n_epochs = 5
base_LM_model = "google/electra-base-discriminator"
max_seq_len = 384
learning_rate = 1e-4
lr_schedule = LinearWarmup
warmup_proportion = 0.1
doc_stride=128
max_query_length=64

Performance

Evaluated on the SQuAD 2.0 dev set with the official eval script.

"exact": 77.30144024256717,
 "f1": 81.35438272008543,
 "total": 11873,
 "HasAns_exact": 74.34210526315789,
 "HasAns_f1": 82.45961302894314,
 "HasAns_total": 5928,
 "NoAns_exact": 80.25231286795626,
 "NoAns_f1": 80.25231286795626,
 "NoAns_total": 5945

🔧 Technical Details

The model is fine-tuned on the SQuAD 2.0 dataset for extractive question answering. The hyperparameters are carefully selected to achieve good performance on the evaluation set.

📄 License

The license for this model is cc-by-4.0.

📝 Authors

Vaishali Pal vaishali.pal [at] deepset.ai
Branden Chan: branden.chan [at] deepset.ai
Timo Möller: timo.moeller [at] deepset.ai
Malte Pietsch: malte.pietsch [at] deepset.ai
Tanay Soni: tanay.soni [at] deepset.ai

Note

Borrowed this model from Haystack model repo for adding tensorflow model.

Featured Recommended AI Models

Empowering the Future, Your AI Solution Knowledge Base

English 简体中文繁體中文にほんご