Model Card of lmqg/t5-small-squad-qg-ae
This model is a fine-tuned version of t5-small for joint question generation and answer extraction. It is trained on the lmqg/qg_squad dataset (dataset_name: default) via lmqg.
Quick Start
Overview
Features
This model supports both question generation and answer extraction tasks, which can be useful in various natural language processing applications such as information retrieval and question-answering systems.
Installation
The model can be used with the lmqg and transformers libraries. Install them first, for example with pip:

```bash
pip install lmqg transformers
```
Usage Examples
Basic Usage
With the lmqg library:

```python
from lmqg import TransformersQG

# Load the model for English question and answer generation
model = TransformersQG(language="en", model="lmqg/t5-small-squad-qg-ae")

# Generate question-answer pairs from a raw paragraph
question_answer_pairs = model.generate_qa("William Turner was an English painter who specialised in watercolour landscapes")
```
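Continuing the snippet above, a minimal sketch of printing the result, assuming generate_qa returns a list of (question, answer) tuples:

```python
# Continues the snippet above; assumes generate_qa returns (question, answer) tuples.
for question, answer in question_answer_pairs:
    print(f"Q: {question}\nA: {answer}")
```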
Directly with the transformers pipeline:

```python
from transformers import pipeline

pipe = pipeline("text2text-generation", "lmqg/t5-small-squad-qg-ae")

# Question generation: highlight the answer span with <hl> tokens
question = pipe("generate question: <hl> Beyonce <hl> further expanded her acting career, starring as blues singer Etta James in the 2008 musical biopic, Cadillac Records.")

# Answer extraction: highlight the sentence to extract answers from
answer = pipe("extract answers: <hl> Beyonce further expanded her acting career, starring as blues singer Etta James in the 2008 musical biopic, Cadillac Records. <hl> Her performance in the film received praise from critics, and she garnered several nominations for her portrayal of James, including a Satellite Award nomination for Best Supporting Actress, and a NAACP Image Award nomination for Outstanding Supporting Actress.")
```
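Joint question-answer generation chains these two prefixes. Purely as an illustration, a sketch of doing that by hand with the raw pipeline; it assumes the answer-extraction output is a plain comma-separated string of answer spans, whereas TransformersQG.generate_qa applies its own pre- and post-processing:

```python
from transformers import pipeline

pipe = pipeline("text2text-generation", "lmqg/t5-small-squad-qg-ae")

paragraph = ("Beyonce further expanded her acting career, starring as blues singer "
             "Etta James in the 2008 musical biopic, Cadillac Records.")

# Step 1: answer extraction -- highlight the sentence to extract answers from.
extracted = pipe(f"extract answers: <hl> {paragraph} <hl>")[0]["generated_text"]

# Step 2: question generation -- highlight one extracted answer span in the paragraph.
# Assumes the extracted span occurs verbatim in the paragraph.
answer = extracted.split(",")[0].strip()
highlighted = paragraph.replace(answer, f"<hl> {answer} <hl>", 1)
question = pipe(f"generate question: {highlighted}")[0]["generated_text"]

print(f"Q: {question}\nA: {answer}")
```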
Documentation
Evaluation
Training hyperparameters
The following hyperparameters were used during fine-tuning:
- dataset_path: lmqg/qg_squad
- dataset_name: default
- input_types: ['paragraph_answer', 'paragraph_sentence']
- output_types: ['question', 'answer']
- prefix_types: ['qg', 'ae']
- model: t5-small
- max_length: 512
- max_length_output: 32
- epoch: 7
- batch: 64
- lr: 0.0001
- fp16: False
- random_seed: 1
- gradient_accumulation_steps: 1
- label_smoothing: 0.15
The full configuration can be found in the fine-tuning config file.
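The actual run used lmqg's own training pipeline; purely as an illustration, a hedged sketch of how the listed values would map onto transformers' Seq2SeqTrainingArguments (the output_dir and the per-device interpretation of batch are assumptions):

```python
from transformers import Seq2SeqTrainingArguments

# Hypothetical mapping of the hyperparameters above onto transformers'
# Seq2SeqTrainingArguments; the actual fine-tuning was done via lmqg.
args = Seq2SeqTrainingArguments(
    output_dir="t5-small-squad-qg-ae",   # hypothetical output path
    num_train_epochs=7,                  # epoch: 7
    per_device_train_batch_size=64,      # batch: 64 (assumed per-device)
    learning_rate=1e-4,                  # lr: 0.0001
    label_smoothing_factor=0.15,         # label_smoothing: 0.15
    gradient_accumulation_steps=1,
    fp16=False,
    seed=1,                              # random_seed: 1
)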
Technical Details
The model is based on the t5-small architecture and is fine-tuned on the lmqg/qg_squad dataset. The fine-tuning process optimizes the model jointly for question generation and answer extraction using the hyperparameters listed above.
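The checkpoint can also be driven directly with transformers while respecting the max_length (512 input tokens) and max_length_output (32 output tokens) settings listed above; a minimal sketch:

```python
from transformers import AutoTokenizer, AutoModelForSeq2SeqLM

# Load the checkpoint directly with transformers.
tokenizer = AutoTokenizer.from_pretrained("lmqg/t5-small-squad-qg-ae")
model = AutoModelForSeq2SeqLM.from_pretrained("lmqg/t5-small-squad-qg-ae")

text = ("generate question: <hl> William Turner <hl> was an English painter "
        "who specialised in watercolour landscapes")

# Truncate inputs to 512 tokens and cap generation at 32 tokens,
# matching the max_length / max_length_output settings above.
inputs = tokenizer(text, truncation=True, max_length=512, return_tensors="pt")
outputs = model.generate(**inputs, max_length=32)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```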
License
This model is released under the CC-BY-4.0 license.
Citation
@inproceedings{ushio-etal-2022-generative,
title = "{G}enerative {L}anguage {M}odels for {P}aragraph-{L}evel {Q}uestion {G}eneration",
author = "Ushio, Asahi and
Alva-Manchego, Fernando and
Camacho-Collados, Jose",
booktitle = "Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing",
month = dec,
year = "2022",
address = "Abu Dhabi, U.A.E.",
publisher = "Association for Computational Linguistics",
}