T5-xl-ssm-nq Open-Source Closed-Book Question Answering Model - Excellent Natural Question Answering Supported by Pretraining and Fine-Tuning

T5 Xl Ssm Nq

Developed by google

T5-xl-ssm-nq is a closed-book QA model based on the T5 architecture, excelling in natural questions through pretraining and fine-tuning.

Question Answering System

Transformers

EnglishOpen Source License:Apache-2.0 #Closed-book QA #Knowledge compression #Retrieval-free generation

Downloads 16

Release Time : 3/2/2022

Model Overview

This model is designed for closed-book QA tasks, capable of answering questions without accessing external knowledge sources.

Model Features

Closed-book QA

The model can answer questions without accessing external knowledge sources.

Multi-stage training

The model is first pretrained on the C4 dataset, then undergoes additional pretraining on Wikipedia, and is finally fine-tuned on the Natural Questions dataset.

High performance

Demonstrates excellent performance on the Natural Questions test set with an exact match rate of 35.6.

Model Capabilities

Closed-book QA

Text generation

Use Cases

Question answering systems

Historical fact QA

Answer questions about historical figures or events.

Exact match rate 35.6

General knowledge QA

Answer various general knowledge questions.

🚀 Google's T5 for Closed Book Question Answering

Google's T5 model designed for Closed Book Question Answering, leveraging multiple datasets for training to effectively answer questions without external context.

🚀 Quick Start

This model, Google's T5, is tailored for Closed Book Question Answering. It undergoes a multi - stage training process: first, pre - trained using T5's denoising objective on C4, then additionally pre - trained using REALM's salient span masking objective on Wikipedia, and finally fine - tuned on Natural Questions (NQ).

⚠️ Important Note

The model was fine - tuned on 100% of the train splits of Natural Questions (NQ) for 10k steps.

Other community Checkpoints can be found here.

The related paper is How Much Knowledge Can You Pack Into the Parameters of a Language Model?, authored by Adam Roberts, Colin Raffel, Noam Shazeer.

✨ Features

Datasets

Property	Details
Training Datasets	c4, wikipedia, natural_questions
Pipeline Tag	text2text - generation
License	apache - 2.0

Results on Natural Questions - Test Set

Id	Link	Exact Match
T5 - small	https://huggingface.co/google/t5 - small - ssm - nq	25.5
T5 - large	https://huggingface.co/google/t5 - large - ssm - nq	30.4
T5 - xl	https://huggingface.co/google/t5 - xl - ssm - nq	35.6
T5 - xxl	https://huggingface.co/google/t5 - xxl - ssm - nq	37.9
T5 - 3b	https://huggingface.co/google/t5 - 3b - ssm - nq	33.2
T5 - 11b	https://huggingface.co/google/t5 - 11b - ssm - nq	36.6

💻 Usage Examples

Basic Usage

The model can be used as follows for closed book question answering:

from transformers import AutoModelForSeq2SeqLM, AutoTokenizer

t5_qa_model = AutoModelForSeq2SeqLM.from_pretrained("google/t5-xl-ssm-nq")
t5_tok = AutoTokenizer.from_pretrained("google/t5-xl-ssm-nq")

input_ids = t5_tok("When was Franklin D. Roosevelt born?", return_tensors="pt").input_ids
gen_output = t5_qa_model.generate(input_ids)[0]

print(t5_tok.decode(gen_output, skip_special_tokens=True))

📚 Documentation

Abstract

It has recently been observed that neural language models trained on unstructured text can implicitly store and retrieve knowledge using natural language queries. In this short paper, we measure the practical utility of this approach by fine - tuning pre - trained models to answer questions without access to any external context or knowledge. We show that this approach scales with model size and performs competitively with open - domain systems that explicitly retrieve answers from an external knowledge source when answering questions. To facilitate reproducibility and future work, we release our code and trained models at https://goo.gle/t5 - cbqa.

model image

📄 License

This project is licensed under the apache - 2.0 license.

Featured Recommended AI Models

Empowering the Future, Your AI Solution Knowledge Base

English 简体中文繁體中文にほんご