T5-small-ssm-nq: Open-Source Closed-Book Question Answering Model, Pre-trained and Fine-tuned on Multiple Datasets

Developed by Google

T5-small-ssm-nq is a small closed-book QA model based on the T5 architecture, pre-trained and fine-tuned on C4, Wikipedia, and Natural Questions (NQ) datasets.

Question Answering System
English
Open Source License: Apache-2.0
#Closed-book QA #Knowledge internalization #Retrieval-free generation

Downloads 300

Release Time: 3/2/2022

Model Overview

This model is designed for closed-book QA tasks, capable of answering questions without external knowledge sources.

Model Features

Closed-book QA

The model can answer questions without accessing external knowledge sources.

Multi-stage training

Pre-trained on the C4 dataset, then on Wikipedia with salient span masking, and finally fine-tuned on the Natural Questions dataset.

Scalability

Model performance improves with scale, gradually increasing from small to xxl versions.

Model Capabilities

Closed-book QA

Knowledge retrieval

Text generation

Use Cases

QA systems

Factual question answering

Answering factual questions about historical figures, events, etc.

The model achieved an exact match score of 25.5 on the Natural Questions test set.

🚀 Google's T5 for Closed Book Question Answering

This T5 checkpoint from Google targets closed-book question answering. It was pre-trained and then fine-tuned on multiple datasets so that it can answer questions without access to an external knowledge source.

✨ Features

Multi-stage Training: The model was pre-trained using T5's denoising objective on C4, then additionally pre-trained using REALM's salient span masking objective on Wikipedia, and finally fine-tuned on Natural Questions (NQ).
Fine-tuning Details: The model was fine-tuned on 100% of the train splits of Natural Questions (NQ) for 10k steps.
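
As a rough illustration of the salient span masking step described above, the sketch below masks one salient span in a sentence and produces a T5-style input/target pair. It is not the REALM or T5 training code: the year-only pattern `YEAR_PATTERN` and the helper `mask_first_year` are hypothetical stand-ins (REALM treats spans such as named entities and dates as salient), and the `<extra_id_N>` sentinels follow the T5 tokenizer's naming.

```python
import re

# Hypothetical stand-in for a salient-span detector. REALM relies on a
# named-entity tagger plus a date pattern; this sketch only finds years.
YEAR_PATTERN = re.compile(r"\b(1\d{3}|20\d{2})\b")


def mask_first_year(sentence):
    """Mask the first four-digit year in `sentence`.

    Returns an (input_text, target_text) pair in T5's sentinel format,
    or None when the sentence contains no year to mask.
    """
    match = YEAR_PATTERN.search(sentence)
    if match is None:
        return None
    # Replace the salient span with the first sentinel token.
    input_text = sentence[:match.start()] + "<extra_id_0>" + sentence[match.end():]
    # The target lists the masked span after its sentinel, closed by the next one.
    target_text = "<extra_id_0> " + match.group(0) + " <extra_id_1>"
    return input_text, target_text


pair = mask_first_year("Franklin D. Roosevelt was born in 1882.")
if pair is not None:
    print("input: ", pair[0])
    print("target:", pair[1])
```

Because the masked span is a fact rather than a random run of tokens, the model has to recall that fact from the surrounding sentence, which is the kind of knowledge closed-book QA later relies on.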

📚 Documentation

Datasets Used

| Property | Details |
|----------|---------|
| Model Type | Google's T5 for Closed Book Question Answering |
| Training Data | C4, Wikipedia, Natural Questions (NQ) |

Related Links
- Google's T5 Blog Post
- Other community Checkpoints: here
- Paper: How Much Knowledge Can You Pack Into the Parameters of a Language Model?
- Authors: Adam Roberts, Colin Raffel, Noam Shazeer

💻 Usage Examples

Basic Usage

The model can be used as follows for closed book question answering:

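from transformers import AutoModelForSeq2SeqLM, AutoTokenizer

# Load the fine-tuned checkpoint and its tokenizer from the Hugging Face Hub.
t5_qa_model = AutoModelForSeq2SeqLM.from_pretrained("google/t5-small-ssm-nq")
t5_tok = AutoTokenizer.from_pretrained("google/t5-small-ssm-nq")

# Closed-book setting: the input is the question alone, with no supporting passage.
input_ids = t5_tok("When was Franklin D. Roosevelt born?", return_tensors="pt").input_ids
# generate() returns a batch of sequences; take the first (and only) one.
gen_output = t5_qa_model.generate(input_ids)[0]

# Decode the generated token IDs to text, dropping padding and end-of-sequence tokens.
print(t5_tok.decode(gen_output, skip_special_tokens=True))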
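
To answer several questions in one call, the same steps can be wrapped in a small helper. This is a minimal sketch, not part of the original model card: `load_model_and_tokenizer` and `answer_questions` are hypothetical names, and the `max_new_tokens=16` default is an assumed cap chosen because the answers are short. The `transformers` import is deferred so the helper itself can be exercised without downloading the checkpoint.

```python
def load_model_and_tokenizer(name="google/t5-small-ssm-nq"):
    """Load the checkpoint and tokenizer (downloads weights on first use)."""
    # Imported lazily so that defining the helpers does not require transformers.
    from transformers import AutoModelForSeq2SeqLM, AutoTokenizer

    model = AutoModelForSeq2SeqLM.from_pretrained(name)
    tokenizer = AutoTokenizer.from_pretrained(name)
    return model, tokenizer


def answer_questions(questions, model, tokenizer, max_new_tokens=16):
    """Generate one answer per question in a single padded batch."""
    if not questions:
        return []
    # padding=True pads shorter questions so the batch forms one tensor.
    inputs = tokenizer(questions, return_tensors="pt", padding=True)
    # max_new_tokens is an assumed limit, not a value from the model card.
    outputs = model.generate(**inputs, max_new_tokens=max_new_tokens)
    answers = tokenizer.batch_decode(outputs, skip_special_tokens=True)
    return [answer.strip() for answer in answers]
```

A typical call would be `model, tok = load_model_and_tokenizer()` followed by `answer_questions(["When was Franklin D. Roosevelt born?"], model, tok)`.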

📊 Results on Natural Questions - Test Set

Id	Link	Exact Match
T5-small	https://huggingface.co/google/t5-small-ssm-nq	25.5
T5-large	https://huggingface.co/google/t5-large-ssm-nq	30.4
T5-xl	https://huggingface.co/google/t5-xl-ssm-nq	35.6
T5-xxl	https://huggingface.co/google/t5-xxl-ssm-nq	37.9
T5-3b	https://huggingface.co/google/t5-3b-ssm-nq	33.2
T5-11b	https://huggingface.co/google/t5-11b-ssm-nq	36.6
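
For readers unfamiliar with the metric, exact match counts a prediction as correct only if it equals one of the reference answers after light normalization. The sketch below is an assumption, not the evaluation script behind the numbers above: it uses the common SQuAD-style normalization (lowercasing, removing punctuation and English articles, collapsing whitespace), and `normalize_answer`, `exact_match`, and `exact_match_score` are hypothetical helper names.

```python
import re
import string


def normalize_answer(text):
    """Lowercase, drop punctuation and articles, and collapse whitespace."""
    text = text.lower()
    text = "".join(ch for ch in text if ch not in string.punctuation)
    text = re.sub(r"\b(a|an|the)\b", " ", text)
    return " ".join(text.split())


def exact_match(prediction, references):
    """Return 1.0 if the prediction matches any reference after normalization."""
    normalized = normalize_answer(prediction)
    return float(any(normalized == normalize_answer(ref) for ref in references))


def exact_match_score(predictions, reference_lists):
    """Average exact match over a dataset, reported as a percentage."""
    if not predictions or len(predictions) != len(reference_lists):
        raise ValueError("predictions and reference_lists must be non-empty and aligned")
    total = sum(exact_match(p, refs) for p, refs in zip(predictions, reference_lists))
    return 100.0 * total / len(predictions)


print(exact_match_score(["1882", "Paris"], [["1882"], ["London"]]))
```

Under this definition, a score of 25.5 means roughly one question in four received an answer that matched a reference exactly.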

🔧 Technical Details

From the paper's abstract: It has recently been observed that neural language models trained on unstructured text can implicitly store and retrieve knowledge using natural language queries. In this short paper, we measure the practical utility of this approach by fine-tuning pre-trained models to answer questions without access to any external context or knowledge. We show that this approach scales with model size and performs competitively with open-domain systems that explicitly retrieve answers from an external knowledge source when answering questions. To facilitate reproducibility and future work, we release our code and trained models at https://goo.gle/t5-cbqa.

📄 License

The model is licensed under the Apache-2.0 license.
