# Closed Book Trivia-QA T5 base
A T5-base model fine-tuned on the No Context TriviaQA dataset, designed to answer trivia questions from its memory.
## 🚀 Quick Start

You can test the model on trivia questions from trivia websites.
## ✨ Features
- Trained on a Specific Dataset: This is a T5-base model fine-tuned on the No Context TriviaQA dataset.
- Closed-Book Answering: The model answers trivia-style questions from its internal memory alone, without being given any context passage.
- Pretrained on C4: The underlying pretrained model was trained on the Common Crawl (C4) dataset.
- Defined Training Parameters: Trained for 135 epochs with a batch size of 32 and a learning rate of 1e-3.
- Set Input and Output Lengths: `max_input_length` is set to 25 and `max_output_length` to 10 (see the preprocessing sketch after this list).
- Performance Metrics: Attained an Exact Match (EM) score of 17 and a Subset Match score of 24.5.
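As a concrete illustration of those length settings, here is a minimal preprocessing sketch. The padding and truncation strategy shown is an assumption; the original README does not include preprocessing code:

```python
from transformers import AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("deep-learning-analytics/triviaqa-t5-base")

question = "Who directed the movie Jaws?"
answer = "Steven Spielberg"  # example target; answers are capped at 10 tokens

# Questions are truncated/padded to 25 tokens, answers to 10 tokens,
# matching max_input_length and max_output_length above.
inputs = tokenizer(question, max_length=25, padding="max_length",
                   truncation=True, return_tensors="pt")
labels = tokenizer(answer, max_length=10, padding="max_length",
                   truncation=True, return_tensors="pt")
print(inputs.input_ids.shape, labels.input_ids.shape)  # torch.Size([1, 25]) torch.Size([1, 10])
```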
## 📦 Installation

No specific installation steps are provided in the original README.
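That said, the usage example below relies on PyTorch and the 🤗 Transformers library (T5 tokenizers additionally require `sentencepiece`), so a typical setup, offered here as an assumption rather than an official instruction, would be:

```bash
pip install torch transformers sentencepiece
```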
## 💻 Usage Examples

### Basic Usage
```python
import torch
from transformers import AutoTokenizer, AutoModelWithLMHead
# Note: AutoModelWithLMHead is deprecated in recent transformers releases;
# AutoModelForSeq2SeqLM also works for this checkpoint.

tokenizer = AutoTokenizer.from_pretrained("deep-learning-analytics/triviaqa-t5-base")
model = AutoModelWithLMHead.from_pretrained("deep-learning-analytics/triviaqa-t5-base")

# Run on GPU if one is available.
device = torch.device("cuda:0" if torch.cuda.is_available() else "cpu")
model = model.to(device)

text = "Who directed the movie Jaws?"
preprocess_text = text.strip().replace("\n", "")
tokenized_text = tokenizer.encode(preprocess_text, return_tensors="pt").to(device)

# Beam search over at most 10 output tokens.
outs = model.generate(
    tokenized_text,
    max_length=10,
    num_beams=2,
    early_stopping=True,
)
dec = [tokenizer.decode(ids, skip_special_tokens=True) for ids in outs]
print("Predicted Answer: ", dec)
```
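With `skip_special_tokens=True` added to the decode step, the printed list contains the bare answer string rather than `<pad>`/`</s>` tokens; for the example question above, a correctly working model should produce something like `['Steven Spielberg']`.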
## 📚 Documentation

We have written a blog post that covers the training procedure. Please find it here.
## 🔧 Technical Details

This is a T5-base model fine-tuned on the No Context TriviaQA dataset. The input to the model is a trivia-style question, and the model is tuned to retrieve the answer from its memory. The underlying pretrained model was trained on the Common Crawl (C4) dataset. Fine-tuning ran for 135 epochs with a batch size of 32 and a learning rate of 1e-3; `max_input_length` is set to 25 and `max_output_length` to 10. The model attained an Exact Match (EM) score of 17 and a Subset Match score of 24.5.
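For readers who want to reproduce a comparable setup, below is a minimal fine-tuning sketch under stated assumptions: the optimizer (Adafactor is a common choice for T5, but the original choice is not documented), the use of the `trivia_qa` dataset's `rc.nocontext` configuration from the 🤗 `datasets` library, and the preprocessing details are all assumptions, not the authors' confirmed recipe.

```python
import torch
from datasets import load_dataset
from torch.utils.data import DataLoader
from transformers import Adafactor, T5ForConditionalGeneration, T5Tokenizer

# Assumed data source: the no-context configuration of TriviaQA.
dataset = load_dataset("trivia_qa", "rc.nocontext", split="train")

tokenizer = T5Tokenizer.from_pretrained("t5-base")
model = T5ForConditionalGeneration.from_pretrained("t5-base")
device = torch.device("cuda:0" if torch.cuda.is_available() else "cpu")
model = model.to(device)

def collate(batch):
    # Lengths from the model card: 25 input tokens, 10 output tokens.
    questions = [ex["question"] for ex in batch]
    answers = [ex["answer"]["value"] for ex in batch]
    inputs = tokenizer(questions, max_length=25, padding="max_length",
                       truncation=True, return_tensors="pt")
    targets = tokenizer(answers, max_length=10, padding="max_length",
                        truncation=True, return_tensors="pt")
    labels = targets.input_ids
    labels[labels == tokenizer.pad_token_id] = -100  # ignore padding in the loss
    return inputs.input_ids, inputs.attention_mask, labels

# Batch size 32 per the model card.
loader = DataLoader(dataset, batch_size=32, shuffle=True, collate_fn=collate)

# Learning rate 1e-3 as stated in the card; Adafactor itself is an assumption.
optimizer = Adafactor(model.parameters(), lr=1e-3,
                      scale_parameter=False, relative_step=False)

model.train()
for epoch in range(135):  # 135 epochs per the model card
    for input_ids, attention_mask, labels in loader:
        optimizer.zero_grad()
        loss = model(input_ids=input_ids.to(device),
                     attention_mask=attention_mask.to(device),
                     labels=labels.to(device)).loss
        loss.backward()
        optimizer.step()
```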
## 📄 License

No license information is provided in the original README.