T

T5 V1 1 Base

Developed by google
T5 1.1 is Google's improved text-to-text transfer model, utilizing the GEGLU activation function and optimized architecture, focused on unsupervised pretraining
Downloads 150.73k
Release Time : 3/2/2022

Model Overview

Enhanced T5 model with improved transfer learning performance through architectural optimizations, requires fine-tuning for downstream NLP tasks

Model Features

GEGLU Activation Function
Uses GEGLU instead of ReLU in feed-forward hidden layers to enhance model expressiveness
Pure Unsupervised Pretraining
Pretrained exclusively on C4 dataset without mixing downstream task data
Parameter Sharing Optimization
Removes parameter sharing between embedding and classifier layers for improved model flexibility
Architectural Optimization
Adjusted dimension configurations for xl/xxl variants, increasing d_model while reducing attention heads

Model Capabilities

Text Generation
Text Classification
Question Answering
Summarization
Machine Translation

Use Cases

Text Generation
Content Summarization
Generates concise summaries of long documents
Achieves SOTA on CNN/Daily Mail dataset
Question Answering
Open-domain QA
Answers natural language questions based on textual knowledge
Excellent performance on Natural Questions benchmark
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase