
Pegasus Reddit Tifu

Developed by Google
PEGASUS is a model pretrained with a gap-sentence generation objective and designed specifically for abstractive summarization.
Downloads 17
Release Time: 3/2/2022

Model Overview

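PEGASUS is pretrained by removing important sentences (gap sentences) from a document and learning to generate them from the remaining text. Because this objective closely resembles summarization, the model is well suited to producing high-quality abstractive summaries.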

Model Features

Mixed and Stochastic Training
Trained on both C4 and HugeNews, with the mixture weighted by each dataset's number of examples, and for more training steps to improve performance.
Dynamic Sentence Sampling
Samples the gap-sentence ratio uniformly between 15% and 45%, and adds 20% uniform noise to the importance scores when selecting important sentences.
Improved Tokenizer
The SentencePiece tokenizer was updated to encode newline characters, which preserves paragraph segmentation information.
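The dynamic sentence sampling described above can be illustrated with a short sketch. It is not the original training code: the function name `sample_gap_sentences`, the multiplicative form of the noise, and the rounding rule are all assumptions made for illustration. It takes precomputed per-sentence importance scores, draws a gap-sentence ratio uniformly from 15% to 45%, perturbs each score by up to 20%, and returns the indices of the sentences that would be masked.

```python
import random


def sample_gap_sentences(scores, rng, min_ratio=0.15, max_ratio=0.45, noise=0.20):
    """Return sorted indices of sentences to mask (hypothetical helper).

    scores: per-sentence importance scores, higher means more important.
    rng:    a random.Random instance, passed in so results are reproducible.
    """
    if not scores:
        return []

    # Draw the gap-sentence ratio uniformly from [min_ratio, max_ratio].
    ratio = rng.uniform(min_ratio, max_ratio)
    # Assumption: always mask at least one sentence.
    k = max(1, round(ratio * len(scores)))

    # Assumption: noise scales each score by a factor in [1 - noise, 1 + noise].
    noisy = [s * (1.0 + rng.uniform(-noise, noise)) for s in scores]

    # Keep the k sentences with the highest noisy scores, in document order.
    ranked = sorted(range(len(scores)), key=lambda i: noisy[i], reverse=True)
    return sorted(ranked[:k])


if __name__ == "__main__":
    rng = random.Random(0)
    importance = [0.9, 0.1, 0.4, 0.8, 0.3, 0.2, 0.7, 0.5, 0.6, 0.05]
    print(sample_gap_sentences(importance, rng))
```

The noise means the same document can yield different masked sentences across epochs, so the model does not always see the identical top-ranked sentences as targets.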

Model Capabilities

Text Summarization
Multi-Document Summarization
Abstractive Summarization
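As a minimal usage sketch, the checkpoint can be loaded with the Hugging Face `transformers` library. The Hub identifier `google/pegasus-reddit_tifu`, the helper name `summarize`, and the generation settings are assumptions for illustration; they are not stated on this page. The sketch also assumes `transformers`, `torch`, and `sentencepiece` are installed, and it downloads the weights on first call.

```python
MODEL_NAME = "google/pegasus-reddit_tifu"  # assumed Hub identifier


def summarize(text, max_input_tokens=512, num_beams=4):
    """Generate an abstractive summary of `text` (hypothetical helper)."""
    # Imported lazily so the module can be loaded without the heavy dependencies.
    from transformers import PegasusForConditionalGeneration, PegasusTokenizer

    tokenizer = PegasusTokenizer.from_pretrained(MODEL_NAME)
    model = PegasusForConditionalGeneration.from_pretrained(MODEL_NAME)

    # Truncate long inputs; the limit here is an illustrative choice.
    batch = tokenizer(
        [text],
        truncation=True,
        max_length=max_input_tokens,
        return_tensors="pt",
    )
    summary_ids = model.generate(**batch, num_beams=num_beams)
    return tokenizer.batch_decode(summary_ids, skip_special_tokens=True)[0]


if __name__ == "__main__":
    post = "Replace this string with the document you want to summarize."
    print(summarize(post))
```

For repeated calls it would be more efficient to load the tokenizer and model once and reuse them; they are loaded inside the function here only to keep the sketch short.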

Use Cases

News Summarization
News Article Summarization
Generates concise summaries from long news articles.
Achieves a ROUGE-1 score of 44.16 on the CNN/DailyMail dataset.
Academic Paper Summarization
arXiv Paper Summarization
Generates summaries for academic papers.
Achieves a ROUGE-1 score of 44.21 on the arXiv dataset.
Technical Document Summarization
Patent Document Summarization
Generates summaries for patent documents.
Achieves a ROUGE-1 score of 52.29 on the BigPatent dataset.
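For readers unfamiliar with the metric, ROUGE-1 measures unigram overlap between a generated summary and a reference summary. The sketch below is a simplified illustration, not the scorer used to produce the numbers above: it assumes lowercased whitespace tokenization with no stemming, and the function name `rouge1` is hypothetical. Published scores are usually F-measures scaled to 0-100 and computed with a standard scoring package.

```python
from collections import Counter


def rouge1(reference, candidate):
    """Return unigram precision, recall, and F1 between two texts."""
    # Assumption: simple lowercase whitespace tokenization, no stemming.
    ref_counts = Counter(reference.lower().split())
    cand_counts = Counter(candidate.lower().split())

    # Clipped overlap: each word counts at most as often as it appears in both.
    overlap = sum((ref_counts & cand_counts).values())
    if overlap == 0:
        return {"precision": 0.0, "recall": 0.0, "f1": 0.0}

    precision = overlap / sum(cand_counts.values())
    recall = overlap / sum(ref_counts.values())
    f1 = 2 * precision * recall / (precision + recall)
    return {"precision": precision, "recall": recall, "f1": f1}


if __name__ == "__main__":
    scores = rouge1("the cat sat on the mat", "the cat lay on the mat")
    # Five of six unigrams match on each side, so all three values are 5/6.
    print({name: round(value, 4) for name, value in scores.items()})
```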