P

Pegasus Large

Developed by google
PEGASUS is an abstractive summarization model based on pre-training with gap sentences, developed by Google Research.
Downloads 43.35k
Release Time : 3/2/2022

Model Overview

PEGASUS is a pre-trained model specifically designed for abstractive summarization, utilizing gap sentences for pre-training and suitable for various summarization tasks.

Model Features

Mixed and Random Training
Trained on both C4 and HugeNews datasets with sample-size-weighted mixing ratio for 1.5 million steps.
Dynamic Sentence Sampling
Uniformly samples 15% to 45% gap sentence ratio with 20% uniform noise added to importance scores.
Improved Tokenizer
Updated SentencePiece tokenizer to support encoding line breaks, enhancing paragraph segmentation.

Model Capabilities

Text Summarization Generation
Multi-dataset Adaptation
Abstractive Summarization

Use Cases

News Summarization
CNN/DailyMail Summarization
Generates concise summaries for CNN/DailyMail news articles.
ROUGE-1/2/L: 44.16/21.56/41.30
XSum Summarization
Produces results for extreme summarization (single-sentence summary) tasks.
ROUGE-1/2/L: 47.60/24.83/39.64
Academic Paper Summarization
arXiv Summarization
Generates summaries for arXiv academic papers.
ROUGE-1/2/L: 44.21/16.95/25.67
PubMed Summarization
Generates summaries for PubMed medical papers.
ROUGE-1/2/L: 45.97/20.15/28.25
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase