D

Distill Pegasus Cnn 16 4

Developed by sshleifer
PEGASUS is an abstractive summarization model pre-trained with gap sentences, developed by Google Research.
Downloads 286
Release Time : 3/2/2022

Model Overview

PEGASUS is a pre-trained model specifically designed for text summarization, trained by extracting important sentences, suitable for various summarization tasks.

Model Features

Mixed and Random Training
Trained on both C4 and HugeNews datasets with mixed ratio weighting and random sampling strategies.
Enhanced Tokenizer
Updated SentencePiece tokenizer supporting newline encoding to preserve paragraph segmentation.
Flexible Sentence Sampling
Uniformly samples 15%-45% sentence gaps and adds 20% uniform noise to importance scores.

Model Capabilities

Text summarization
Multi-document summarization
Abstractive summarization

Use Cases

News Summarization
CNN/DailyMail Summarization
Generates concise summaries of news articles
ROUGE-1/2/L: 44.16/21.56/41.30
XSum Summarization
Generates extreme summarization (single-sentence summaries)
ROUGE-1/2/L: 47.60/24.83/39.64
Academic Literature Summarization
arXiv Paper Summarization
Generates summaries of academic papers
ROUGE-1/2/L: 44.21/16.95/25.67
PubMed Summarization
Generates summaries of medical literature
ROUGE-1/2/L: 45.97/20.15/28.25
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
Š 2025AIbase