P

Pegasus Billsum

Developed by google
PEGASUS is an abstractive summarization pre-trained model based on gap sentence extraction, focused on generating high-quality text summaries.
Downloads 295
Release Time : 3/2/2022

Model Overview

PEGASUS is a pre-trained model specifically designed for text summarization. It excels in multiple summarization tasks by leveraging gap sentence extraction during pre-training.

Model Features

Mixed and Random Training
Trained on both C4 and HugeNews datasets with mixed ratio weighting, increasing training steps to 1.5 million to enhance model performance.
Dynamic Sentence Sampling
Uniformly samples gap sentence ratios between 15% to 45% and adds 20% uniform noise during important sentence sampling to improve model robustness.
Improved Tokenizer
Updated SentencePiece tokenizer to support encoding line breaks, preventing information loss.

Model Capabilities

Text Summarization
Multilingual Support
High-precision Summarization

Use Cases

News Summarization
News Article Summarization
Generates concise summaries of news articles while retaining key information.
Achieved a ROUGE-1 score of 44.16 on the CNN/DailyMail dataset.
Academic Paper Summarization
Academic Paper Summarization
Generates summaries of academic papers to aid quick comprehension.
Achieved a ROUGE-1 score of 44.21 on the arXiv dataset.
Technical Document Summarization
Technical Document Summarization
Generates summaries of technical documents for quick browsing.
Achieved a ROUGE-1 score of 52.29 on the BigPatent dataset.
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
Š 2025AIbase