P

Pegasus Pubmed

Developed by google
PEGASUS is an abstract summarization pre-trained model based on gap sentence extraction, focusing on text summarization tasks.
Downloads 807
Release Time : 3/2/2022

Model Overview

PEGASUS is a pre-trained model specifically designed for generating high-quality text summaries. It employs gap sentence extraction for pre-training, enabling effective understanding and compression of textual content.

Model Features

Mixed and Random Training
Trained simultaneously on C4 and HugeNews datasets, using random sampling of gap sentence ratios and importance scores with added noise to enhance model performance.
Upgraded Tokenizer
Updated SentencePiece tokenizer to support newline encoding, improving paragraph segmentation handling.
Multi-Dataset Support
Excels on multiple text summarization datasets, including xsum, cnn_dailymail, newsroom, etc.

Model Capabilities

Text Summary Generation
Multi-Dataset Adaptation
Abstract Summarization

Use Cases

News Summarization
News Article Summarization
Generate concise summaries of news articles while retaining key information.
Achieved ROUGE scores of 44.16/21.56/41.30 on the cnn_dailymail dataset.
Academic Paper Summarization
Academic Paper Summarization
Generate succinct summaries of academic papers to aid quick comprehension.
Achieved ROUGE scores of 44.21/16.95/25.67 on the arxiv dataset.
Technical Document Summarization
Patent Document Summarization
Generate summaries of technical patent documents, extracting core innovation points.
Achieved ROUGE scores of 52.29/33.08/41.66 on the big_patent dataset.
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase