Summarization Open-Source Abstract Generation Model - Combining the Advantages of Extraction and Generation to Improve Information Readability

Summarization

Developed by LiqiangXiao

A hybrid summarization generation model based on hierarchical reinforcement learning, combining the advantages of extractive and abstractive summarization to enhance information richness and readability

Text Generation

Transformers

#Hybrid Summarization Generation #Hierarchical Reinforcement Learning #Information Retention Optimization

Downloads 18

Release Time : 3/2/2022

Model Overview

The model employs actor-critic reinforcement learning for training, utilizing hierarchical Transformer modules to represent articles at both word and sentence levels, achieving a summarization approach closer to human workflow

Model Features

Hybrid Summarization Framework

Automatically switches between 'copy sentences' and 'rewrite sentences' modes based on redundancy levels, combining the strengths of extractive and abstractive summarization

Hierarchical Reinforcement Learning

End-to-end reinforcement training method enhances collaboration between extraction and rewriting modules

Hierarchical Transformer

Represents articles at both word and sentence levels simultaneously, improving representation efficiency

Model Capabilities

Text Summarization Generation

Hierarchical Article Representation

Sentence-Level Content Selection

Use Cases

Content Summarization

News Summarization Generation

Generates informative and concise summaries for news articles

Achieved a 1.7-point improvement in ROUGE scores on the CNN/DailyMail dataset

Article Representation

Extracts article features using hierarchical representation modules

🚀 Copy-or-Rewrite

This repository contains a model for human-like summarization, trained with Actor-critic Reinforcement Learning. It significantly improves ROUGE scores and enhances the informativity and readability of summaries.

🚀 Quick Start

With this repository, you can generate informative and concise summaries for input articles. For other tasks, you may use the hierarchical representation module to effectively represent the article. The parameters of the model are pre-trained on the CNN/DM dataset. You may need to fine-tune it on your own dataset when needed.

from transformers import AutoTokenizer, AutoModelForSeq2SeqLM

tokenizer = AutoTokenizer.from_pretrained("LiqiangXiao/summarization")

model = AutoModelForSeq2SeqLM.from_pretrained("LiqiangXiao/summarization")

✨ Features

A model built for human-like summarization task and trained with Actor-critic Reinforcement Learning.
Significantly improved the ROUGE scores on CNN/DM dataset by 1.7 and augmented the informativity and readability of generated summaries.
Implemented a more human-like workflow for summarization task, solving the information loss problem.
Contains a novel hierarchical transformer module to represent articles at both word and sentence levels.
A new reinforcement learning method that can effectively train a two-step model.

📚 Documentation

Model description

Copy-or-Rewrite is a model to improve the workflow of summarization models. Existing methods that adopt an extract-then-abstract strategy have achieved impressive results, yet they suffer from the information loss in the abstraction step because they compress all the selected sentences without distinction. Especially when the whole sentence is summary-worthy, salient content would be lost by compression. To address this problem, we propose HYSUM, a hybrid framework for summarization that can flexibly switch between copying sentences and rewriting sentences according to the degree of redundancy. In this way, our approach can effectively combine the advantages of two branches of summarization, juggling informativity and conciseness. Moreover, we based on Hierarchical Reinforcement Learning, propose an end-to-end reinforcing method to bridge together the extraction module and rewriting module, which can enhance the cooperation between them. Automatic evaluation shows that our approach significantly outperforms the state-of-the-arts on the CNN/DailyMail corpus. Human evaluation also demonstrates that our generated summaries are more informative and concise than popular models.

📦 Installation

No specific installation steps are provided in the original README.

💻 Usage Examples

Basic Usage

from transformers import AutoTokenizer, AutoModelForSeq2SeqLM

tokenizer = AutoTokenizer.from_pretrained("LiqiangXiao/summarization")

model = AutoModelForSeq2SeqLM.from_pretrained("LiqiangXiao/summarization")

📄 License

No license information is provided in the original README.

📚 Training data

Property	Details
Training Data	This model used the non-anonymous version of CNN/Daily Mail dataset.

📚 BibTeX entry and citation info

@inproceedings{DBLP:conf/aaai/XiaoWHJ20,
  author    = {Liqiang Xiao and
               Lu Wang and
               Hao He and
               Yaohui Jin},
  title     = {Copy or Rewrite: Hybrid Summarization with Hierarchical Reinforcement
               Learning},
  booktitle = {The Thirty-Fourth {AAAI} Conference on Artificial Intelligence, {AAAI}
               2020, The Thirty-Second Innovative Applications of Artificial Intelligence
               Conference, {IAAI} 2020, The Tenth {AAAI} Symposium on Educational
               Advances in Artificial Intelligence, {EAAI} 2020, New York, NY, USA,
               February 7-12, 2020},
  pages     = {9306--9313},
  publisher = {{AAAI} Press},
  year      = {2020},
  url       = {https://aaai.org/ojs/index.php/AAAI/article/view/6470},
  timestamp = {Tue, 02 Feb 2021 08:00:14 +0100},
  biburl    = {https://dblp.org/rec/conf/aaai/XiaoWHJ20.bib},
  bibsource = {dblp computer science bibliography, https://dblp.org}
}

Featured Recommended AI Models

Empowering the Future, Your AI Solution Knowledge Base

English 简体中文繁體中文にほんご