KipperDev/bart_summarizer_model
A fine-tuned BART-base model for text summarization, capable of generating concise and informative summaries from long-form texts.
Quick Start
This model is a fine-tuned variant of [facebook/bart-base](https://huggingface.co/facebook/bart-base) designed for text summarization. It pairs BART's bidirectional (BERT-like) encoder with an autoregressive (GPT-like) decoder to generate concise, coherent, and informative summaries from long text documents.
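As a minimal sketch (assuming `transformers` is installed, see Installation below), the model can be loaded directly through the `summarization` pipeline; note the `summarize: ` prefix, which is required (see the note under Usage Examples):

```python
from transformers import pipeline

# Quick-start sketch: load the model via the summarization pipeline.
summarizer = pipeline("summarization", model="KipperDev/bart_summarizer_model")

# The "summarize: " prefix is required for the model to behave as intended.
print(summarizer("summarize: " + "Your long input text here.")[0]["summary_text"])
```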
⨠Features
- Specialized for text summarization, helping users quickly understand the essence of long-form texts.
- Trained on a large-scale patent dataset, ensuring broad coverage and robust performance.
- Evaluated using the ROUGE metric, showing good alignment with human-written abstracts.
📦 Installation
Install the `transformers` library with `pip`:
pip install transformers
💻 Usage Examples
Basic Usage
from transformers import AutoTokenizer, AutoModelForSeq2SeqLM, pipeline

model_name = "KipperDev/bart_summarizer_model"
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForSeq2SeqLM.from_pretrained(model_name)

# The model expects the "summarize: " prefix before the input text.
prefix = "summarize: "
input_text = "Your input text here."

# Option 1: use the high-level summarization pipeline.
summarizer = pipeline("summarization", model=model, tokenizer=tokenizer)
print(summarizer(prefix + input_text)[0]["summary_text"])

# Option 2: call the tokenizer and model directly.
input_ids = tokenizer.encode(prefix + input_text, return_tensors="pt")
summary_ids = model.generate(input_ids)
summary = tokenizer.decode(summary_ids[0], skip_special_tokens=True)
print(summary)
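If needed, summary length and decoding behaviour can be tuned through standard `generate()` arguments; the values below are purely illustrative and are not the settings used to produce the reported results:

```python
# Illustrative decoding settings (assumed values, not the model card's defaults).
summary_ids = model.generate(
    input_ids,
    max_length=256,      # upper bound on summary length in tokens
    min_length=64,       # avoid overly short summaries
    num_beams=4,         # beam search for more coherent output
    early_stopping=True,
)
print(tokenizer.decode(summary_ids[0], skip_special_tokens=True))
```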
⚠️ Important Note
FOR THE MODEL TO WORK AS INTENDED, YOU NEED TO PREPEND THE 'summarize: ' PREFIX TO THE INPUT TEXT
Documentation
Training Details
Training Data
The model was trained using the Big Patent Dataset, which consists of 1.3 million US patent documents and their corresponding human-written summaries. This dataset was selected due to its rich language and complex structure, which is representative of the challenging nature of document summarization tasks. Multiple subsets of the dataset were used during training to ensure broad coverage and robust model performance across varied document types.
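For illustration only (assuming the `datasets` library is available), the Big Patent data can be loaded from the Hugging Face Hub; the exact subsets and preprocessing used for this model are not detailed here:

```python
from datasets import load_dataset

# Illustrative only: load one Big Patent subset ("a" = Human Necessities).
# The subsets and preprocessing actually used for training are not specified in this card.
big_patent = load_dataset("big_patent", "a", split="train")

sample = big_patent[0]
print(sample["description"][:500])  # full patent text (model input)
print(sample["abstract"])           # human-written summary (target)
```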
Training Procedure
Training was carried out over three rounds. The initial settings included a learning rate of 0.00002, a batch size of 8, and 4 epochs. In subsequent rounds, these parameters were adjusted to 0.0003, 8, and 12 respectively to further refine model performance. A linear decay learning rate schedule was also applied to enhance model learning efficiency over time.
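As a rough sketch, these hyperparameters could be expressed with `Seq2SeqTrainingArguments` from `transformers`; the output directory and any argument not mentioned above are assumptions, and the actual training script is not part of this card:

```python
from transformers import Seq2SeqTrainingArguments

# Sketch of the reported first-round hyperparameters; later rounds used
# learning_rate=3e-4 and num_train_epochs=12. output_dir is hypothetical.
training_args = Seq2SeqTrainingArguments(
    output_dir="bart_summarizer_model",
    learning_rate=2e-5,                 # 0.00002
    per_device_train_batch_size=8,
    num_train_epochs=4,
    lr_scheduler_type="linear",         # linear learning-rate decay
    predict_with_generate=True,
)
```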
Training Results
Model performance was evaluated using the ROUGE metric, demonstrating its ability to generate summaries that closely match human-written abstracts.
| Property | Details |
|----------|---------|
| Evaluation Loss (Eval Loss) | 1.9244 |
| ROUGE-1 | 0.5007 |
| ROUGE-2 | 0.2704 |
| ROUGE-L | 0.3627 |
| ROUGE-Lsum | 0.3636 |
| Average Generation Length (Gen Len) | 122.1489 |
| Runtime (seconds) | 1459.3826 |
| Samples per Second | 1.312 |
| Steps per Second | 0.164 |
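For reference, ROUGE scores of this kind can be computed with the `evaluate` library; the snippet below is a generic sketch with placeholder data, not the evaluation script behind the numbers above:

```python
import evaluate

# Generic ROUGE sketch; the texts below are placeholders, not the evaluation data.
rouge = evaluate.load("rouge")
scores = rouge.compute(
    predictions=["generated summary text goes here"],
    references=["the corresponding human-written abstract"],
)
print(scores)  # rouge1, rouge2, rougeL, rougeLsum
```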
License
This project is licensed under the MIT license.
Citation
BibTeX:
@article{kipper_t5_summarizer,
// SOON
}
👥 Authors
This model card was written by Fernanda Kipper