Led Base 16384 Billsum Summarization
L
Led Base 16384 Billsum Summarization
Developed by AlgorithmicResearchGroup
This model is a fine-tuned version of led-base-16384 on the billsum dataset, specifically designed for long document summarization tasks.
Text Generation
Transformers Supports Multiple Languages#Long Text Summarization#Legal Document Processing#16K Context

Downloads 15
Release Time : 11/26/2022
Model Overview
A text summarization model based on the LED architecture, optimized for long documents such as legal texts, supporting input texts up to 16K in length.
Model Features
Long Text Processing
Supports processing long documents up to 16,384 tokens by replicating the position embedding matrix.
Legal Text Optimization
Fine-tuned on the billsum dataset, making it particularly suitable for summarizing formal documents like legal texts.
Efficient Encoding-Decoding
Utilizes the LED architecture, combining Longformer's encoding capabilities with BART's decoding capabilities.
Model Capabilities
Long Document Summarization
Legal Text Processing
Structured Information Extraction
Use Cases
Legal Document Processing
Legal Text Summarization
Automatically generates concise summaries of lengthy legal texts
ROUGE-1 score 47.672
Bill Content Extraction
Extracts key clauses and amendments from complex bills
ROUGE-L score 34.568
Government Document Processing
Policy Document Summarization
Generates executive summaries for government policy documents
Featured Recommended AI Models
Š 2025AIbase