L

Led Base 16384 Billsum Summarization

Developed by AlgorithmicResearchGroup
This model is a fine-tuned version of led-base-16384 on the billsum dataset, specifically designed for long document summarization tasks.
Downloads 15
Release Time : 11/26/2022

Model Overview

A text summarization model based on the LED architecture, optimized for long documents such as legal texts, supporting input texts up to 16K in length.

Model Features

Long Text Processing
Supports processing long documents up to 16,384 tokens by replicating the position embedding matrix.
Legal Text Optimization
Fine-tuned on the billsum dataset, making it particularly suitable for summarizing formal documents like legal texts.
Efficient Encoding-Decoding
Utilizes the LED architecture, combining Longformer's encoding capabilities with BART's decoding capabilities.

Model Capabilities

Long Document Summarization
Legal Text Processing
Structured Information Extraction

Use Cases

Legal Document Processing
Legal Text Summarization
Automatically generates concise summaries of lengthy legal texts
ROUGE-1 score 47.672
Bill Content Extraction
Extracts key clauses and amendments from complex bills
ROUGE-L score 34.568
Government Document Processing
Policy Document Summarization
Generates executive summaries for government policy documents
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
Š 2025AIbase