L

Lsg Bart Base 4096 Mediasum

Developed by ccdv
BART-base model based on LSG technology, fine-tuned for long text summarization tasks on the MediaSum dataset, supporting sequence processing up to 4096 tokens
Downloads 44
Release Time : 5/29/2022

Model Overview

This model employs local-sparse-global attention mechanisms to handle long sequences, making it suitable for long text summarization generation tasks. It is modified based on the BART-base architecture and fine-tuned on the MediaSum dialogue summarization dataset.

Model Features

Long Sequence Processing Capability
Supports input sequences up to 4096 tokens, efficiently processing long texts through local-sparse-global attention mechanisms
Multi-mode Sparse Attention
Offers various sparse attention modes including local/pooling/strided/block-strided/normalized/LSH for flexible selection
Resource Efficiency Optimization
Allows adjustment of block size (32-256) to balance performance and resource consumption, adapting to different hardware conditions

Model Capabilities

Long Text Summarization Generation
Dialogue Content Summarization
Multi-turn Dialogue Understanding

Use Cases

Media Content Processing
Interview Summarization Generation
Condenses lengthy media interview content into concise summaries
Achieves R1=35.16/R2=18.13/RL=31.54 on the MediaSum test set
Meeting Minutes Processing
Automatic Meeting Minutes Generation
Extracts key points from transcribed texts of lengthy meeting recordings
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
Š 2025AIbase