
Lsg Bart Base 16384 Arxiv

Developed by ccdv
A long-sequence model based on the BART architecture, optimized for summarizing scientific papers and supporting inputs of up to 16,384 tokens
Downloads 29
Release Time: 5/9/2022

Model Overview

This model uses a local-sparse-global (LSG) attention mechanism to handle long sequences and is fine-tuned on the scientific_papers arxiv dataset for scientific paper summarization
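A minimal usage sketch, assuming the checkpoint is published on the Hugging Face Hub as `ccdv/lsg-bart-base-16384-arxiv` and that its custom attention code must be loaded with `trust_remote_code=True`. Neither detail is stated on this page. The `summarize` helper, its `max_summary_tokens` parameter, and the beam setting are illustrative choices, not recommended values.

```python
MODEL_ID = "ccdv/lsg-bart-base-16384-arxiv"  # assumed Hub identifier, not given on this page
MAX_INPUT_TOKENS = 16384                     # input limit stated in this card


def summarize(text, max_summary_tokens=256):
    """Summarize one paper. Downloads the checkpoint on the first call."""
    # Imported lazily so the constants above are usable without transformers installed.
    from transformers import AutoModelForSeq2SeqLM, AutoTokenizer

    # trust_remote_code=True is an assumption: the LSG attention layers are
    # presumed to ship as custom code alongside the weights.
    tokenizer = AutoTokenizer.from_pretrained(MODEL_ID, trust_remote_code=True)
    model = AutoModelForSeq2SeqLM.from_pretrained(MODEL_ID, trust_remote_code=True)

    # Anything beyond the stated input limit is truncated rather than rejected.
    inputs = tokenizer(
        text,
        truncation=True,
        max_length=MAX_INPUT_TOKENS,
        return_tensors="pt",
    )
    output_ids = model.generate(
        **inputs,
        max_new_tokens=max_summary_tokens,  # illustrative value
        num_beams=2,                        # illustrative value
    )
    return tokenizer.decode(output_ids[0], skip_special_tokens=True)
```

Calling `summarize(paper_text)` would return the generated abstract as a string. Running it requires the `transformers` and `torch` packages and network access to fetch the weights.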

Model Features

Long Sequence Processing Capability
Supports long-text input up to 16,384 tokens, suitable for processing complete scientific papers
Local-Sparse-Global Attention Mechanism
Combines local, sparse, and global attention to balance computational efficiency and model performance on long sequences
Scientific Paper Optimization
Fine-tuned on the arxiv scientific papers dataset for academic text summarization
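To make the local-sparse-global idea concrete, here is a simplified sketch of which key positions a single query position might attend to. It is an illustration only, not the model's actual implementation: the window size, the stride-based sparse selection, and treating the first positions as global tokens are all assumptions chosen for readability.

```python
def attended_positions(i, seq_len, window=2, sparse_span=8, stride=4, n_global=1):
    """Return the sorted key positions that query position i may attend to.

    All parameters are hypothetical and chosen only for illustration.
    """
    # Global: the first n_global positions are visible to every query.
    keys = set(range(min(n_global, seq_len)))

    lo = max(0, i - window - sparse_span)
    hi = min(seq_len, i + window + sparse_span + 1)
    for j in range(lo, hi):
        if abs(j - i) <= window:
            keys.add(j)          # local: every neighbor inside the window
        elif j % stride == 0:
            keys.add(j)          # sparse: a strided subset just outside the window
    return sorted(keys)


def attended_pairs(seq_len, **kwargs):
    """Total number of (query, key) pairs under the sketched pattern."""
    return sum(len(attended_positions(i, seq_len, **kwargs)) for i in range(seq_len))


print(attended_positions(10, 32))  # [0, 4, 8, 9, 10, 11, 12, 16, 20]
```

Because each query sees a bounded number of keys, the number of attended pairs in this sketch grows linearly with sequence length, whereas full attention over 16,384 tokens would score 16,384² = 268,435,456 pairs.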

Model Capabilities

Long Text Summarization
Scientific Paper Comprehension
Academic Text Processing

Use Cases

Academic Research
Automatic Scientific Paper Summarization
Generates concise and accurate summaries for lengthy scientific papers
ROUGE-1: 48.74, ROUGE-2: 20.88, ROUGE-L: 28.50
Academic Literature Processing
Processes and analyzes long-form academic literature
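The ROUGE-1 and ROUGE-2 figures listed for summarization measure unigram and bigram overlap between a generated summary and a reference, presumably reported on a 0 to 100 scale. This page does not say which scorer produced them, so the following is only a rough sketch of the F1 variant, assuming lowercase whitespace tokenization and no stemming. The `rouge_n` function is a hypothetical helper, not the official scorer, and will not reproduce the reported numbers exactly.

```python
from collections import Counter


def rouge_n(candidate, reference, n=1):
    """F1 overlap of n-grams between candidate and reference (0.0 to 1.0)."""

    def ngrams(text):
        tokens = text.lower().split()  # assumption: whitespace tokenization
        return Counter(tuple(tokens[k:k + n]) for k in range(len(tokens) - n + 1))

    cand, ref = ngrams(candidate), ngrams(reference)
    # Counter intersection clips each n-gram to its minimum count in both texts.
    overlap = sum((cand & ref).values())
    if overlap == 0:
        return 0.0
    precision = overlap / sum(cand.values())
    recall = overlap / sum(ref.values())
    return 2 * precision * recall / (precision + recall)


generated = "the model summarizes long papers"
reference = "the model summarizes long scientific papers"
print(round(rouge_n(generated, reference, n=1), 3))  # 0.909
print(round(rouge_n(generated, reference, n=2), 3))  # 0.667
```

In this toy pair all five generated unigrams appear in the six-word reference, giving precision 1 and recall 5/6. Only three of the four generated bigrams match, which is why the bigram score is lower.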