L

Longformer Base 4096 Finetuned Squadv2

Developed by mrm8488
This model is a QA system based on the Longformer architecture, fine-tuned on the SQuAD v2 dataset, supporting long text sequences (up to 4096 tokens).
Downloads 190
Release Time : 3/2/2022

Model Overview

Longformer-base-4096 is a Transformer model designed for long documents, initialized from RoBERTa and fine-tuned on the SQuAD v2 dataset for QA tasks. It combines sliding window local attention with global attention mechanisms, making it suitable for long-document QA tasks.

Model Features

Long-Text Processing Capability
Supports sequences up to 4096 tokens, ideal for long-document QA tasks.
Hybrid Attention Mechanism
Combines sliding window local attention with global attention to capture long-range dependencies while maintaining efficiency.
High-Precision QA
Achieves 79.92% exact match and 83.35% F1 score on the SQuAD v2 validation set.

Model Capabilities

Long-form QA
Open-domain QA
No-Answer Detection

Use Cases

Document QA Systems
Legal Document Analysis
Extract answers to specific questions from lengthy legal documents.
Research Paper QA
Answer questions about academic papers or technical reports.
Customer Support
FAQ Automation
Answer customer questions from lengthy product documentation.
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
Š 2025AIbase