L

Longformer Base Plagiarism Detection

Developed by jpwahle
This model is trained using the Longformer architecture, specifically designed to detect machine-paraphrased plagiarized text, with significant application value in maintaining academic integrity.
Downloads 59.47k
Release Time : 3/2/2022

Model Overview

A plagiarism detection system fine-tuned on the Longformer-base-4096 pre-trained model, capable of identifying academically rewritten text using tools like SpinBot, achieving an average F1 score of 80.99%.

Model Features

Long Document Processing Capability
Utilizes sliding window attention mechanism to effectively process academic documents up to 4096 tokens in length.
Multi-Paraphrasing Tool Recognition
Optimized detection effectiveness for mainstream paraphrasing tools like SpinBot and SpinnerChief.
Academic Scenario Optimization
Excellent performance on academic texts such as preprints and dissertations (F1 score up to 99.68%).

Model Capabilities

Machine Paraphrased Text Recognition
Academic Plagiarism Detection
Long Text Semantic Analysis

Use Cases

Academic Integrity Maintenance
Thesis Plagiarism Detection
Identifies plagiarized content disguised using paraphrasing tools in student papers.
Achieves an F1 score of 99.68% for SpinBot-paraphrased text detection.
Publication Review Assistance
Assists journal editors in detecting potential plagiarism in submitted papers.
Outperforms traditional text-matching systems like Turnitin.
Education Quality Assurance
Homework Originality Check
Automatically screens student assignments for machine-paraphrased content.
Human evaluation consistency reaches 78.4%.
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase