Longformer Base Plagiarism Detection
This model is built on the Longformer architecture and is designed to detect machine-paraphrased plagiarism, which makes it useful for maintaining academic integrity.
Downloads 59.47k
Release Time: 3/2/2022
Model Overview
A plagiarism detection system fine-tuned from the pre-trained Longformer-base-4096 model. It identifies academic text that has been rewritten with paraphrasing tools such as SpinBot, and it achieves an average F1 score of 80.99%.
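For readers unfamiliar with the metric, F1 is the harmonic mean of precision and recall. The sketch below computes it from confusion-matrix counts. The function name and the example counts are made up for illustration and are not results from this model.

```python
def f1_score(true_positives: int, false_positives: int, false_negatives: int) -> float:
    """Return F1, the harmonic mean of precision and recall."""
    predicted_positive = true_positives + false_positives
    actual_positive = true_positives + false_negatives
    # With no predicted or no actual positives, precision or recall is undefined;
    # this sketch reports 0.0 in that case.
    if predicted_positive == 0 or actual_positive == 0:
        return 0.0
    precision = true_positives / predicted_positive
    recall = true_positives / actual_positive
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


# Hypothetical counts, for illustration only (not taken from the model's evaluation).
example_f1 = f1_score(true_positives=80, false_positives=20, false_negatives=20)
```

Here "positive" would mean "machine-paraphrased", so a false positive is an original text flagged as paraphrased.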
Model Features
Long Document Processing Capability
Uses a sliding-window attention mechanism to process academic documents up to 4096 tokens long.
Multi-Paraphrasing Tool Recognition
Tuned to detect output from mainstream paraphrasing tools such as SpinBot and SpinnerChief.
Academic Scenario Optimization
Performs strongly on academic texts such as preprints and dissertations, with F1 scores of up to 99.68%.
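The sliding-window attention mentioned under Long Document Processing Capability can be pictured as a banded mask: each token attends only to neighbors within a fixed distance, so the number of attended pairs grows roughly linearly with document length instead of quadratically. The following is a minimal sketch of that idea, not the model's actual implementation. The function names and the toy sizes are assumptions chosen for readability, and the sketch leaves out the global attention that Longformer additionally places on selected tokens.

```python
from typing import List


def sliding_window_mask(seq_len: int, window: int) -> List[List[bool]]:
    """Build a banded mask where mask[i][j] is True if token i may attend to token j."""
    half = window // 2  # tokens visible on each side of position i
    return [[abs(i - j) <= half for j in range(seq_len)] for i in range(seq_len)]


def attended_pairs(mask: List[List[bool]]) -> int:
    """Count the (query, key) pairs that the mask allows."""
    return sum(sum(row) for row in mask)


# Toy sizes for illustration; a real model uses far longer sequences and wider windows.
mask = sliding_window_mask(seq_len=8, window=4)
local_pairs = attended_pairs(mask)
full_pairs = 8 * 8  # what dense self-attention would compute
```

Comparing `local_pairs` with `full_pairs` for growing `seq_len` shows why a fixed window keeps long inputs tractable.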
Model Capabilities
Machine Paraphrased Text Recognition
Academic Plagiarism Detection
Long Text Semantic Analysis
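As a rough illustration of how a checkpoint like this is typically queried, the sketch below scores a single document with the Hugging Face `transformers` sequence-classification classes. It assumes the model is distributed in that format, which this page does not confirm. `model_id` is a placeholder for the checkpoint's hub ID or local path, and the helper names are hypothetical. The meaning of each returned label is defined by the checkpoint's own configuration and is not documented here.

```python
import math
from typing import Dict, List


def softmax(logits: List[float]) -> List[float]:
    """Convert raw classifier logits into probabilities that sum to 1."""
    peak = max(logits)  # subtract the max for numerical stability
    exps = [math.exp(x - peak) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def classify(text: str, model_id: str, max_tokens: int = 4096) -> Dict[str, float]:
    """Score one document. model_id is a placeholder supplied by the caller."""
    # Imported lazily so softmax() stays usable without these packages installed.
    import torch
    from transformers import AutoModelForSequenceClassification, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(model_id)
    model = AutoModelForSequenceClassification.from_pretrained(model_id)
    model.eval()

    # Truncate to the 4096-token limit stated for this model.
    inputs = tokenizer(text, truncation=True, max_length=max_tokens, return_tensors="pt")
    with torch.no_grad():
        logits = model(**inputs).logits[0].tolist()

    # Label names come from the checkpoint's config (assumption: it defines id2label).
    return {model.config.id2label[i]: p for i, p in enumerate(softmax(logits))}
```

Documents longer than the token limit are simply truncated in this sketch; splitting them into several chunks and aggregating the scores would be a separate design decision.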
Use Cases
Academic Integrity Maintenance
Thesis Plagiarism Detection
Identifies plagiarized content in student papers that has been disguised with paraphrasing tools.
Achieves an F1 score of 99.68% for SpinBot-paraphrased text detection.
Publication Review Assistance
Assists journal editors in detecting potential plagiarism in submitted papers.
Outperforms traditional text-matching systems such as Turnitin at detecting machine-paraphrased text.
Education Quality Assurance
Homework Originality Check
Automatically screens student assignments for machine-paraphrased content.
Consistency with human evaluation reaches 78.4%.