AI Detector

Developed by SuperAnnotate
A fine-tuned RoBERTa Large model for detecting AI-generated content
Downloads 2,160
Release Time: 9/25/2024

Model Overview

This model is designed to detect generated (synthetic) text. Such detection is useful for filtering training data and for identifying fraud in scientific and educational settings.

Model Features

Balanced Training Data
Trained on a balanced set of 44,000 samples comprising human-written text and content generated by 14 different LLMs.
Multi-Domain Coverage
Training data covers three major domains: Wikipedia, Reddit Q&A, and scientific research papers.
Anti-Overfitting Design
Key n-grams identified by chi-square tests are removed from the training data so that the model learns genuine features rather than superficial patterns.
Good Calibration
A tuned loss function and label smoothing keep predicted confidence in line with actual accuracy.
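
The chi-square n-gram screening and the label-smoothed loss mentioned above can be illustrated with a minimal sketch. This is not SuperAnnotate's training code. The whitespace tokenization, the bigram size, the function names, and the smoothing value of 0.1 are all assumptions made for illustration, since the page does not state them.

```python
import math
from collections import Counter


def ngrams(text, n=2):
    """Return the set of word n-grams in a text (lowercased, whitespace-split)."""
    tokens = text.lower().split()
    return {" ".join(tokens[i:i + n]) for i in range(len(tokens) - n + 1)}


def chi_square_scores(human_texts, ai_texts, n=2):
    """Chi-square statistic of each n-gram's presence against the class label.

    Uses the 2x2 table (n-gram present/absent) x (AI/human) per n-gram.
    A high score marks an n-gram that is strongly tied to one class.
    """
    human_df = Counter(g for t in human_texts for g in ngrams(t, n))
    ai_df = Counter(g for t in ai_texts for g in ngrams(t, n))
    n_human, n_ai = len(human_texts), len(ai_texts)
    total = n_human + n_ai
    scores = {}
    for gram in set(human_df) | set(ai_df):
        a = ai_df[gram]       # AI texts containing the n-gram
        b = human_df[gram]    # human texts containing the n-gram
        c = n_ai - a          # AI texts without it
        d = n_human - b       # human texts without it
        denom = (a + b) * (c + d) * (a + c) * (b + d)
        scores[gram] = total * (a * d - b * c) ** 2 / denom if denom else 0.0
    return scores


def smoothed_bce(prob, label, smoothing=0.1):
    """Binary cross-entropy against a smoothed target.

    With smoothing=0.1 (an assumed value), label 1 becomes 0.95 and
    label 0 becomes 0.05, which penalizes overconfident predictions.
    """
    target = label * (1.0 - smoothing) + 0.5 * smoothing
    eps = 1e-12  # guards against log(0)
    return -(target * math.log(prob + eps)
             + (1.0 - target) * math.log(1.0 - prob + eps))
```

In a pipeline of this kind, the highest-scoring n-grams would be candidates for removal or masking before training. The smoothed loss is lowest when the predicted probability equals the smoothed target rather than 1.0, which is what discourages overconfidence.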

Model Capabilities

Detect AI-generated text
Identify content from large language models
Distinguish between human-written and machine-generated content
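
As a rough illustration of how a binary detector's raw output becomes one of the two verdicts above, the sketch below maps a single logit to a probability and a label. It assumes the classifier emits one logit per text, with higher values meaning "more likely AI-generated". The function name `verdict` and the 0.5 threshold are hypothetical and are not taken from this page.

```python
import math


def verdict(logit, threshold=0.5):
    """Map a raw classifier logit to (label, probability of AI-generated)."""
    # Numerically stable sigmoid: avoids overflow for large negative logits.
    if logit >= 0:
        prob_ai = 1.0 / (1.0 + math.exp(-logit))
    else:
        e = math.exp(logit)
        prob_ai = e / (1.0 + e)
    label = "AI-generated" if prob_ai >= threshold else "human-written"
    return label, prob_ai
```

Raising the threshold trades missed AI-generated text for fewer false accusations, which may matter more in academic-integrity settings than in data filtering.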

Use Cases

Education Sector
Academic Integrity Detection
Identify AI-generated content in student assignments
Achieves 98.5% accuracy in detecting GPT-4-generated text
Data Filtering
Training Data Purification
Filter synthetic text from datasets
Achieves 98% accuracy in detecting LLaMA-Chat-generated content
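
The data-filtering use case, and the way an accuracy figure like those above is computed, can be sketched as follows. The scorer is passed in as a callable, so nothing here depends on the detector's actual API. `filter_synthetic`, `accuracy`, and `toy_score` are hypothetical names, and `toy_score` is a trivial stand-in, not the model.

```python
def filter_synthetic(texts, score_fn, threshold=0.5):
    """Split texts into (kept, dropped) using an AI-probability scorer.

    score_fn is any callable returning a probability in [0, 1] that a
    text is AI-generated; texts scoring at or above threshold are dropped.
    """
    kept, dropped = [], []
    for text in texts:
        (dropped if score_fn(text) >= threshold else kept).append(text)
    return kept, dropped


def accuracy(predictions, labels):
    """Fraction of predictions that match the ground-truth labels."""
    if not labels or len(predictions) != len(labels):
        raise ValueError("predictions and labels must be non-empty and equal in length")
    return sum(p == l for p, l in zip(predictions, labels)) / len(labels)


def toy_score(text):
    """Stand-in scorer for demonstration only; a real detector goes here."""
    return 0.9 if "as an ai" in text.lower() else 0.1
```

To evaluate a real detector, its predictions would be compared against held-out texts with known origin. The per-model figures quoted above would correspond to `accuracy` computed on samples from a single generator.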