R

Roberta Toxicity Classifier

Developed by s-nlp
A toxicity classification model fine-tuned based on RoBERTa-large, trained on the Jigsaw competition dataset for identifying toxic content in English text.
Downloads 80.61k
Release Time : 3/2/2022

Model Overview

This model is specifically designed for toxicity classification of English comments, effectively identifying harmful content in text. Trained on 2 million samples, it demonstrates excellent performance on test sets.

Model Features

High-performance classification
Achieves outstanding performance on the Jigsaw competition test set with AUC-ROC 0.98 and F1 score 0.76
Large-scale training data
Incorporates approximately 2 million English samples from three Jigsaw competitions for training
Optimized based on RoBERTa
Utilizes the robustly optimized RoBERTa-large pre-trained model for fine-tuning

Model Capabilities

Text toxicity classification
Harmful content detection
Natural language processing

Use Cases

Content moderation
Social media comment filtering
Automatically identifies and filters harmful comments on social media platforms
Effectively reduces toxic content on platforms
Online community management
Helps forum and community administrators quickly identify inappropriate remarks
Improves community content quality
Academic research
Language toxicity research
Used to study toxic characteristics and patterns in online language
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase