V

Vit Base Nsfw Detector

Developed by AdamCodd
An image classification model based on Vision Transformer (ViT) architecture, specifically designed to detect whether images contain NSFW (Not Safe For Work) content.
Downloads 1.2M
Release Time : 1/3/2024

Model Overview

This model is a fine-tuned version of google/vit-base-patch16-384, used to classify images as SFW (Safe For Work) or NSFW (Not Safe For Work). The training data includes approximately 25,000 images (paintings, photos, etc.).

Model Features

High accuracy
Achieves an accuracy of 96.54% and an AUC of 0.9948 on the evaluation dataset.
Conservative classification strategy
The model adopts a conservative approach during training, classifying 'sexy' images as NSFW to ensure safe content is not misclassified.
Support for multiple image types
Trained on various image types (realistic, 3D, paintings), demonstrating strong generalization capabilities.

Model Capabilities

Image classification
NSFW content detection
SFW content detection

Use Cases

Content moderation
Social media content filtering
Automatically detects whether user-uploaded images contain NSFW content, assisting platforms in content moderation.
96.54% accuracy, effectively reducing manual review workload.
Safe search filtering
Filters out unsafe image content in search engines or image libraries.
High accuracy and AUC values ensure effective filtering.
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
Š 2025AIbase