R

Russian Inappropriate Messages

Developed by apanc
Designed to detect inappropriate content in Russian that lacks profanity but may harm the speaker's reputation
Downloads 4,039
Release Time : 3/2/2022

Model Overview

This model serves as an additional layer after toxicity filtering, specifically detecting subtly inappropriate messages in Russian. Based on sensitive topic classification, it can identify potentially harmful expressions such as justifying violence or offending religious sentiments.

Model Features

Fine-grained inappropriateness detection
Focuses on non-toxic but reputation-damaging expressions, such as justifying criminal behavior or offending religious sentiments
Sensitive topic correlation
Detection of inappropriate content strongly associated with specific sensitive topics (e.g., religion, crime)
Multi-stage filtering
Designed as a complementary filtering layer after toxicity detection, forming a multi-stage content moderation process

Model Capabilities

Russian text classification
Inappropriate content identification
Sensitive topic correlation analysis

Use Cases

Content moderation
Social media filtering
Additional inappropriate content detection after basic toxicity filtering
Can reduce missed inappropriate content by 89% (test set accuracy)
Corporate reputation protection
Detecting potentially damaging employee/user statements
Identifies non-explicit but potentially risky expressions
Academic research
Linguistic behavior analysis
Studying linguistic features of inappropriate expressions in Russian
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase