Russian Toxicity Classifier Open-Source Model - Accurately Identify Toxic Comments in Russian Texts

Russian Toxicity Classifier

Developed by s-nlp

A toxic-comment classification model for Russian, fine-tuned from Conversational RuBERT, that labels Russian text as toxic or non-toxic.

Text Classification

Transformers

Tags: #Russian Toxicity Detection #High-precision Classification #Social Media Moderation

Downloads: 17.93k

Release Time: 3/2/2022

Model Overview

This model is a BERT-based classifier designed to identify toxic comments in Russian text. It was trained on a merge of two Russian toxic-comment datasets and reaches an accuracy of 0.97 on the test set.

Model Features

High Accuracy

Achieved an accuracy of 0.97 on the test set, with an F1 score of 0.93 for toxic comments.

Multi-source Data Training

Trained on a merge of two Russian toxic-comment datasets, one collected from 2ch.hk and one from ok.ru, so the training data covers more than one platform.

Based on Conversational RuBERT

Fine-tuned from DeepPavlov/rubert-base-cased-conversational, which makes it well suited to conversational text.
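A minimal sketch of how such a fine-tuned checkpoint might be queried with the Transformers library. The Hub identifier `s-nlp/russian_toxicity_classifier` is an assumption (this page names only the developer and the model), as is the choice of index 1 as the toxic class; check both against the published model configuration before relying on them. The helper names `softmax` and `toxic_probability` are illustrative.

```python
import math

# Assumed Hub identifier; not stated on this page.
MODEL_ID = "s-nlp/russian_toxicity_classifier"


def softmax(logits):
    """Convert raw logits into probabilities that sum to 1."""
    peak = max(logits)  # subtract the max for numerical stability
    exps = [math.exp(x - peak) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def toxic_probability(text, model_id=MODEL_ID):
    """Return the probability of the assumed toxic class (index 1) for one text."""
    # Imported lazily so softmax() stays usable without torch/transformers installed.
    import torch
    from transformers import AutoModelForSequenceClassification, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(model_id)
    model = AutoModelForSequenceClassification.from_pretrained(model_id)
    model.eval()

    inputs = tokenizer(text, return_tensors="pt", truncation=True, max_length=512)
    with torch.no_grad():
        logits = model(**inputs).logits[0].tolist()
    return softmax(logits)[1]
```

Calling `toxic_probability("ты супер")` downloads the weights on first use. In a real service the tokenizer and model would be loaded once and reused rather than reloaded per call.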

Model Capabilities

Russian Text Classification

Toxic Content Detection

Comment Content Analysis

Use Cases

Content Moderation

Social Media Comment Filtering

Automatically identify and filter toxic comments on social media platforms

Test-set accuracy of 0.97, with precision 0.94 and recall 0.92 on toxic comments

Forum Content Management

Assist forum administrators in identifying and handling toxic remarks

F1 score of 0.93 on toxic comments, which supports flagging comments for manual review
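The two use cases above can be sketched as a small routing step. This assumes a toxicity probability in [0, 1] is already available from the classifier. The function name `route_comment` and both thresholds are hypothetical placeholders, not values recommended by the model's authors; they would need tuning on a platform's own data.

```python
def route_comment(toxic_prob, review_at=0.5, hide_at=0.9):
    """Map a toxicity probability to a moderation action.

    Both thresholds are illustrative placeholders, not tuned values.
    """
    if not 0.0 <= toxic_prob <= 1.0:
        raise ValueError("toxic_prob must be in [0, 1]")
    if toxic_prob >= hide_at:
        return "hide"      # confident enough to filter automatically
    if toxic_prob >= review_at:
        return "review"    # uncertain: send to a human moderator
    return "publish"


# Hypothetical scores for three comments
for score in (0.03, 0.62, 0.97):
    print(score, route_comment(score))
```

Keeping a middle "review" band reflects the forum use case: borderline comments go to a moderator instead of being removed automatically.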

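Classification report on the test set (class 1 is the toxic class, matching the F1 score of 0.93 reported above):

Class	precision	recall	f1-score	support
0	0.98	0.99	0.98	21384
1	0.94	0.92	0.93	4886
accuracy			0.97	26270
macro avg	0.96	0.96	0.96	26270
weighted avg	0.97	0.97	0.97	26270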
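As a consistency check on the report, the per-class F1 scores and the weighted average can be recomputed from the rounded precision, recall, and support values. This is a sketch over the rounded figures only, so it reproduces the report to two decimals rather than exactly.

```python
# Per-class rows copied from the report: (precision, recall, f1, support)
report = {
    "0": (0.98, 0.99, 0.98, 21384),
    "1": (0.94, 0.92, 0.93, 4886),
}


def f1_score(precision, recall):
    """Harmonic mean of precision and recall."""
    return 2 * precision * recall / (precision + recall)


total = sum(row[3] for row in report.values())
# Weighted average: each class's F1 weighted by its share of the test set.
weighted_f1 = sum(row[2] * row[3] for row in report.values()) / total
toxic_share = report["1"][3] / total

for label, (p, r, f1, n) in report.items():
    print(label, round(f1_score(p, r), 2), f1)
print(total, round(weighted_f1, 2), round(toxic_share, 3))
```

Toxic comments make up about 18.6% of the test set, so a classifier that always predicted class 0 would already reach roughly 0.81 accuracy. The per-class F1 of 0.93 is therefore the more informative figure for moderation.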

Property	Details
Model Type	BERT-based classifier
Training Data	Merge of Russian Language Toxic Comments dataset from 2ch.hk and Toxic Russian Comments dataset from ok.ru
License	OpenRAIL++
Base Model	DeepPavlov/rubert-base-cased-conversational
Tags	toxic comments classification
