# 🚀 Turkish Toxic Language Detection Model
This model is a fine-tuned version of `dbmdz/bert-base-turkish-cased` for binary toxicity classification in Turkish text. It was trained on a cleaned and preprocessed version of the `Overfit-GM/turkish-toxic-language` dataset.
## ✨ Features
- **Accurate Classification:** Achieves high accuracy, F1-score, precision, and recall in classifying toxic and non-toxic Turkish text.
- **Cleaned Data Training:** Trained on preprocessed text with basic cleaning and slang filtering.
## 📦 Installation

The model only requires the Hugging Face `transformers` library and PyTorch, which can be installed with `pip install transformers torch`.
## 💻 Usage Examples

### Basic Usage
```python
from transformers import AutoTokenizer, AutoModelForSequenceClassification
import torch

tokenizer = AutoTokenizer.from_pretrained("fc63/turkish_toxic_language_detection_model")
model = AutoModelForSequenceClassification.from_pretrained("fc63/turkish_toxic_language_detection_model")

def predict_toxicity(text):
    # Tokenize with fixed-length padding (max_length=128)
    inputs = tokenizer(text, return_tensors="pt", truncation=True, padding="max_length", max_length=128)
    with torch.no_grad():  # inference only; no gradients needed
        outputs = model(**inputs)
    predicted = torch.argmax(outputs.logits, dim=1).item()
    return "Toxic" if predicted == 1 else "Non-Toxic"
```
## 📚 Documentation

### Performance
| Metric       | Non-Toxic | Toxic | Macro Avg |
|--------------|-----------|-------|-----------|
| Precision    | 0.96      | 0.95  | 0.96      |
| Recall       | 0.95      | 0.96  | 0.96      |
| F1-score     | 0.96      | 0.96  | 0.96      |
| Accuracy     |           |       | 0.96      |
| Test Samples | 5400      | 5414  | 10814     |
### Confusion Matrix

|                 | Pred: Non-Toxic | Pred: Toxic |
|-----------------|-----------------|-------------|
| True: Non-Toxic | 5154            | 246         |
| True: Toxic     | 200             | 5214        |
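As a sanity check, the headline numbers can be reproduced directly from the confusion matrix above:

```python
# Recompute toxic-class precision/recall and overall accuracy from the matrix
tn, fp, fn, tp = 5154, 246, 200, 5214

precision_toxic = tp / (tp + fp)             # 5214 / 5460 ≈ 0.95
recall_toxic = tp / (tp + fn)                # 5214 / 5414 ≈ 0.96
accuracy = (tp + tn) / (tn + fp + fn + tp)   # 10368 / 10814 ≈ 0.96
```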
### Preprocessing Details (`cleaned_corrected_text`)

The model is trained on the `cleaned_corrected_text` column, which is derived from `corrected_text` using basic regex-based cleaning steps and manual slang filtering.
#### Cleaning Function

```python
import re

def clean_corrected_text(text):
    text = text.lower()                                                       # lowercase everything
    text = re.sub(r"http\S+|www\S+|https\S+", '', text, flags=re.MULTILINE)   # drop URLs
    text = re.sub(r"@\w+", '', text)                                          # drop @username mentions
    text = re.sub(r"[^\w\s.,!?-]", '', text)                                  # strip emojis and symbols
    text = re.sub(r"\s+", ' ', text).strip()                                  # collapse whitespace
    return text
```
Manual Slang Filtering
slang_words = ["kanka", "lan", "knk", "bro", "la", "birader", "kanki"]
def remove_slang(text):
for word in slang_words:
text = text.replace(word, "")
return text.strip()
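Taken together, the two steps behave as follows (the sample sentence is illustrative, not from the original card):

```python
raw = "Kanka şu linke bak https://example.com 😂 @user ÇOK komik!!"
print(remove_slang(clean_corrected_text(raw)))
# -> "şu linke bak çok komik!!"
```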
Applied Steps Summary
Step |
Description |
Lowercasing |
All text is converted to lowercase |
URL removal |
Removes links containing http, www, https |
Mention removal |
Removes @username style mentions |
Special character removal |
Removes emojis and symbols (e.g., *, %, $, ^, etc.) |
Whitespace normalization |
Collapses multiple spaces into one |
Slang word removal |
Removes common informal words like "kanka", "lan", etc. |
**Conclusion:** `cleaned_corrected_text` is a lightly cleaned text column with no further linguistic processing, and the model is trained directly on it.
### Training Details

- **Trainer:** Hugging Face `Trainer` API (see the sketch below)
- **Epochs:** 3
- **Batch size:** 16
- **Learning rate:** 2e-5
- **Eval strategy:** epoch-based
- **Undersampling:** Applied to balance the class distribution
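A minimal sketch of how these hyperparameters map onto the `Trainer` API (the dataset variables and `output_dir` are placeholders, not from the original card):

```python
from transformers import TrainingArguments, Trainer

training_args = TrainingArguments(
    output_dir="turkish-toxicity",      # placeholder output path
    num_train_epochs=3,                 # Epochs: 3
    per_device_train_batch_size=16,     # Batch size: 16
    learning_rate=2e-5,                 # Learning rate: 2e-5
    eval_strategy="epoch",              # evaluate once per epoch
)

trainer = Trainer(
    model=model,                        # the sequence-classification model loaded above
    args=training_args,
    train_dataset=train_dataset,        # placeholder: balanced (undersampled) training split
    eval_dataset=eval_dataset,          # placeholder: held-out evaluation split
)
trainer.train()
```

Note: on `transformers` versions before 4.41 the argument is named `evaluation_strategy` rather than `eval_strategy`.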
### Dataset

- **Dataset used:** `Overfit-GM/turkish-toxic-language`
- **Final dataset size after preprocessing and balancing:** 54,068 samples
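The card does not include the balancing code itself; a hypothetical undersampling step along these lines would produce such a balanced dataset (the column names and seed are assumptions):

```python
import pandas as pd

# df is assumed to hold the preprocessed data with "text" and "label" columns
def undersample(df: pd.DataFrame, seed: int = 42) -> pd.DataFrame:
    n_min = df["label"].value_counts().min()   # size of the smallest class
    return (
        df.groupby("label", group_keys=False)
          .apply(lambda g: g.sample(n=n_min, random_state=seed))
          .reset_index(drop=True)
    )
```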
## 🔧 Technical Details

The model is a fine-tuned version of `dbmdz/bert-base-turkish-cased` for binary toxicity classification in Turkish text. Preprocessing consists of basic regex-based cleaning and manual slang filtering. Training uses the Hugging Face `Trainer` API with the hyperparameters listed above and undersampling to balance the class distribution.
## 📄 License
This project is licensed under the MIT license.