Kcelectra Base Bad Sentence Classifier

Developed by JminJ
A Korean text classification model based on the ELECTRA architecture, designed to determine whether comments and chat messages contain sensitive information.
Downloads 46
Release Time: 4/7/2022

Model Overview

This model is an ELECTRA model fine-tuned to detect inappropriate content (such as sensitive information and hate speech) in Korean text. It is trained on public datasets, but the training data itself is not released due to copyright issues.

Model Features

Multi-dataset fusion training
Combines the Korean Unsmile and Korean HateSpeech datasets and relabels them into a binary classification format
Specific sensitive word processing
Special tagging for sentences containing specific Korean sensitive words (e.g., '~노', '좆', etc.)
Multi-model comparison
Trains and compares performance using three different Korean ELECTRA models
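
The relabeling and word-tagging steps described above might look roughly like the following sketch. It is not the author's code: the label schemes (a `clean` flag for Unsmile-style rows; `none` / `offensive` / `hate` for HateSpeech-style rows), the helper names, and the reading of '~노' as a word-final ending are all assumptions made for illustration.

```python
from typing import Dict, Iterable, List, Tuple

# Hypothetical marker lists; the source names only '~노' and '좆' as examples.
SUFFIX_MARKERS = ("노",)       # '~노' is read here as a word-final ending (assumption)
SUBSTRING_MARKERS = ("좆",)


def unsmile_to_binary(row: Dict[str, int]) -> int:
    """Assumed multi-label row with a 'clean' flag: returns 0 for clean, 1 for bad."""
    return 0 if row.get("clean", 0) == 1 else 1


def hatespeech_to_binary(label: str) -> int:
    """Assumed labels 'none' / 'offensive' / 'hate': returns 0 for 'none', 1 otherwise."""
    return 0 if label == "none" else 1


def has_marker(sentence: str) -> bool:
    """True if the sentence contains a marker substring or a word ending in a marker suffix."""
    if any(m in sentence for m in SUBSTRING_MARKERS):
        return True
    words = [w.strip(".,!?~") for w in sentence.split()]
    return any(w.endswith(s) for w in words for s in SUFFIX_MARKERS)


def merge(
    unsmile_rows: Iterable[Tuple[str, Dict[str, int]]],
    hatespeech_rows: Iterable[Tuple[str, str]],
) -> List[Tuple[str, int, bool]]:
    """Combine both sources into (text, binary_label, marker_flag) triples."""
    merged = []
    for text, row in unsmile_rows:
        merged.append((text, unsmile_to_binary(row), has_marker(text)))
    for text, label in hatespeech_rows:
        merged.append((text, hatespeech_to_binary(label), has_marker(text)))
    return merged
```

The marker flag is kept as a separate field here because the source does not say how tagged sentences were ultimately labeled.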

Model Capabilities

Korean text classification
Sensitive content detection
Hate speech recognition

Use Cases

Content moderation
Social media comment filtering
Automatically identifies and filters inappropriate comments on social media
Reported accuracy of 88.49% with the kcElectra_base model
Chat content monitoring
Real-time monitoring of inappropriate speech in chat applications
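
As a rough illustration of the filtering and monitoring use cases, the sketch below keeps only the comments that a classifier does not flag. The classifier is passed in as a plain callable so the example runs without downloading anything; `stub_classify`, the `"bad"` / `"ok"` label names, and the 0.5 threshold are hypothetical, since the source does not state the model's actual label names or a recommended threshold.

```python
from typing import Callable, Dict, Iterable, List

Prediction = Dict[str, object]  # expected shape: {"label": str, "score": float}


def moderate(
    comments: Iterable[str],
    classify: Callable[[str], Prediction],
    bad_label: str = "bad",   # hypothetical label name
    threshold: float = 0.5,   # hypothetical cut-off
) -> List[str]:
    """Return the comments that are not flagged as bad with score >= threshold."""
    kept = []
    for comment in comments:
        pred = classify(comment)
        is_bad = pred["label"] == bad_label and float(pred["score"]) >= threshold
        if not is_bad:
            kept.append(comment)
    return kept


def stub_classify(text: str) -> Prediction:
    """Stand-in for a real model: flags any text containing the placeholder 'BAD'."""
    if "BAD" in text:
        return {"label": "bad", "score": 0.9}
    return {"label": "ok", "score": 0.8}


visible = moderate(["hello", "some BAD comment", "nice post"], stub_classify)
```

With the Hugging Face `transformers` library, a `pipeline("text-classification", model=...)` object called on a single string returns a list containing one dict with `label` and `score` keys, so a small adapter such as `lambda text: clf(text)[0]` could replace the stub; the model identifier and its label names would need to be checked against the published checkpoint.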