Kcelectra Base Bad Sentence Classifier
Developed by JminJ
A Korean text classification model based on the ELECTRA architecture, designed to determine whether comments and chat messages contain sensitive or inappropriate content
Downloads 46
Release Time: 4/7/2022
Model Overview
This model is fine-tuned from ELECTRA specifically to detect inappropriate content (such as sensitive information and hate speech) in Korean text. It was trained on public datasets, but the training data itself is not released because of copyright restrictions.
Model Features
Multi-dataset fusion training
Combines the Korean Unsmile and Korean HateSpeech datasets and relabels them into a binary classification format
Specific sensitive word processing
Sentences containing specific Korean sensitive words (e.g., '~노', '좆') receive special tagging
Multi-model comparison
Trains and compares performance using three different Korean ELECTRA models
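The relabeling and sensitive-word tagging described above can be illustrated with a minimal sketch. It is not the author's preprocessing code: the label names ("clean", "none"), the 1 = bad / 0 = ok encoding, the reading of '~노' as a word-final ending, and the rule that a tagged sentence is forced to the bad class are all assumptions made for illustration.

```python
from typing import Mapping

BAD, OK = 1, 0  # assumed encoding; the page does not state which value means "bad"

# Hypothetical marker lists built only from the two examples given in the text.
SUBSTRING_MARKERS = ("좆",)
SUFFIX_MARKERS = ("노",)  # '~노' interpreted here as a word-final ending (assumption)


def binary_from_multilabel(row: Mapping[str, int], clean_key: str = "clean") -> int:
    """Collapse a multi-label row to BAD if any non-clean category is set."""
    flagged = any(value == 1 for key, value in row.items() if key != clean_key)
    return BAD if flagged else OK


def binary_from_single_label(label: str, clean_label: str = "none") -> int:
    """Collapse a single categorical label to BAD unless it is the clean label."""
    return OK if label == clean_label else BAD


def has_sensitive_marker(sentence: str) -> bool:
    """Return True if the sentence contains one of the marker patterns."""
    if any(marker in sentence for marker in SUBSTRING_MARKERS):
        return True
    for token in sentence.split():
        word = token.rstrip(".,!?~ㅋㅎ")  # drop trailing punctuation and laughter
        if any(word.endswith(s) and len(word) > len(s) for s in SUFFIX_MARKERS):
            return True
    return False


def relabel(sentence: str, base_label: int) -> int:
    """Force BAD when a marker is present; otherwise keep the dataset label."""
    return BAD if has_sensitive_marker(sentence) else base_label
```

A naive suffix match like this also flags ordinary words that happen to end in 노 (for example 피아노, "piano"), so a real pipeline would need an exception list or morphological analysis.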
Model Capabilities
Korean text classification
Sensitive content detection
Hate speech recognition
Use Cases
Content moderation
Social media comment filtering
Automatically identifies and filters inappropriate comments on social media
Reported accuracy: 88.49% (with the kcElectra_base model)
Chat content monitoring
Real-time monitoring of inappropriate speech in chat applications
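As a rough sketch of how the comment-filtering and chat-monitoring use cases might be wired up, the code below splits messages into kept and blocked lists using any classifier that returns a (label, score) pair. The function names, the threshold, and the label strings are hypothetical. This page does not give the model's hub ID or its output label names, so both are left as parameters rather than guessed; the optional wrapper assumes the Hugging Face transformers library is installed.

```python
from typing import Callable, Iterable, List, Tuple

Prediction = Tuple[str, float]  # (label, confidence score)


def filter_comments(
    comments: Iterable[str],
    classify: Callable[[str], Prediction],
    bad_label: str,
    threshold: float = 0.5,
) -> Tuple[List[str], List[str]]:
    """Split comments into (kept, blocked) based on the classifier output."""
    kept: List[str] = []
    blocked: List[str] = []
    for text in comments:
        label, score = classify(text)
        if label == bad_label and score >= threshold:
            blocked.append(text)
        else:
            kept.append(text)
    return kept, blocked


def make_hf_classifier(model_id: str) -> Callable[[str], Prediction]:
    """Wrap a transformers text-classification pipeline as a classify() callable.

    model_id must be supplied by the caller; it is not stated on this page.
    """
    from transformers import pipeline  # imported lazily; optional dependency

    clf = pipeline("text-classification", model=model_id)

    def classify(text: str) -> Prediction:
        result = clf(text)[0]  # the pipeline returns a list of dicts
        return result["label"], float(result["score"])

    return classify


def stub_classify(text: str) -> Prediction:
    """Stand-in classifier for trying the filter without downloading a model."""
    return ("bad", 0.9) if "bad" in text else ("ok", 0.8)
```

Calling filter_comments(["hello", "bad word"], stub_classify, bad_label="bad") exercises the filter offline; for real use, a callable from make_hf_classifier would replace the stub, with bad_label set to whatever label the model actually emits for inappropriate text.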