Bert Spanish Toxicity
B
Bert Spanish Toxicity
Developed by bgonzalezbustamante
A toxicity detection model fine-tuned based on BETO (Spanish BERT base model), designed to identify toxic content in Spanish texts.
Downloads 85
Release Time : 11/4/2024
Model Overview
This model is specifically designed for toxicity classification in Spanish texts, capable of distinguishing between non-toxic (NONTOXIC) and toxic (TOXIC) content, primarily used for social media content moderation and online interaction analysis.
Model Features
Spanish-specific
Fine-tuned based on the BERT model (BETO) optimized for Spanish, delivering excellent performance in detecting toxic content in Spanish.
Trained on protest event data
Trained using real social media data from protest events in Latin America, making it particularly suitable for analyzing toxic language in high-conflict scenarios.
Gold standard dataset
Training data comes from a meticulously annotated gold standard dataset, containing approximately 5 million data points.
Model Capabilities
Spanish text classification
Toxic content detection
Social media content analysis
Use Cases
Content moderation
Social media toxic comment filtering
Automatically identify and filter toxic comments in Spanish social media
Accuracy 83.5%, F1 score 84.9%
Social research
Protest event language analysis
Analyze toxicity levels in social media interactions during protest events
Particularly suitable for analyzing protest events in Spanish-speaking countries like Argentina and Chile
Featured Recommended AI Models