B

Bert Spanish Toxicity

Developed by bgonzalezbustamante
A toxicity detection model fine-tuned based on BETO (Spanish BERT base model), designed to identify toxic content in Spanish texts.
Downloads 85
Release Time : 11/4/2024

Model Overview

This model is specifically designed for toxicity classification in Spanish texts, capable of distinguishing between non-toxic (NONTOXIC) and toxic (TOXIC) content, primarily used for social media content moderation and online interaction analysis.

Model Features

Spanish-specific
Fine-tuned based on the BERT model (BETO) optimized for Spanish, delivering excellent performance in detecting toxic content in Spanish.
Trained on protest event data
Trained using real social media data from protest events in Latin America, making it particularly suitable for analyzing toxic language in high-conflict scenarios.
Gold standard dataset
Training data comes from a meticulously annotated gold standard dataset, containing approximately 5 million data points.

Model Capabilities

Spanish text classification
Toxic content detection
Social media content analysis

Use Cases

Content moderation
Social media toxic comment filtering
Automatically identify and filter toxic comments in Spanish social media
Accuracy 83.5%, F1 score 84.9%
Social research
Protest event language analysis
Analyze toxicity levels in social media interactions during protest events
Particularly suitable for analyzing protest events in Spanish-speaking countries like Argentina and Chile
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase