BERT Spanish Toxicity Open-Source Model - Accurately Identify Toxic Content in Spanish Texts

Bert Spanish Toxicity

Developed by bgonzalezbustamante

A toxicity detection model fine-tuned based on BETO (Spanish BERT base model), designed to identify toxic content in Spanish texts.

Text Classification

Transformers

SpanishOpen Source License:MIT #Spanish toxicity detection #Protest event analysis #Social media content moderation

Downloads 85

Release Time : 11/4/2024

Model Overview

This model is specifically designed for toxicity classification in Spanish texts, capable of distinguishing between non-toxic (NONTOXIC) and toxic (TOXIC) content, primarily used for social media content moderation and online interaction analysis.

Model Features

Spanish-specific

Fine-tuned based on the BERT model (BETO) optimized for Spanish, delivering excellent performance in detecting toxic content in Spanish.

Trained on protest event data

Trained using real social media data from protest events in Latin America, making it particularly suitable for analyzing toxic language in high-conflict scenarios.

Gold standard dataset

Training data comes from a meticulously annotated gold standard dataset, containing approximately 5 million data points.

Model Capabilities

Spanish text classification

Toxic content detection

Social media content analysis

Use Cases

Content moderation

Social media toxic comment filtering

Automatically identify and filter toxic comments in Spanish social media

Accuracy 83.5%, F1 score 84.9%

Social research

Protest event language analysis

Analyze toxicity levels in social media interactions during protest events

Particularly suitable for analyzing protest events in Spanish-speaking countries like Argentina and Chile

Property	Details
Accuracy	0.835
Precision	0.816
Recall	0.886
F1 - Score	0.849

Featured Recommended AI Models

Empowering the Future, Your AI Solution Knowledge Base

English 简体中文繁體中文にほんご

Bert Spanish Toxicity

Model Overview

Model Features

Model Capabilities

Use Cases

🚀 Fined-tuned BERT for Toxicity Classification in Spanish

🚀 Quick Start

✨ Features

💻 Usage Examples

Basic Usage

Output

📚 Documentation

Validation Metrics

📄 License