# 🚀 ModernBERT-large Bias Type Classifier
This model is fine-tuned from ModernBERT-large to classify text bias into multiple categories, which is useful for bias detection and mitigation in NLP tasks.
## 🚀 Quick Start

This model was fine-tuned from [ModernBERT-large](https://huggingface.co/answerdotai/ModernBERT-large) on a synthetic dataset of biased statements and questions. The dataset was generated by Mistral 7B as part of the GUS-Net paper. The model can identify and classify text bias into multiple categories, such as racial, religious, gender, age, and other biases.

## ✨ Features

- Multi-label Classification: Capable of classifying text bias into 11 different categories.
- Fine-tuned Model: Fine-tuned from [ModernBERT-large](https://huggingface.co/answerdotai/ModernBERT-large) for better bias detection performance.
- Useful for NLP Tasks: A valuable tool for bias detection and mitigation in natural language processing tasks.
## 📦 Installation

The model is loaded through the `transformers` library, which you can install via `pip install transformers`.
## 💻 Usage Examples

### Basic Usage

```python
from transformers import pipeline

classifier = pipeline(
    "text-classification",
    model="cirimus/modernbert-large-bias-type-classifier",
    top_k=None,  # return scores for all labels (replaces the deprecated return_all_scores=True)
)

text = "Tall people are so clumsy."
predictions = classifier(text)

# Print the five highest-scoring bias categories
for pred in sorted(predictions[0], key=lambda x: x["score"], reverse=True)[:5]:
    print(f"{pred['label']}: {pred['score']:.3f}")
```
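Because this is a multi-label classifier, each label receives an independent score, and several labels can fire at once. One simple way to turn the scores into label decisions is to keep every label whose score clears the 0.5 threshold used in the evaluation below. A minimal sketch (the `mock_scores` values are illustrative, not real model output):

```python
THRESHOLD = 0.5  # decision threshold used in the evaluation section

def labels_above_threshold(scores, threshold=THRESHOLD):
    """Return labels whose score clears the threshold, highest-scoring first."""
    hits = [p for p in scores if p["score"] >= threshold]
    return [p["label"] for p in sorted(hits, key=lambda p: p["score"], reverse=True)]

# Mock pipeline output for one input text (scores are made up for illustration):
mock_scores = [
    {"label": "physical", "score": 0.91},
    {"label": "age", "score": 0.07},
    {"label": "gender", "score": 0.62},
]
print(labels_above_threshold(mock_scores))  # → ['physical', 'gender']
```

In practice you may want to tune the threshold per label; the per-label results below all use 0.5.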
## 📚 Documentation

### Model Details

| Property | Details |
|---|---|
| Base Model | [ModernBERT-large](https://huggingface.co/answerdotai/ModernBERT-large) |
| Fine-Tuning Dataset | Synthetic biased corpus |
| Number of Labels | 11 |
| Problem Type | Multi-label classification |
| Language | English |
| License | MIT |
| Fine-Tuning Framework | Hugging Face Transformers |
### How the Model Was Created

The model was fine-tuned for bias detection using the following hyperparameters:
- Learning Rate: 3e-5
- Batch Size: 16
- Weight Decay: 0.01
- Warmup Steps: 500
- Optimizer: AdamW
- Evaluation Metrics: Precision, Recall, F1 Score (weighted), Accuracy
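With 500 warmup steps, the learning rate typically ramps linearly from 0 to the 3e-5 peak before decaying. A sketch of that schedule, assuming the linear-warmup-then-linear-decay shape common in Transformers fine-tuning (the total step count here is a hypothetical value, not stated in this card):

```python
def lr_at_step(step, peak_lr=3e-5, warmup_steps=500, total_steps=5000):
    """Linear warmup to peak_lr over warmup_steps, then linear decay to 0.

    total_steps=5000 is an assumed value for illustration only.
    """
    if step < warmup_steps:
        return peak_lr * step / warmup_steps  # warmup phase
    # decay phase: linearly down to 0 at total_steps
    return peak_lr * max(0.0, (total_steps - step) / (total_steps - warmup_steps))

print(lr_at_step(250))  # halfway through warmup → 1.5e-05
print(lr_at_step(500))  # peak → 3e-05
```

This shape means early updates are small, which helps stabilize fine-tuning of a large pretrained model.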
### Dataset

The synthetic dataset consists of biased statements and questions generated by Mistral 7B as part of the GUS-Net paper. It covers 11 bias categories:
- Racial
- Religious
- Gender
- Age
- Nationality
- Sexuality
- Socioeconomic
- Educational
- Disability
- Political
- Physical
### Evaluation Results

The model was evaluated on the synthetic dataset’s test split. The overall metrics at a decision threshold of 0.5 are as follows:

Macro Averages:

| Metric | Value |
|---|---|
| Accuracy | 0.983 |
| Precision | 0.930 |
| Recall | 0.914 |
| F1 | 0.921 |
| MCC | 0.912 |
Per-Label Results:

| Label | Accuracy | Precision | Recall | F1 | MCC | Support | Threshold |
|---|---|---|---|---|---|---|---|
| Racial | 0.975 | 0.871 | 0.889 | 0.880 | 0.866 | 388 | 0.5 |
| Religious | 0.994 | 0.962 | 0.970 | 0.966 | 0.962 | 335 | 0.5 |
| Gender | 0.976 | 0.930 | 0.925 | 0.927 | 0.913 | 615 | 0.5 |
| Age | 0.990 | 0.964 | 0.931 | 0.947 | 0.941 | 375 | 0.5 |
| Nationality | 0.972 | 0.924 | 0.881 | 0.902 | 0.886 | 554 | 0.5 |
| Sexuality | 0.993 | 0.960 | 0.957 | 0.958 | 0.955 | 301 | 0.5 |
| Socioeconomic | 0.964 | 0.909 | 0.818 | 0.861 | 0.842 | 516 | 0.5 |
| Educational | 0.982 | 0.873 | 0.933 | 0.902 | 0.893 | 330 | 0.5 |
| Disability | 0.986 | 0.923 | 0.887 | 0.905 | 0.897 | 283 | 0.5 |
| Political | 0.988 | 0.958 | 0.938 | 0.948 | 0.941 | 438 | 0.5 |
| Physical | 0.993 | 0.961 | 0.920 | 0.940 | 0.936 | 238 | 0.5 |
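The macro averages are the unweighted means of the per-label columns. As a sanity check, averaging the eleven per-label F1 values from the table recovers the reported macro F1:

```python
# Per-label F1 scores copied from the table above
per_label_f1 = {
    "racial": 0.880, "religious": 0.966, "gender": 0.927, "age": 0.947,
    "nationality": 0.902, "sexuality": 0.958, "socioeconomic": 0.861,
    "educational": 0.902, "disability": 0.905, "political": 0.948,
    "physical": 0.940,
}

# Macro average: unweighted mean over labels
macro_f1 = sum(per_label_f1.values()) / len(per_label_f1)
print(round(macro_f1, 3))  # → 0.921, matching the macro F1 reported above
```

Note that the macro average treats every label equally regardless of support, so small categories like Physical (238 examples) count as much as Gender (615 examples).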
### Intended Use
The model is designed to detect and classify bias in text across 11 categories. It can be used in applications such as:
- Content moderation
- Bias analysis in research
- Ethical AI development
### Limitations and Biases

⚠️ Important Note
- The dataset consists of synthetic text, which may not fully represent real-world biases.
- Certain biases may overlap, leading to challenges in precise classification.
- The model may not generalize well to domains outside the synthetic dataset’s scope.
### Environmental Impact
- Hardware Used: NVIDIA RTX 4090
- Training Time: ~2 hours
- Carbon Emissions: ~0.08 kg CO2 (calculated via the ML CO2 Impact Calculator)
## 📄 License
This model is released under the MIT license.
## 🔧 Technical Details

The model was fine-tuned from [ModernBERT-large](https://huggingface.co/answerdotai/ModernBERT-large) on a synthetic dataset. The fine-tuning process used the hyperparameters and evaluation metrics listed above to achieve strong performance in bias classification.
## 📖 Citation

If you use this model, please cite it as follows:

```bibtex
@misc{JunquedeFortuny2025c,
  title = {Bias Detection with ModernBERT-Large},
  author = {Enric Junqué de Fortuny},
  year = {2025},
  howpublished = {\url{https://huggingface.co/cirimus/modernbert-large-bias-type-classifier}},
}
```