Piiranha

Developed by scampion
A token classification model fine-tuned from ModernBERT-base to identify and classify Personally Identifiable Information (PII) in text.
Downloads 79
Release Date: 1/29/2025

Model Overview

This model was trained on the ai4privacy/pii-masking-400k dataset and detects 17 categories of PII. It is suited to privacy protection applications such as data anonymization, information masking, and compliance with data protection regulations.

Model Features

Multi-category PII Detection
Identifies 17 categories of Personally Identifiable Information (PII)
High-precision Identification
Achieves 92.1% precision and 92.7% recall on the validation set
Privacy Protection Optimization
Optimized for privacy protection scenarios such as data anonymization and masking
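
The precision and recall figures above can be combined into an F1 score, which is their harmonic mean. The short sketch below does that arithmetic; the function name `f1_score` is illustrative and is not taken from the model's own evaluation code.

```python
def f1_score(precision: float, recall: float) -> float:
    """Harmonic mean of precision and recall."""
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


# Validation-set figures quoted in this listing
reported_precision = 0.921
reported_recall = 0.927

f1 = f1_score(reported_precision, reported_recall)
print(round(f1, 3))  # 0.924
```

The result agrees with the F1 score of 0.924 listed under Use Cases.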

Model Capabilities

Identification of Personally Identifiable Information in text
Privacy data classification
Sensitive information detection

Use Cases

Data Privacy Protection
Data Anonymization
Automatically identifies and tags PII in datasets so that it can be anonymized
Reported F1 score: 0.924
Compliance Checking
Helps organizations check whether their data complies with privacy regulations such as GDPR
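
As a rough illustration of the anonymization use case, the sketch below replaces detected spans with label placeholders. It assumes the model is loaded through the Hugging Face `transformers` token-classification pipeline with `aggregation_strategy="simple"`, which returns dictionaries containing `entity_group`, `start`, and `end`. The Hub identifier is not given in this listing, so `model_id` is left as a required argument. The helper names `detect_pii` and `mask_pii`, the sample sentence, and the label names in the sample spans are all hypothetical and may not match the model's actual label set.

```python
from typing import Dict, List


def detect_pii(text: str, model_id: str) -> List[Dict]:
    """Run a token-classification pipeline and return aggregated entity spans.

    model_id is an assumption supplied by the caller; this listing does not
    state the model's Hub identifier.
    """
    # Imported lazily so mask_pii can be used without transformers installed.
    from transformers import pipeline

    tagger = pipeline(
        "token-classification",
        model=model_id,
        aggregation_strategy="simple",  # merge word pieces into whole spans
    )
    return tagger(text)


def mask_pii(text: str, entities: List[Dict]) -> str:
    """Replace each detected span with a [LABEL] placeholder."""
    masked = text
    # Work from the end of the text backwards so earlier offsets stay valid.
    for ent in sorted(entities, key=lambda e: e["start"], reverse=True):
        placeholder = f"[{ent['entity_group']}]"
        masked = masked[: ent["start"]] + placeholder + masked[ent["end"]:]
    return masked


# Hand-written spans standing in for model output (labels are illustrative).
sample = "Contact Jane Doe at jane@example.com."
sample_spans = [
    {"entity_group": "GIVENNAME", "start": 8, "end": 12},
    {"entity_group": "SURNAME", "start": 13, "end": 16},
    {"entity_group": "EMAIL", "start": 20, "end": 36},
]

print(mask_pii(sample, sample_spans))
# Contact [GIVENNAME] [SURNAME] at [EMAIL].
```

In practice the spans would come from `detect_pii(text, model_id)` rather than being written by hand. Masking from right to left avoids having to recompute character offsets after each replacement.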