D

Distilbert Finetuned Ai4privacy V2

Developed by Isotonic
A PII (Personally Identifiable Information) recognition model fine-tuned based on distilbert-base-uncased, designed to detect and remove sensitive information from text
Downloads 3,499
Release Time : 11/20/2023

Model Overview

This model is fine-tuned on the world's largest open-source privacy dataset, capable of identifying 54 types of sensitive information, suitable for privacy protection in AI assistants and LLM scenarios

Model Features

Extensive PII Recognition Capability
Supports identification of 54 types of sensitive data, including financial information, identity markers, contact details, etc.
Efficient Lightweight Model
Based on the DistilBERT architecture, it reduces computational resource requirements while maintaining high accuracy
Multi-scenario Applicability
Training data covers 229 discussion topics and 5 interaction styles, suitable for various text scenarios

Model Capabilities

Sensitive information detection in text
Personally identifiable information recognition
Privacy data classification
Multi-category entity recognition

Use Cases

Privacy Protection
AI Chat Log Anonymization
Automatically identifies and masks sensitive information in chat logs
F1 score reaches 0.9549
Document Privacy Review
Scans documents for personally identifiable information to comply with privacy regulations like GDPR
Email recognition F1 score 1.0
Data Security
Log Anonymization
Automatically removes sensitive data from system logs
IP address recognition F1 score 0.4349
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
Š 2025AIbase