D

Deid Roberta I2b2

Developed by obi
This model is a sequence labeling model fine-tuned on RoBERTa, designed to identify and remove Protected Health Information (PHI/PII) from medical records.
Downloads 1.1M
Release Time : 3/2/2022

Model Overview

This model is specifically designed for de-identification of Electronic Health Records (EHR), capable of recognizing and classifying 11 types of Protected Health Information entities as defined by HIPAA, including dates, medical personnel, hospitals, ages, etc.

Model Features

HIPAA Compliance
Strictly adheres to the 11 categories of PHI identification standards defined by HIPAA regulations
Context Awareness
Adds 32 tokens of contextual information before and after each sentence to improve recognition accuracy
BILOU Annotation
Uses the BILOU annotation scheme to aggregate token-level predictions into complete entity segments

Model Capabilities

Medical Text Analysis
Sensitive Information Identification
Entity Classification
Text De-identification

Use Cases

Medical Data Privacy Protection
Electronic Health Record Anonymization
Automatically removes patient personal information before sharing medical records
F1 score meets industry standards
Clinical Research Data Preparation
Prepares de-identified patient data for research purposes
Preserves clinical value while protecting patient privacy
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
Š 2025AIbase