Sentencesegmenter MIMIC

Developed by dongfangxu
This model is based on BiomedBERT and is used for sentence segmentation of MIMIC-III clinical records, predicting BIO annotations.
Downloads 14.96k
Release Time: 4/22/2025

Model Overview

This is a token classification model designed for clinical text. It predicts sentence boundaries by assigning each token one of three labels: B (the token begins a sentence), I (the token is inside a sentence), or O (the token lies outside any sentence).
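To make the B/I/O labels concrete, here is a minimal sketch of how per-token predictions could be grouped back into sentences. It does not load the model itself. The function name `bio_to_sentences`, the sample tokens, and the literal label strings "B", "I", and "O" are assumptions made for illustration; the labels the model actually emits may be named differently.

```python
from typing import List


def bio_to_sentences(tokens: List[str], tags: List[str]) -> List[List[str]]:
    """Group tokens into sentences using per-token B/I/O tags.

    Hypothetical helper: assumes the tags are the plain strings
    "B", "I", and "O", which may not match the model's label names.
    """
    if len(tokens) != len(tags):
        raise ValueError("tokens and tags must have the same length")

    sentences: List[List[str]] = []
    current: List[str] = []
    for token, tag in zip(tokens, tags):
        if tag == "B":
            # A new sentence starts here; close the one in progress, if any.
            if current:
                sentences.append(current)
            current = [token]
        elif tag == "I":
            # Continue the open sentence. An I with no preceding B is
            # tolerated and simply starts a new sentence.
            current.append(token)
        else:
            # "O": the token belongs to no sentence, so it is dropped
            # and any open sentence is closed.
            if current:
                sentences.append(current)
            current = []
    if current:
        sentences.append(current)
    return sentences


# Made-up example input, not taken from MIMIC-III.
tokens = ["Pt", "denies", "chest", "pain", ".", "___", "Vitals", "stable", "."]
tags = ["B", "I", "I", "I", "I", "O", "B", "I", "I"]
sentences = bio_to_sentences(tokens, tags)
```

In this sketch the placeholder token tagged O is excluded from both sentences, which is one reasonable way to treat text that falls outside any sentence, such as de-identification markers or layout fragments.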

Model Features

Clinical Text Optimization
Optimized for biomedical and clinical text based on BiomedBERT.
BIO Annotation
Identifies sentence boundaries by tagging each token with the BIO annotation scheme.
MIMIC-III Training
Trained on the MIMIC-III clinical records dataset, suitable for real-world clinical data.

Model Capabilities

Clinical Text Processing
Sentence Boundary Detection
Sequence Labeling

Use Cases

Clinical Record Processing
Electronic Health Record Analysis
Automatically segments sentences in clinical records for subsequent information extraction and analysis.
Relevant performance metrics were reported in the EMNLP 2024 paper.