S

Sd Ner

Developed by EMBO
A named entity recognition model fine-tuned on English scientific texts in the life sciences domain, based on the RoBERTa base model
Downloads 14
Release Time : 3/2/2022

Model Overview

This model is specifically designed to identify biological entities in the SourceData annotation system, including 7 types of biomedical entities such as small molecules, gene products, and subcellular components

Model Features

Specialized for Biomedical Domain
Optimized for life science literature, capable of accurately identifying biomedical entities
Multi-category Entity Recognition
Can identify 7 types of biomedical entities, including gene products and small molecules
Optimized Based on RoBERTa
Further trained on biomedical corpora based on the RoBERTa base model

Model Capabilities

Biomedical Entity Recognition
Scientific Text Analysis
Multi-category Classification

Use Cases

Biomedical Literature Analysis
Research Paper Entity Extraction
Extract key biological entities from papers in the life sciences domain
F1 score reaches 0.74 (micro-average)
Experimental Data Annotation
Automatically annotate key information such as experimental methods and cell types
Gene product recognition F1 score reaches 0.82
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
Š 2025AIbase