S

Sapbert UMLS 2020AB All Lang From XLMR

Developed by cambridgeltl
SapBERT is a multilingual biomedical entity linking model based on XLM-RoBERTa, trained with UMLS data, excelling in cross-lingual biomedical term semantic representation
Downloads 430.74k
Release Time : 3/2/2022

Model Overview

SapBERT is a cross-lingual biomedical entity linking model based on the XLM-RoBERTa architecture, specifically designed for handling semantic representation and matching tasks of multilingual biomedical terms. It is trained using the UMLS knowledge base and can generate high-quality term embedding vectors.

Model Features

Cross-lingual Capability
Based on the XLM-RoBERTa architecture, supporting multilingual biomedical term processing
Domain-specific Optimization
Specifically trained using the UMLS biomedical knowledge base
Efficient Semantic Representation
Generates compact semantic embedding vectors through the [CLS] token

Model Capabilities

Biomedical term semantic representation
Cross-lingual term matching
Entity linking
Term embedding generation

Use Cases

Biomedical Information Retrieval
Cross-lingual Medical Term Search
Establishing semantic associations between medical terms in different languages
Improving the accuracy of multilingual medical literature retrieval
Clinical Data Standardization
Non-standard Clinical Term Mapping
Mapping non-standard terms in clinical records to standard medical terms
Enhancing the data quality of electronic health records
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
Š 2025AIbase