Sapbert UMLS 2020AB All Lang From XLMR
S
Sapbert UMLS 2020AB All Lang From XLMR
Developed by cambridgeltl
SapBERT is a multilingual biomedical entity linking model based on XLM-RoBERTa, trained with UMLS data, excelling in cross-lingual biomedical term semantic representation
Downloads 430.74k
Release Time : 3/2/2022
Model Overview
SapBERT is a cross-lingual biomedical entity linking model based on the XLM-RoBERTa architecture, specifically designed for handling semantic representation and matching tasks of multilingual biomedical terms. It is trained using the UMLS knowledge base and can generate high-quality term embedding vectors.
Model Features
Cross-lingual Capability
Based on the XLM-RoBERTa architecture, supporting multilingual biomedical term processing
Domain-specific Optimization
Specifically trained using the UMLS biomedical knowledge base
Efficient Semantic Representation
Generates compact semantic embedding vectors through the [CLS] token
Model Capabilities
Biomedical term semantic representation
Cross-lingual term matching
Entity linking
Term embedding generation
Use Cases
Biomedical Information Retrieval
Cross-lingual Medical Term Search
Establishing semantic associations between medical terms in different languages
Improving the accuracy of multilingual medical literature retrieval
Clinical Data Standardization
Non-standard Clinical Term Mapping
Mapping non-standard terms in clinical records to standard medical terms
Enhancing the data quality of electronic health records
Featured Recommended AI Models
Š 2025AIbase