B

Bsc Bio Es

Developed by PlanTL-GOB-ES
Pre-trained language model specifically designed for the Spanish biomedical domain, suitable for clinical NLP tasks
Downloads 162
Release Time : 4/8/2022

Model Overview

Biomedical domain-specific model based on the RoBERTa architecture, trained on 963 million tokens of Spanish biomedical corpus, supports masked language modeling tasks, particularly suitable for clinical text processing

Model Features

Domain Specialization
Specifically trained on Spanish biomedical texts, including professional corpora such as clinical cases and medical literature
High Performance
Outperforms general and multilingual models in three clinical NER tasks such as PharmaCoNER
Large-scale Training Data
Trained on 963 million tokens of cleaned biomedical corpus, including multi-source data such as medical crawlers, clinical cases, and patent data

Model Capabilities

Biomedical text understanding
Clinical entity recognition
Medical text classification
Medical text completion

Use Cases

Clinical Information Extraction
Drug Name Recognition
Identify chemical and drug mentions from clinical texts
Achieved 0.8907 F1 score on PharmaCoNER task
Oncology Terminology Recognition
Identify Spanish oncology morphological terms
Achieved 0.8220 F1 score on CANTEMIST task
Clinical Document Analysis
Discharge Report Analysis
Process clinical variables in stroke patient discharge reports
Achieved 0.8727 F1 score on ICTUSnet dataset
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase