C

Camembert Bio Base

Developed by almanach
CamemBERT - bio is a language model optimized for the French biomedical field. It is based on camembert - base and undergoes continuous pre - training, showing excellent performance in biomedical named entity recognition tasks.
Downloads 6,029
Release Time : 2/23/2023

Model Overview

CamemBERT - bio is an advanced French biomedical language model. Through continuous pre - training on a large - scale French biomedical corpus, its performance in biomedical named entity recognition tasks has been significantly improved.

Model Features

Optimized for professional fields
Designed specifically for the French biomedical field, it performs excellently in biomedical named entity recognition tasks and shows significant performance improvement compared to the base model.
Trained with rich corpus
Trained using a large - scale French biomedical corpus containing scientific literature, drug labels, and clinical cases, with a wide coverage of data.
Efficient training
Using the continuous pre - training method, it has lower computational cost and higher efficiency compared to training from scratch.

Model Capabilities

French biomedical text understanding
Biomedical named entity recognition
Clinical document information extraction

Use Cases

Clinical research
Medical report information extraction
Extract information from unstructured documents in the hospital's clinical data warehouse to support clinical research
The F1 score increased by 2.54 points on the clinical dataset
Drug information processing
Drug label analysis
Extract key information from drug labels
The F1 score reached 76.71 on the EMEA dataset
Scientific literature processing
Biomedical literature analysis
Process and analyze French biomedical scientific literature
The F1 score reached 68.47 on the MEDLINE dataset
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase