BioM-ALBERT-xxlarge Open-source Biomedical Language Model


Developed by sultan

Large-scale biomedical language model based on BERT, ALBERT, and ELECTRA, specialized for biomedical domain tasks

Large Language Model

Transformers

#Biomedical NLP #PubMed Pretraining #TPU Efficient Training

Release Time: 3/2/2022

Model Overview

BioM-Transformers are large Transformer models pretrained for the biomedical domain under different design choices. They achieve state-of-the-art results on several biomedical tasks.

Model Features

Biomedical Domain Specialization

Pretrained specifically on PubMed abstracts using a domain-specific vocabulary for biomedicine

Multi-architecture Support

Provides different architecture variants based on BERT, ALBERT, and ELECTRA

Efficient Training

Trained efficiently on TPUv3-512 units with a large batch size of 8192

Resource-friendly

Provides a PyTorch XLA fine-tuning example that runs on the free TPU resources offered by Google Colab and Kaggle

Model Capabilities

Biomedical text classification

Named Entity Recognition

Question Answering System

Factoid question answering

Biomedical relation extraction

Use Cases

Biomedical Research

Chemical-Protein Relation Identification

Classifying chemical-protein relations on the ChemProt dataset

Achieved state-of-the-art performance

Biomedical Named Entity Recognition

Identifying specialized terms and entities in biomedical texts

Medical QA Systems

SQuAD2.0 QA

Processing standard QA datasets

BioASQ7B Factoid QA

Answering fact-based questions in the biomedical domain
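For the named entity recognition use case listed above, a token-classification model typically emits one tag per token, and those tags then have to be grouped into entity spans. The sketch below shows one common way to do that, assuming a BIO tagging scheme. The function name `bio_to_spans`, the label names `Chemical` and `Gene`, and the example sentence are illustrative assumptions, not taken from the BioM-Transformers repository.

```python
from typing import List, Optional, Tuple


def bio_to_spans(tags: List[str]) -> List[Tuple[str, int, int]]:
    """Group BIO tags into (entity_type, start, end_exclusive) token spans.

    A stray "I-X" tag with no matching open entity is treated leniently
    as the start of a new entity of type X.
    """
    spans: List[Tuple[str, int, int]] = []
    start: Optional[int] = None
    etype: Optional[str] = None
    for i, tag in enumerate(tags):
        # An "I-" tag of the same type simply extends the open entity.
        if tag.startswith("I-") and etype == tag[2:]:
            continue
        # Any other tag closes the entity that is currently open.
        if etype is not None:
            spans.append((etype, start, i))
            start, etype = None, None
        # "B-" (or a stray "I-") opens a new entity.
        if tag.startswith("B-") or tag.startswith("I-"):
            start, etype = i, tag[2:]
    # Close an entity that runs to the end of the sequence.
    if etype is not None:
        spans.append((etype, start, len(tags)))
    return spans


# Hypothetical tokens and tags, for illustration only.
tokens = ["Aspirin", "inhibits", "cyclooxygenase", "-", "1", "."]
tags = ["B-Chemical", "O", "B-Gene", "I-Gene", "I-Gene", "O"]

spans = bio_to_spans(tags)
for etype, s, e in spans:
    print(etype, tokens[s:e])
```

With subword tokenizers, the tags usually need to be aligned back to whole words before this step. That alignment is left out here to keep the sketch short.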

🚀 BioM-Transformers: Building Large Biomedical Language Models with BERT, ALBERT and ELECTRA

This project empirically studies biomedical domain adaptation with large transformer models using different design choices. It achieves state-of-the-art results on several biomedical domain tasks at similar or lower computational cost.

🚀 Quick Start

The project focuses on investigating the impact of design choices on the performance of biomedical language models. It uses large transformer models for biomedical domain adaptation and evaluates the performance of pretrained models against existing ones in the literature.

✨ Features

Empirically studies biomedical domain adaptation using different design choices with large transformer models.
Achieves state-of-the-art results on several biomedical domain tasks with comparable or lower computational cost.
Provides examples for fine-tuning models on text classification, question-answering tasks, and more.

📚 Documentation

Abstract

The impact of design choices on the performance of biomedical language models recently has been a subject for investigation. In this paper, we empirically study biomedical domain adaptation with large transformer models using different design choices. We evaluate the performance of our pretrained models against other existing biomedical language models in the literature. Our results show that we achieve state-of-the-art results on several biomedical domain tasks despite using similar or less computational cost compared to other models in the literature. Our findings highlight the significant effect of design choices on improving the performance of biomedical language models.

Model Description

This model was pretrained on PubMed abstracts only, with a biomedical domain vocabulary, for 264K steps with a batch size of 8192 on a TPUv3-512 unit. To help researchers with limited resources fine-tune larger models, we created an example with PyTorch XLA. PyTorch XLA (https://github.com/pytorch/xla) is a library that allows you to use PyTorch on TPU units, which Google Colab and Kaggle provide for free. Follow this example to work with PyTorch/XLA: [Link](https://github.com/salrowili/BioM-Transformers/blob/main/examples/Fine_Tuning_Biomedical_Models_on_Text_Classification_Task_With_HuggingFace_Transformers_and_PyTorch_XLA.ipynb)
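To give a feel for the pretraining scale described above, the following back-of-the-envelope sketch multiplies out the stated step count and batch size. It assumes that "TPUv3-512" refers to a slice with 512 cores and that the global batch is split evenly across them. The per-core figure is an inference from that assumption, not a number reported in the source.

```python
# Figures stated in the model description.
PRETRAIN_STEPS = 264_000    # "264K steps"
GLOBAL_BATCH_SIZE = 8192    # sequences per optimizer step

# Assumption: "TPUv3-512" denotes a slice with 512 cores.
TPU_CORES = 512


def sequences_processed(steps: int, batch_size: int) -> int:
    """Total number of training sequences consumed over the whole run."""
    return steps * batch_size


def per_core_batch(batch_size: int, cores: int) -> int:
    """Sequences each core handles per step under even data-parallel sharding."""
    if batch_size % cores != 0:
        raise ValueError("global batch size must divide evenly across cores")
    return batch_size // cores


total = sequences_processed(PRETRAIN_STEPS, GLOBAL_BATCH_SIZE)
per_core = per_core_batch(GLOBAL_BATCH_SIZE, TPU_CORES)

print(f"sequences processed: {total:,}")   # 2,162,688,000
print(f"per-core batch size: {per_core}")  # 16
```

The sequence length is not stated in the description, so the sketch stops at sequence counts rather than estimating tokens.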

Check our GitHub repo at https://github.com/salrowili/BioM-Transformers for TensorFlow and GluonNLP checkpoints. We also updated this repo with a couple of examples on how to fine-tune LMs on text classification and question answering tasks such as ChemProt, SQuAD, and BioASQ.
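For the question answering tasks mentioned above, an extractive QA head usually produces a start score and an end score per token, and a post-processing step picks the answer span. The sketch below shows a generic version of that step with a "no answer" option, as needed for SQuAD2.0-style data. It is not taken from the repository's notebooks. The function name `best_span`, the convention that index 0 is the `[CLS]` position, and the parameters `max_answer_len` and `null_threshold` are assumptions made for illustration.

```python
from typing import List, Tuple


def best_span(
    start_logits: List[float],
    end_logits: List[float],
    max_answer_len: int = 30,
    null_threshold: float = 0.0,
) -> Tuple[int, int]:
    """Return (start, end) token indices of the best span, or (0, 0) for "no answer".

    Assumption: index 0 is the [CLS] position, and its combined score
    represents the "unanswerable" prediction.
    """
    null_score = start_logits[0] + end_logits[0]
    best_score = float("-inf")
    best = (0, 0)
    # Score every valid span with start <= end and bounded length.
    for s in range(1, len(start_logits)):
        for e in range(s, min(s + max_answer_len, len(end_logits))):
            score = start_logits[s] + end_logits[e]
            if score > best_score:
                best_score, best = score, (s, e)
    # Predict "no answer" when the null score beats the best span by the threshold.
    if null_score - best_score > null_threshold:
        return (0, 0)
    return best


# Made-up logits for illustration only.
answerable = best_span([0.1, 0.2, 3.0, 0.5, 0.1], [0.2, 0.1, 0.4, 2.5, 0.3])
unanswerable = best_span([4.0, 0.2, 0.3], [4.0, 0.1, 0.2])
print(answerable)    # (2, 3)
print(unanswerable)  # (0, 0)
```

In practice `null_threshold` would be tuned on a development set, and the span indices would be mapped back to character offsets in the passage.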

Colab Notebook Examples

BioM-ELECTRA-Large on NER and ChemProt Task [![Open In Colab][COLAB]](https://colab.research.google.com/github/salrowili/BioM-Transformers/blob/main/examples/Example_of_NER_and_ChemProt_Task_on_TPU.ipynb)
BioM-ELECTRA-Large on SQuAD2.0 and BioASQ7B Factoid tasks [![Open In Colab][COLAB]](https://colab.research.google.com/github/salrowili/BioM-Transformers/blob/main/examples/Example_of_SQuAD2_0_and_BioASQ7B_tasks_with_BioM_ELECTRA_Large_on_TPU.ipynb)
BioM-ALBERT-xxlarge on SQuAD2.0 and BioASQ7B Factoid tasks [![Open In Colab][COLAB]](https://colab.research.google.com/github/salrowili/BioM-Transformers/blob/main/examples/Example_of_SQuAD2_0_and_BioASQ7B_tasks_with_BioM_ALBERT_xxlarge_on_TPU.ipynb)
Text Classification Task With HuggingFace Transformers and PyTorch XLA on Free TPU [![Open In Colab][COLAB]](https://colab.research.google.com/github/salrowili/BioM-Transformers/blob/main/examples/Fine_Tuning_Biomedical_Models_on_Text_Classification_Task_With_HuggingFace_Transformers_and_PyTorch_XLA.ipynb)
Reproducing our BLURB results with JAX [![Open In Colab][COLAB]](https://colab.research.google.com/github/salrowili/BioM-Transformers/blob/main/examples/BLURB_LeaderBoard_with_TPU_VM.ipynb)
Fine-tuning BioM-Transformers with JAX/Flax on TPUv3-8 with free Kaggle resources [![Open In Colab][COLAB]](https://www.kaggle.com/code/sultanalrowili/biom-transoformers-with-flax-on-tpu-with-kaggle)

[COLAB]: https://colab.research.google.com/assets/colab-badge.svg


Acknowledgment

We would like to acknowledge the support of the TensorFlow Research Cloud (TFRC) team, which granted us access to TPUv3 units.

Citation

@inproceedings{alrowili-shanker-2021-biom,
title = "{B}io{M}-Transformers: Building Large Biomedical Language Models with {BERT}, {ALBERT} and {ELECTRA}",
author = "Alrowili, Sultan and
Shanker, Vijay",
booktitle = "Proceedings of the 20th Workshop on Biomedical Language Processing",
month = jun,
year = "2021",
address = "Online",
publisher = "Association for Computational Linguistics",
url = "https://www.aclweb.org/anthology/2021.bionlp-1.24",
pages = "221--227",
abstract = "The impact of design choices on the performance of biomedical language models recently has been a subject for investigation. In this paper, we empirically study biomedical domain adaptation with large transformer models using different design choices. We evaluate the performance of our pretrained models against other existing biomedical language models in the literature. Our results show that we achieve state-of-the-art results on several biomedical domain tasks despite using similar or less computational cost compared to other models in the literature. Our findings highlight the significant effect of design choices on improving the performance of biomedical language models.",
}
