---
license: apache-2.0
datasets:
- FreedomIntelligence/ApolloMoEDataset
language:
- ar
- en
- zh
- ko
- ja
- mn
- th
- vi
- lo
- mg
- de
- pt
- es
- fr
- ru
- it
- hr
- gl
- cs
- co
- la
- uk
- bs
- bg
- eo
- sq
- da
- sa
- gn
- sr
- sk
- gd
- lb
- hi
- ku
- mt
- he
- ln
- bm
- sw
- ig
- rw
- ha
metrics:
- accuracy
base_model:
- FreedomIntelligence/Apollo2-7B
pipeline_tag: question-answering
tags:
- biology
- medical
---
# Apollo2-7B-GGUF
Original model: [Apollo2-7B](https://huggingface.co/FreedomIntelligence/Apollo2-7B)
Made by: [FreedomIntelligence](https://huggingface.co/FreedomIntelligence)
## Quantization notes
Made with llama.cpp b3938 using an imatrix file based on the Exllamav2 calibration dataset.
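For reference, the sketch below shows a typical llama.cpp imatrix quantization workflow of the kind described above. It is illustrative only: the filenames and `calibration.txt` are hypothetical placeholders, not the exact commands used for this upload.

```bash
# A typical llama.cpp imatrix quantization workflow (sketch).
# Filenames and calibration.txt are hypothetical placeholders.

# 1. Compute an importance matrix from calibration text.
./llama-imatrix -m Apollo2-7B-F16.gguf -f calibration.txt -o imatrix.dat

# 2. Quantize the full-precision GGUF using the importance matrix.
./llama-quantize --imatrix imatrix.dat Apollo2-7B-F16.gguf Apollo2-7B-Q4_K_M.gguf Q4_K_M
```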
This model is meant to run with llama.cpp-compatible apps such as Text-Generation-WebUI, KoboldCpp, Jan, LM Studio, and many others.
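For example, a minimal command-line run with llama.cpp might look like the sketch below. The GGUF filename is hypothetical, so substitute whichever quant file you downloaded; the prompt follows the Apollo2-7B usage format shown in the original model card further down.

```bash
# Minimal sketch: one-shot generation with llama.cpp's llama-cli.
# -e expands the \n escape in the prompt; -r stops generation when the
# model starts a new "User:" turn; the model filename is a placeholder.
./llama-cli -m Apollo2-7B.Q4_K_M.gguf \
  -e -p 'User:What are common causes of iron-deficiency anemia?\nAssistant:' \
  -r 'User:' -n 256
```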
17.12.2024: Readme update. Q4_0_4_4, Q4_0_4_8 and Q4_0_8_8 support appears to have been removed in recent llama.cpp versions. I'll keep these files, but they may no longer be useful.
03.02.2025: Added Q4_0 and IQ4_NL quants as substitutes for the Q4_0_X_Y quants on ARM devices with newer llama.cpp versions.
## Original model card
# Democratizing Medical LLMs For Much More Languages
Covering 12 major languages (English, Chinese, French, Hindi, Spanish, Arabic, Russian, Japanese, Korean, German, Italian, Portuguese) and 38 minor languages so far.
📃 Paper • 🌐 Demo • 🤗 ApolloMoEDataset • 🤗 ApolloMoEBench • 🤗 Models • 🌐 Apollo • 🌐 ApolloMoE

## 🌈 Update
- [2024.10.15] ApolloMoE repo is published!🎉
## Languages Coverage
12 major languages and 38 minor languages.
## Architecture
*(Figure: MoE routing)*
## Results
### Dense
🤗 Apollo2-0.5B • 🤗 Apollo2-1.5B • 🤗 Apollo2-2B
🤗 Apollo2-3.8B • 🤗 Apollo2-7B • 🤗 Apollo2-9B
### Post-MoE
🤗 Apollo-MoE-0.5B • 🤗 Apollo-MoE-1.5B • 🤗 Apollo-MoE-7B
## Usage Format
### Apollo2
- 0.5B, 1.5B, 7B: `User:{query}\nAssistant:{response}<|endoftext|>`
- 2B, 9B: `User:{query}\nAssistant:{response}<eos>`
- 3.8B: `<|user|>\n{query}<|end|><|assistant|>\n{response}<|end|>`
### Apollo-MoE
- 0.5B, 1.5B, 7B: `User:{query}\nAssistant:{response}<|endoftext|>`
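To make the templates concrete, here is a small shell sketch that builds two of the prompt strings above; the query text is just an illustrative example.

```bash
# Sketch: constructing Apollo2 prompt strings for two of the formats above.
# $'...' makes the shell expand \n into a real newline character.
q='What is the mechanism of action of metformin?'

# 0.5B / 1.5B / 7B and Apollo-MoE format; generation stops at <|endoftext|>.
prompt_main="User:${q}"$'\n'"Assistant:"

# 3.8B format; generation stops at <|end|>.
prompt_38b=$'<|user|>\n'"${q}"$'<|end|><|assistant|>\n'

printf '%s\n' "$prompt_main"
```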
## Dataset & Evaluation
## Results reproduction
We take Apollo2-7B or Apollo-MoE-0.5B as an example:
1. Download the dataset for the project:
   ```bash
   bash 0.download_data.sh
   ```
2. Prepare the test and dev data for the specific model (creates test data with the model's special tokens):
   ```bash
   bash "1.data_process_test&dev.sh"
   ```
3. Prepare the train data for the specific model (creates tokenized data in advance). You can adjust the training data order and the number of training epochs in this step:
   ```bash
   bash 2.data_process_train.sh
   ```
4. Train the model. If you want to train on multiple nodes, refer to `./src/sft/training_config/zero_multi.yaml`:
   ```bash
   bash 3.single_node_train.sh
   ```
5. Evaluate your model (generates scores for the benchmark):
   ```bash
   bash 4.eval.sh
   ```
## Citation
Please use the following citation if you intend to use our dataset for training or evaluation:
```bibtex
@misc{zheng2024efficientlydemocratizingmedicalllms,
      title={Efficiently Democratizing Medical LLMs for 50 Languages via a Mixture of Language Family Experts},
      author={Guorui Zheng and Xidong Wang and Juhao Liang and Nuo Chen and Yuping Zheng and Benyou Wang},
      year={2024},
      eprint={2410.10626},
      archivePrefix={arXiv},
      primaryClass={cs.CL},
      url={https://arxiv.org/abs/2410.10626},
}
```