# MedAlpaca 7b

A large language model fine-tuned for medical domain tasks.
## Documentation

### Model Description
#### Architecture

`medalpaca-7b` is a large language model specifically fine-tuned for medical domain tasks. It is based on LLaMA (Large Language Model Meta AI) and contains 7 billion parameters. The primary goal of this model is to improve question-answering and medical dialogue tasks.
#### Training Data

The training data for this project was sourced from several resources:

- First, we used Anki flashcards to automatically generate questions from the front of each card and answers from the back (a minimal formatting sketch follows the table below).
- Second, we generated medical question-answer pairs from Wikidoc. We extracted paragraphs with relevant headings, used ChatGPT (GPT-3.5) to generate questions from the headings, and used the corresponding paragraphs as answers. This dataset is still under development, and we believe that approximately 70% of these question-answer pairs are factually correct.
- Third, we used StackExchange to extract question-answer pairs, taking the top-rated questions from five categories: Academia, Bioinformatics, Biology, Fitness, and Health.
- Additionally, we used a dataset from ChatDoctor consisting of 200,000 question-answer pairs, available at https://github.com/Kent0n-Li/ChatDoctor.
| Source | Question-Answer Pairs |
|---|---|
| ChatDoctor (large) | 200,000 |
| Wikidoc | 67,704 |
| StackExchange academia | 40,865 |
| Anki flashcards | 33,955 |
| StackExchange biology | 27,887 |
| StackExchange fitness | 9,833 |
| StackExchange health | 7,721 |
| Wikidoc patient information | 5,942 |
| StackExchange bioinformatics | 5,407 |
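As a concrete illustration of the flashcard-derived data, here is a minimal sketch of how a front/back card pair can be turned into an instruction-style training record. The field names, file name, and example cards are assumptions for illustration, not the project's actual preprocessing pipeline.

```python
# Illustrative sketch only: converts (front, back) flashcard text into
# instruction-style question-answer records. The schema and examples below
# are assumptions, not the project's actual preprocessing code.
import json

flashcards = [
    ("What hormone is deficient in type 1 diabetes?", "Insulin."),
    ("Name the three classic symptoms of diabetes.",
     "Polyuria, polydipsia, and polyphagia."),
]

records = [
    {"instruction": front, "input": "", "output": back}
    for front, back in flashcards
]

with open("anki_qa_pairs.jsonl", "w") as f:
    for record in records:
        f.write(json.dumps(record) + "\n")
```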
## Usage Examples

### Model Usage
To evaluate the performance of the model on a specific dataset, you can use the Hugging Face Transformers library's built-in evaluation scripts. Please refer to the evaluation guide for more information.
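For a quick ad-hoc check outside those scripts, one option is to score generated answers against reference answers with a text-overlap metric. The snippet below is a minimal sketch assuming the Hugging Face `evaluate` library (plus the `rouge_score` package); it is not the official evaluation procedure for this model, and the example answers are invented.

```python
# Minimal ad-hoc scoring sketch (an assumption, not this model's official
# evaluation script): compare model outputs to reference answers with ROUGE.
# Requires: pip install evaluate rouge_score
import evaluate

rouge = evaluate.load("rouge")

predictions = ["Increased thirst, frequent urination, and unexplained weight loss."]
references = ["The symptoms of diabetes include increased thirst, frequent urination, and unexplained weight loss."]

results = rouge.compute(predictions=predictions, references=references)
print(results)  # dict of ROUGE scores, e.g. {'rouge1': ..., 'rougeL': ...}
```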
You can use the model for inference tasks like question answering and medical dialogues using the Hugging Face Transformers library. Here's an example of how to use the model for a question-answering task:
```python
from transformers import pipeline

# Load the model and its tokenizer into a text-generation pipeline.
pl = pipeline("text-generation", model="medalpaca/medalpaca-7b", tokenizer="medalpaca/medalpaca-7b")

question = "What are the symptoms of diabetes?"
context = "Diabetes is a metabolic disease that causes high blood sugar. The symptoms include increased thirst, frequent urination, and unexplained weight loss."

# Prompt the model with the context and question; it completes the text after "Answer:".
answer = pl(f"Context: {context}\n\nQuestion: {question}\n\nAnswer: ")
print(answer)
```
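The pipeline returns a list of dictionaries whose `generated_text` field contains the prompt followed by the model's continuation. Generation keyword arguments such as `max_new_tokens` can be passed to the pipeline call to control the length of the answer.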
## Limitations

- The model may not perform effectively outside the scope of the medical domain.
- The training data primarily targets the knowledge level of medical students, which may result in limitations when addressing the needs of board-certified physicians.
- The model has not been tested in real-world applications, so its efficacy and accuracy are currently unknown. It should never be used as a substitute for a doctor's opinion and must be treated as a research tool only.
## Open LLM Leaderboard Evaluation Results

Detailed results can be found [here](https://huggingface.co/datasets/open-llm-leaderboard/details_medalpaca__medalpaca-7b).
| Metric | Value |
|---|---|
| Avg. | 44.98 |
| ARC (25-shot) | 54.1 |
| HellaSwag (10-shot) | 80.42 |
| MMLU (5-shot) | 41.47 |
| TruthfulQA (0-shot) | 40.46 |
| Winogrande (5-shot) | 71.19 |
| GSM8K (5-shot) | 3.03 |
| DROP (3-shot) | 24.21 |
## License

The model is released under a Creative Commons (CC) license.