MedGuanaco-65B-GPTQ Open-Source Medical Large Model - Enhancing Performance in Medical Q&A and Conversation Tasks

Medguanaco 65b GPTQ

Developed by nmitchko

A large language model LoRA fine-tuned specifically for medical domain tasks, based on the 65B-parameter LLaMA Guanaco LoRA, to improve performance on Q&A and medical dialogue tasks.

Large Language Model

Transformers

English

Open Source License: CC

#Medical Q&A #LoRA Fine-tuning #65B Large Parameters

Downloads 49

Release Time: 6/7/2023

Model Overview

A language model optimized for the medical field, primarily used for medical Q&A and dialogue tasks. It was fine-tuned with LoRA and reduced to 8-bit to lower memory usage.

Model Features

Medical Domain Optimization

Fine-tuned specifically for medical Q&A and dialogue tasks, improving performance in the medical field

Efficient Training Technology

Fine-tuned using LoRA technology and compressed to 8-bit to reduce memory usage

Large-Scale Parameters

Based on the 65B-parameter LLaMA Guanaco LoRA model, inheriting its general language understanding capabilities

Model Capabilities

Medical Q&A

Medical Dialogue Generation

Medical Knowledge Retrieval

Use Cases

Medical Education

Medical Student Q&A

Answering medical students' questions about basic medical knowledge

Medical Information Consultation

Patient Health Consultation

Providing basic medical and health information consultation

🚀 Medguanaco 65b

A large language model LoRA specifically fine-tuned for medical domain tasks, aiming to improve question-answering and medical dialogue.

🚀 Quick Start

The Medguanaco 65b model is designed for medical domain tasks. Here are the steps to load this model:

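import torch
from transformers import LlamaForCausalLM

# Some LLaMA or Alpaca 65B model; here, the MedGuanaco checkpoint
base_model = "nmitchko/medguanaco-65b-GPTQ"
load_8bit = True  # load weights in 8-bit (needs bitsandbytes); set to False for plain float16

model = LlamaForCausalLM.from_pretrained(
    base_model,
    load_in_8bit=load_8bit,
    torch_dtype=torch.float16,
)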
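Once the model is loaded, a question can be sent to it roughly as sketched below. This is a minimal sketch, not the author's documented usage: the `### Human:` / `### Assistant:` prompt template is an assumption carried over from the base Guanaco models, and `build_prompt` and `generate_answer` are hypothetical helper names. `model` and `tokenizer` are expected to be a loaded causal LM and its matching Hugging Face tokenizer.

```python
def build_prompt(question: str) -> str:
    """Wrap a question in a Guanaco-style template (assumed, not confirmed for this model)."""
    return f"### Human: {question.strip()}\n### Assistant:"


def generate_answer(model, tokenizer, question: str, max_new_tokens: int = 256) -> str:
    """Generate a greedy completion and return only the newly generated text."""
    prompt = build_prompt(question)
    inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
    output_ids = model.generate(**inputs, max_new_tokens=max_new_tokens, do_sample=False)
    # Drop the prompt tokens so only the model's answer is decoded.
    new_tokens = output_ids[0][inputs["input_ids"].shape[1]:]
    return tokenizer.decode(new_tokens, skip_special_tokens=True).strip()
```

Assuming the repository ships tokenizer files, a tokenizer could be loaded with `AutoTokenizer.from_pretrained(base_model)`, after which `generate_answer(model, tokenizer, "What are common causes of iron-deficiency anemia?")` would return the reply as a string.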

✨ Features

Specifically fine-tuned for medical domain tasks, improving question-answering and medical dialogue.
Based on the Guanaco LoRA of LLaMA with 65B parameters.
Trained using LoRA and reduced to 8-bit to reduce memory footprint.

📚 Documentation

Architecture

nmitchko/medguanaco-65b-GPTQ is a large language model LoRA specifically fine-tuned for medical domain tasks. It is based on the Guanaco LoRA of LLaMA, weighing in at 65B parameters. The primary goal of this model is to improve question-answering and medical dialogue tasks. It was trained using LoRA and reduced to 8-bit to reduce memory footprint.
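To see why reduced precision matters at this scale, the following back-of-the-envelope sketch estimates the memory needed just to hold 65B weights at two bit widths. It counts weights only and ignores activations, the KV cache, and any quantization metadata, so real usage will be higher. `weight_memory_gb` is a hypothetical helper, not part of any library.

```python
def weight_memory_gb(n_params: float, bits_per_param: int) -> float:
    """Approximate memory for model weights alone, in gigabytes (1 GB = 1e9 bytes)."""
    return n_params * bits_per_param / 8 / 1e9


N_PARAMS = 65e9  # 65B parameters, as stated for the base model

fp16_gb = weight_memory_gb(N_PARAMS, 16)  # 16-bit weights
int8_gb = weight_memory_gb(N_PARAMS, 8)   # 8-bit weights
print(f"fp16: {fp16_gb:.0f} GB, 8-bit: {int8_gb:.0f} GB")
```

Under these assumptions, halving the bit width halves the weight storage, which is the motivation for the 8-bit reduction described above.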

Training Data

The training data for this project was sourced from various resources. Firstly, we used Anki flashcards to automatically generate questions from the front of the cards and answers from the back of the cards. Secondly, we generated medical question-answer pairs from Wikidoc. We extracted paragraphs with relevant headings, and used ChatGPT 3.5 to generate questions from the headings, using the corresponding paragraphs as answers. This dataset is still under development, and we believe that approximately 70% of these question-answer pairs are factually correct. Thirdly, we used StackExchange to extract question-answer pairs, taking the top-rated questions from five categories: Academia, Bioinformatics, Biology, Fitness, and Health. Additionally, we used a dataset from ChatDoctor consisting of 200,000 question-answer pairs, available at https://github.com/Kent0n-Li/ChatDoctor.
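The flashcard step can be pictured with a small sketch. It assumes cards are already available as `(front, back)` text tuples; the `QAPair` record and the `flashcards_to_qa` function are hypothetical names for illustration and do not reflect the authors' actual pipeline, which may involve further rewriting of the card text.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class QAPair:
    question: str
    answer: str
    source: str


def flashcards_to_qa(cards, source="anki"):
    """Map (front, back) flashcard tuples to QAPair records, skipping cards with an empty side."""
    pairs = []
    for front, back in cards:
        question, answer = front.strip(), back.strip()
        if question and answer:
            pairs.append(QAPair(question, answer, source))
    return pairs


example_cards = [
    ("Which vitamin deficiency causes scurvy? ", " Vitamin C"),
    ("", "Orphaned answer with no question"),
]
example_pairs = flashcards_to_qa(example_cards)
```

Here the second card is dropped because its front is empty, leaving a single question-answer pair.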

Source	n items
ChatDoc large	200000
wikidoc	67704
Stackexchange academia	40865
Anki flashcards	33955
Stackexchange biology	27887
Stackexchange fitness	9833
Stackexchange health	7721
Wikidoc patient information	5942
Stackexchange bioinformatics	5407
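
As a quick check on the mix, the counts in the table can be totaled and converted to shares. The numbers below are copied from the table; the variable names are illustrative only.

```python
dataset_sizes = {
    "ChatDoc large": 200000,
    "wikidoc": 67704,
    "Stackexchange academia": 40865,
    "Anki flashcards": 33955,
    "Stackexchange biology": 27887,
    "Stackexchange fitness": 9833,
    "Stackexchange health": 7721,
    "Wikidoc patient information": 5942,
    "Stackexchange bioinformatics": 5407,
}

total_items = sum(dataset_sizes.values())
shares = {name: n / total_items for name, n in dataset_sizes.items()}

# Print sources from largest to smallest with their share of the total.
for name, n in sorted(dataset_sizes.items(), key=lambda kv: kv[1], reverse=True):
    print(f"{name:30s} {n:>7d} {shares[name]:6.1%}")
print(f"{'total':30s} {total_items:>7d}")
```

The total comes to 399,314 items, of which the ChatDoctor set makes up roughly half.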

🔧 Technical Details

The model is based on the Guanaco LoRA of LLaMA with 65B parameters. It uses the LoRA technique for training and is reduced to 8-bit to save memory. The training data is a combination of data from Anki flashcards, Wikidoc, StackExchange, and ChatDoctor.

📄 License

The model is licensed under CC.

⚠️ Important Note

The model may not perform effectively outside the scope of the medical domain. The training data primarily targets the knowledge level of medical students, which may result in limitations when addressing the needs of board-certified physicians. The model has not been tested in real-world applications, so its efficacy and accuracy are currently unknown. It should never be used as a substitute for a doctor's opinion and must be treated as a research tool only.
