L

Llava Med 7b Delta

Developed by microsoft
LLaVA-Med is a biomedical multimodal model constructed through visual instruction fine-tuning, capable of processing biomedical images and text.
Downloads 257
Release Time : 11/9/2023

Model Overview

LLaVA-Med is a biomedical vision-language model initialized from LLaVA, fine-tuned on biomedical data through curriculum learning, focusing on biomedical visual question answering and dialogue tasks.

Model Features

Biomedical Domain Adaptation
Optimized specifically for the biomedical domain through curriculum learning
Multimodal Capability
Simultaneously processes biomedical images and related textual information
Research Only
Focused on biomedical research applications, not suitable for clinical decision-making

Model Capabilities

Biomedical Image Understanding
Biomedical Text Understanding
Visual Question Answering
Multimodal Dialogue

Use Cases

Medical Research
Biomedical Literature Analysis
Analyzing charts and textual content in medical literature
Performs excellently on benchmarks like PathVQA and VQA-RAD
Medical Education
Assisting in understanding visual content for medical education
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase