# MedSAM Model Card
MedSAM is a fine-tuned version of SAM (Segment Anything Model) tailored to the medical domain. This repository builds upon the paper, code, and pre-trained model released by the authors in July 2023.
## 🚀 Quick Start
MedSAM is a promptable model for medical image segmentation that can be integrated into your projects with just a few lines of code.
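For example, the model and processor can be loaded directly from the Hugging Face Hub:

```python
import torch
from transformers import SamModel, SamProcessor

device = "cuda" if torch.cuda.is_available() else "cpu"
model = SamModel.from_pretrained("flaviagiammarino/medsam-vit-base").to(device)
processor = SamProcessor.from_pretrained("flaviagiammarino/medsam-vit-base")
```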
## ✨ Features
- Domain-Specific Tuning: Fine-tuned for the medical domain, making it highly effective for medical image segmentation tasks.
- Large-Scale Training: Trained on a large-scale medical image segmentation dataset of 1,090,486 image-mask pairs from diverse public sources.
- Broad Coverage: The dataset covers 15 imaging modalities and over 30 cancer types.
## 📦 Installation
The original release does not provide dedicated installation steps. The usage example below only requires `transformers`, `torch`, and a few common Python packages (`numpy`, `matplotlib`, `Pillow`, `requests`).
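As a minimal setup sketch, assuming a standard pip environment, the dependencies can be installed with:

```bash
pip install transformers torch numpy matplotlib pillow requests
```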
## 💻 Usage Examples

### Basic Usage
```python
import requests
import numpy as np
import matplotlib.pyplot as plt
import torch
from PIL import Image
from transformers import SamModel, SamProcessor

# load the model and processor from the Hugging Face Hub
device = "cuda" if torch.cuda.is_available() else "cpu"
model = SamModel.from_pretrained("flaviagiammarino/medsam-vit-base").to(device)
processor = SamProcessor.from_pretrained("flaviagiammarino/medsam-vit-base")

# fetch the example image and define a bounding-box prompt
# in (x_min, y_min, x_max, y_max) pixel coordinates
img_url = "https://huggingface.co/flaviagiammarino/medsam-vit-base/resolve/main/scripts/input.png"
raw_image = Image.open(requests.get(img_url, stream=True).raw).convert("RGB")
input_boxes = [95., 255., 190., 350.]

# run inference; the nested list gives one image with one box
inputs = processor(raw_image, input_boxes=[[input_boxes]], return_tensors="pt").to(device)
with torch.no_grad():
    outputs = model(**inputs, multimask_output=False)

# upscale the predicted mask logits to the original image size
probs = processor.image_processor.post_process_masks(
    outputs.pred_masks.sigmoid().cpu(),
    inputs["original_sizes"].cpu(),
    inputs["reshaped_input_sizes"].cpu(),
    binarize=False,
)

def show_mask(mask, ax, random_color):
    """Overlay a binary mask on a matplotlib axis."""
    if random_color:
        color = np.concatenate([np.random.random(3), np.array([0.6])], axis=0)
    else:
        color = np.array([251 / 255, 252 / 255, 30 / 255, 0.6])
    h, w = mask.shape[-2:]
    # convert to numpy in case a torch tensor is passed
    mask_image = np.asarray(mask).reshape(h, w, 1) * color.reshape(1, 1, -1)
    ax.imshow(mask_image)

def show_box(box, ax):
    """Draw an (x_min, y_min, x_max, y_max) box on a matplotlib axis."""
    x0, y0 = box[0], box[1]
    w, h = box[2] - box[0], box[3] - box[1]
    ax.add_patch(plt.Rectangle((x0, y0), w, h, edgecolor="blue", facecolor=(0, 0, 0, 0), lw=2))

# plot the input image with the box prompt next to the predicted mask
fig, ax = plt.subplots(1, 2, figsize=(10, 5))
ax[0].imshow(np.array(raw_image))
show_box(input_boxes, ax[0])
ax[0].set_title("Input Image and Bounding Box")
ax[0].axis("off")
ax[1].imshow(np.array(raw_image))
show_mask(mask=probs[0] > 0.5, ax=ax[1], random_color=False)
show_box(input_boxes, ax[1])
ax[1].set_title("MedSAM Segmentation")
ax[1].axis("off")
plt.show()
```
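`SamProcessor` also accepts several bounding boxes for the same image, producing one mask per box in a single forward pass. A minimal sketch building on the variables above (the second box's coordinates are hypothetical):

```python
# one image in the batch, two boxes for that image (second box is hypothetical)
boxes = [[95., 255., 190., 350.], [30., 80., 150., 200.]]

inputs = processor(raw_image, input_boxes=[boxes], return_tensors="pt").to(device)
with torch.no_grad():
    outputs = model(**inputs, multimask_output=False)

# probs[0] has shape (num_boxes, 1, H, W): one probability map per box
probs = processor.image_processor.post_process_masks(
    outputs.pred_masks.sigmoid().cpu(),
    inputs["original_sizes"].cpu(),
    inputs["reshaped_input_sizes"].cpu(),
    binarize=False,
)
```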

## 📚 Documentation

### Model Description
MedSAM was trained on a large-scale medical image segmentation dataset of 1,090,486 image-mask pairs collected from different publicly available sources. The image-mask pairs cover 15 imaging modalities and over 30 cancer types.
MedSAM was initialized from the pre-trained SAM model with the ViT-Base backbone. The prompt encoder weights were frozen, while the image encoder and mask decoder weights were updated during training. Training ran for 100 epochs with a batch size of 160, using the AdamW optimizer with a learning rate of 10⁻⁴ and a weight decay of 0.01.
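As an illustrative sketch rather than the authors' training code, the setup described above (frozen prompt encoder, trainable image encoder and mask decoder, AdamW with a 10⁻⁴ learning rate and 0.01 weight decay) maps onto the Hugging Face `SamModel` layout roughly as follows:

```python
import torch
from transformers import SamModel

# start from the pre-trained SAM ViT-Base checkpoint
model = SamModel.from_pretrained("facebook/sam-vit-base")

# freeze the prompt encoder; leave the vision encoder and mask decoder trainable
for param in model.prompt_encoder.parameters():
    param.requires_grad = False

# AdamW over the trainable parameters, with the hyperparameters reported above
trainable_params = [p for p in model.parameters() if p.requires_grad]
optimizer = torch.optim.AdamW(trainable_params, lr=1e-4, weight_decay=0.01)
```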
### Additional Information

#### Licensing Information
The authors have released the model code and pre-trained checkpoint under the Apache License 2.0.
#### Citation Information
```bibtex
@article{ma2023segment,
  title={Segment anything in medical images},
  author={Ma, Jun and Wang, Bo},
  journal={arXiv preprint arXiv:2304.12306},
  year={2023}
}
```
## 📄 License
The model code and pre-trained checkpoint are released under the Apache License 2.0.