Virchow Open Source Medical Image Model - Free for Slide-level Feature Extraction and Pathology Task Analysis

Virchow

Developed by paige-ai

Virchow is a self-supervised vision Transformer pretrained on 1.5 million whole-slide histopathology images, serving as a slide-level feature extractor for computational pathology downstream tasks.

Image Classification Open Source License:Apache-2.0 #Pathological Feature Extraction #Self-supervised ViT #Medical Image Analysis

Downloads 5,121

Release Time : 6/5/2024

Model Overview

Virchow is a vision Transformer model for computational pathology, pretrained through self-supervised learning on 1.5 million whole-slide histopathology images, and can be used as a feature extractor for various downstream tasks.

Model Features

Large-scale Pretraining

Self-supervised pretraining on 1.5 million whole-slide histopathology images, featuring powerful feature extraction capabilities.

Advanced Architecture

Utilizes ViT-H/14 architecture with 32 layers, 1280-dimensional embeddings, 16 attention heads, and SwiGLU activation function.

Medical Specialization

Specifically optimized for histopathology images, achieving state-of-the-art performance in computational pathology tasks.

Flexible Application

Can be used as either a frozen feature extractor or fine-tuned model, supporting various combinations of classification tokens and patch tokens.

Model Capabilities

Histopathology image feature extraction

Whole-slide image analysis

Computational pathology task support

Medical image embedding generation

Use Cases

Medical Research

Cancer Detection

Used as a foundational model for rare cancer detection research

Achieved clinical-grade performance in studies published in Nature Medicine

Pathology Classification

Used to build slide-level or whole-slide level classifiers

Academic Research

Computational Pathology Research

Used as a pretrained model for various computational pathology downstream tasks

🚀 Virchow - Image Feature Extractor

✨ Virchow is a self - supervised vision transformer. Pretrained with 1.5M whole slide histopathology images, it serves as a powerful tile - level feature extractor. Whether frozen or finetuned, it can achieve state - of - the - art results in various downstream computational pathology use cases.

⚠️ Important Note

🎉 Check out the latest version of Virchow here: Virchow2

🚀 Quick Start

Requirements

PyTorch (2.0+ recommended)
timm (>= 0.9.11 required)
huggingface_hub

Login

After gaining access to the model, you need to log in to HuggingFace in the environment where you want to use the model. You can do this from the command line:

huggingface-cli login

or in your Python code:

from huggingface_hub import login

login()

Please refer to the official HuggingFace documentation for more details.

✨ Features

Powerful Feature Extraction: Can be used as a tile - level feature extractor, either frozen or finetuned, to achieve excellent results in various computational pathology scenarios.
Self - Supervised Learning: Pretrained using 1.5M whole slide histopathology images, enabling it to learn rich visual features.

📦 Installation

The installation mainly involves installing the required libraries. You can use pip to install them:

pip install torch timm huggingface_hub

💻 Usage Examples

Basic Usage

import timm
import torch
from timm.data import resolve_data_config
from timm.data.transforms_factory import create_transform
from timm.layers import SwiGLUPacked
from PIL import Image

# need to specify MLP layer and activation function for proper init
model = timm.create_model("hf - hub:paige - ai/Virchow", pretrained=True, mlp_layer=SwiGLUPacked, act_layer=torch.nn.SiLU)
model = model.eval()

transforms = create_transform(**resolve_data_config(model.pretrained_cfg, model=model))

image = Image.open("/path/to/your/image.png")
image = transforms(image).unsqueeze(0)  # size: 1 x 3 x 224 x 224

output = model(image)  # size: 1 x 257 x 1280

class_token = output[:, 0]    # size: 1 x 1280
patch_tokens = output[:, 1:]  # size: 1 x 256 x 1280

# concatenate class token and average pool of patch tokens
embedding = torch.cat([class_token, patch_tokens.mean(1)], dim=-1)  # size: 1 x 2560

Advanced Usage

We highly recommend running the model on a GPU in mixed precision (fp16) using torch.autocast:

model = model.to("cuda")
image = image.to("cuda")

with torch.inference_mode(), torch.autocast(device_type="cuda", dtype=torch.float16):
  output = model(image)

class_token = output[:, 0]
patch_tokens = output[:, 1:]

embedding = torch.cat([class_token, patch_tokens.mean(1)], dim=-1)

# the model output will be fp32 because the final operation is a LayerNorm that is ran in mixed precision
# optionally, you can convert the embedding to fp16 for efficiency in downstream use
embedding = embedding.to(torch.float16)

📚 Documentation

Model Details

Property	Details
Developed by	Paige, NYC, USA and Microsoft Research, Cambridge, MA USA
Model Type	Image feature backbone
Model Stats	Params (M): 632 Image size: 224 x 224
Model Architecture	Architecture: ViT - H/14 Patch size: 14 Layers: 32 Embedding dimension: 1280 Activation function: SwiGLU Attention heads: 16 LayerScale: true
Training Details	Precision: Mixed precision (`fp16`) Objective: DINOv2 (https://doi.org/10.48550/arXiv.2304.07193)
Paper	A foundation model for clinical - grade computational pathology and rare cancers detection: https://www.nature.com/articles/s41591 - 024 - 03141 - 0
Pretraining Dataset	Internal dataset of 1.5 million whole slide images from Memorial Sloan Kettering Cancer Center, tiles sampled at 0.5 microns per pixel resolution (20x magnification).
License	Apache 2.0

Use

Direct use

Virchow is intended to be used as a frozen feature extractor as the foundation for tile - level and whole slide - level classifiers.

Downstream use

Virchow can be finetuned to adapt to specific tasks and/or datasets.

📄 License

The Virchow Model and associated code are released under the Apache License, Version 2.0 (the "License"). You may obtain a copy of the License at:

http://www.apache.org/licenses/LICENSE - 2.0

Unless required by applicable law or agreed to in writing, software distributed under the License is distributed on an "AS IS" BASIS, WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. See the License for the specific language governing permissions and limitations under the License.

Additional Terms

Please note that the primary email used to sign up for your Hugging Face account must match your institutional email to receive approval. By downloading the Virchow Model, you attest that all information (affiliation, research use) is correct and up - to - date. Downloading the Virchow Model requires prior registration on Hugging Face and agreeing to the terms of use.

While the Apache 2.0 License grants broad permissions, we kindly request that users adhere to the following guidelines:

Attribution: We encourage proper attribution when using or redistributing the Virchow Model or its derivatives. Please include a reference to the original source and creators.
Responsible Use: Users are expected to use the Virchow Model responsibly and ethically. Please consider the potential impacts of your use on individuals and society.
Medical or Clinical Use: The Virchow Model is not intended for use in medical diagnosis, treatment, or prevention of disease of real patients. It should not be used as a substitute for professional medical advice.
Privacy and Data Protection: Users should respect privacy rights and comply with applicable data protection laws when using the Virchow Model.
No Malicious Use: The Virchow Model should not be used to create malicious code, malware, or to interfere with the proper functioning of computer systems.
Transparency: If you use the Virchow Model in a product or service, we encourage you to disclose this fact to your end users.
Feedback and Contributions: We welcome feedback and contributions to improve the Virchow Model. Please consider sharing your improvements with the community.

These additional terms are not intended to restrict your rights under the Apache 2.0 License but to promote responsible and ethical use of the Virchow Model.

By using the Virchow Model, you acknowledge that you have read and understood these terms.

📚 Citation

Please cite the following work if you used this model in your research.

Vorontsov, E., Bozkurt, A., Casson, A. et al. A foundation model for clinical - grade computational pathology and rare cancers detection. Nat Med (2024). https://doi.org/10.1038/s41591 - 024 - 03141 - 0

@article{vorontsov2024virchow,
  title={A foundation model for clinical - grade computational pathology and rare cancers detection},
  author={Vorontsov, Eugene and Bozkurt, Alican and Casson, Adam and Shaikovski, George and Zelechowski, Michal and Severson, Kristen and Zimmermann, Eric and Hall, James and Tenenholtz, Neil and Fusi, Nicolo and Yang, Ellen and Mathieu, Philippe and van Eck, Alexander and Lee, Donghun and Viret, Julian and Robert, Eric and Wang, Yi Kan and Kunz, Jeremy D. and Lee, Matthew C. H. and Bernhard, Jan H. and Godrich, Ran A. and Oakley, Gerard and Millar, Ewan and Hanna, Matthew and Wen, Hannah and Retamero, Juan A. and Moye, William A. and Yousfi, Razik and Kanan, Christopher and Klimstra, David S. and Rothrock, Brandon and Liu, Siqi and Fuchs, Thomas J.},
  journal={Nature Medicine},
  year={2024},
  publisher={Nature Publishing Group}
}

Featured Recommended AI Models

Empowering the Future, Your AI Solution Knowledge Base

English 简体中文繁體中文にほんご