🚀 SAM 2: Segment Anything in Images and Videos
SAM 2 is a foundation model developed by FAIR that tackles promptable visual segmentation in images and videos. For more information, see the SAM 2 paper.
The official code is publicly available in this repository.
🚀 Quick Start
💻 Usage Examples
Basic Usage
Here is a code example for image prediction:
import torch
from sam2.sam2_image_predictor import SAM2ImagePredictor

predictor = SAM2ImagePredictor.from_pretrained("facebook/sam2-hiera-small")

with torch.inference_mode(), torch.autocast("cuda", dtype=torch.bfloat16):
    predictor.set_image(<your_image>)
    masks, _, _ = predictor.predict(<input_prompts>)
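To make the placeholders concrete, here is a minimal sketch of a single-point prompt. The file name truck.jpg, the click coordinates, and the point_coords / point_labels / multimask_output keyword arguments are illustrative assumptions, not part of the snippet above:

import numpy as np
import torch
from PIL import Image
from sam2.sam2_image_predictor import SAM2ImagePredictor

predictor = SAM2ImagePredictor.from_pretrained("facebook/sam2-hiera-small")

# Assumption: the image is loaded as an HxWx3 uint8 RGB array.
image = np.array(Image.open("truck.jpg").convert("RGB"))

with torch.inference_mode(), torch.autocast("cuda", dtype=torch.bfloat16):
    predictor.set_image(image)
    # Assumption: one positive click at pixel (x=500, y=375); label 1 = foreground.
    masks, scores, _ = predictor.predict(
        point_coords=np.array([[500, 375]]),
        point_labels=np.array([1]),
        multimask_output=True,  # return several candidate masks with quality scores
    )

# masks has shape (num_masks, H, W); keep the highest-scoring candidate.
best_mask = masks[np.argmax(scores)]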
Advanced Usage
Here is a code example for video prediction:
import torch
from sam2.sam2_video_predictor import SAM2VideoPredictor

predictor = SAM2VideoPredictor.from_pretrained("facebook/sam2-hiera-small")

with torch.inference_mode(), torch.autocast("cuda", dtype=torch.bfloat16):
    state = predictor.init_state(<your_video>)

    # Add new prompts and instantly get the output on the same frame.
    frame_idx, object_ids, masks = predictor.add_new_points_or_box(state, <your_prompts>)

    # Propagate the prompts to get masklets throughout the video.
    for frame_idx, object_ids, masks in predictor.propagate_in_video(state):
        ...
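For concreteness, here is the same loop with values filled in: a minimal sketch assuming a local file video.mp4, a single positive click on frame 0 for one object, and the frame_idx / obj_id / points / labels keyword arguments (the file name, coordinates, object id, and the logit threshold are illustrative assumptions):

import numpy as np
import torch
from sam2.sam2_video_predictor import SAM2VideoPredictor

predictor = SAM2VideoPredictor.from_pretrained("facebook/sam2-hiera-small")

with torch.inference_mode(), torch.autocast("cuda", dtype=torch.bfloat16):
    # Assumption: init_state accepts a path to an MP4 file
    # (or a directory of JPEG frames).
    state = predictor.init_state("video.mp4")

    # Prompt object 1 with one positive click (x=210, y=350) on frame 0.
    frame_idx, object_ids, masks = predictor.add_new_points_or_box(
        state,
        frame_idx=0,
        obj_id=1,
        points=np.array([[210, 350]], dtype=np.float32),
        labels=np.array([1], dtype=np.int32),  # 1 = foreground, 0 = background
    )

    # Track the object through the remaining frames, collecting one binary
    # mask per object per frame (masks are logits; threshold at 0).
    video_segments = {}
    for frame_idx, object_ids, masks in predictor.propagate_in_video(state):
        video_segments[frame_idx] = {
            obj_id: (masks[i] > 0.0).cpu().numpy()
            for i, obj_id in enumerate(object_ids)
        }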
For details, refer to the demo notebooks.
📄 License
This project is released under the Apache-2.0 license.
📚 Citation
To cite the paper, model, or software, please use the BibTeX entry below:
@article{ravi2024sam2,
title={SAM 2: Segment Anything in Images and Videos},
author={Ravi, Nikhila and Gabeur, Valentin and Hu, Yuan-Ting and Hu, Ronghang and Ryali, Chaitanya and Ma, Tengyu and Khedr, Haitham and R{\"a}dle, Roman and Rolland, Chloe and Gustafson, Laura and Mintun, Eric and Pan, Junting and Alwala, Kalyan Vasudev and Carion, Nicolas and Wu, Chao-Yuan and Girshick, Ross and Doll{\'a}r, Piotr and Feichtenhofer, Christoph},
journal={arXiv preprint arXiv:2408.00714},
url={https://arxiv.org/abs/2408.00714},
year={2024}
}