sam2-hiera-base-plus開源模型 - 免費部署，支持圖像和視頻提示式高效分割

首頁

Sam2 Hiera Base Plus

由facebook開發

SAM 2是FAIR研發的面向圖像和視頻可提示視覺分割的基礎模型，支持通過提示進行高效分割。

圖像分割開源協議:Apache-2.0 #可提示分割 #視頻對象跟蹤 #多模態輸入

下載量 18.17k

發布時間 : 8/2/2024

模型概述

SAM 2是一個用於圖像和視頻分割的基礎模型，能夠根據用戶提供的提示（如點或框）快速生成高質量的分割掩碼。

模型特點

可提示分割

支持通過點、框等提示方式進行交互式分割

視頻分割

能夠處理視頻序列，支持跨幀的掩碼傳播

高效推理

使用bfloat16精度和CUDA加速實現高效推理

模型能力

圖像分割

視頻分割

交互式分割

掩碼生成

使用案例

計算機視覺

圖像編輯

快速分離圖像中的對象進行編輯

高質量的對象分割掩碼

視頻分析

跟蹤視頻中的對象運動

跨幀一致的對象分割

🚀 SAM 2：圖像和視頻中的任意分割模型

SAM 2 是由 FAIR 開發的基礎模型，旨在解決圖像和視頻中的可提示視覺分割問題。它能夠根據用戶的提示，在圖像和視頻中實現靈活的分割任務。更多信息請參考 SAM 2 論文。

官方代碼已在這個倉庫中公開。

🚀 快速開始

本項目提供了在圖像和視頻中進行分割預測的功能，以下是具體的使用方法。

💻 使用示例

基礎用法

圖像預測

import torch
from sam2.sam2_image_predictor import SAM2ImagePredictor

predictor = SAM2ImagePredictor.from_pretrained("facebook/sam2-hiera-base-plus")

with torch.inference_mode(), torch.autocast("cuda", dtype=torch.bfloat16):
    predictor.set_image(<your_image>)
    masks, _, _ = predictor.predict(<input_prompts>)

視頻預測

import torch
from sam2.sam2_video_predictor import SAM2VideoPredictor

predictor = SAM2VideoPredictor.from_pretrained("facebook/sam2-hiera-base-plus")

with torch.inference_mode(), torch.autocast("cuda", dtype=torch.bfloat16):
    state = predictor.init_state(<your_video>)

    # add new prompts and instantly get the output on the same frame
    frame_idx, object_ids, masks = predictor.add_new_points_or_box(state, <your_prompts>):

    # propagate the prompts to get masklets throughout the video
    for frame_idx, object_ids, masks in predictor.propagate_in_video(state):
        ...

更多詳細信息請參考演示筆記本。

引用

如果您想引用該論文、模型或軟件，請使用以下 BibTeX 格式：

@article{ravi2024sam2,
  title={SAM 2: Segment Anything in Images and Videos},
  author={Ravi, Nikhila and Gabeur, Valentin and Hu, Yuan-Ting and Hu, Ronghang and Ryali, Chaitanya and Ma, Tengyu and Khedr, Haitham and R{\"a}dle, Roman and Rolland, Chloe and Gustafson, Laura and Mintun, Eric and Pan, Junting and Alwala, Kalyan Vasudev and Carion, Nicolas and Wu, Chao-Yuan and Girshick, Ross and Doll{\'a}r, Piotr and Feichtenhofer, Christoph},
  journal={arXiv preprint arXiv:2408.00714},
  url={https://arxiv.org/abs/2408.00714},
  year={2024}
}