sam2-hiera-base-plus开源模型 - 免费部署，支持图像和视频提示式高效分割

首页

Sam2 Hiera Base Plus

由 facebook 开发

SAM 2是FAIR研发的面向图像和视频可提示视觉分割的基础模型，支持通过提示进行高效分割。

图像分割开源协议:Apache-2.0 #可提示分割 #视频对象跟踪 #多模态输入

下载量 18.17k

发布时间 : 8/2/2024

模型简介

SAM 2是一个用于图像和视频分割的基础模型，能够根据用户提供的提示（如点或框）快速生成高质量的分割掩码。

模型特点

可提示分割

支持通过点、框等提示方式进行交互式分割

视频分割

能够处理视频序列，支持跨帧的掩码传播

高效推理

使用bfloat16精度和CUDA加速实现高效推理

模型能力

图像分割

视频分割

交互式分割

掩码生成

使用案例

计算机视觉

图像编辑

快速分离图像中的对象进行编辑

高质量的对象分割掩码

视频分析

跟踪视频中的对象运动

跨帧一致的对象分割

🚀 SAM 2：图像和视频中的任意分割模型

SAM 2 是由 FAIR 开发的基础模型，旨在解决图像和视频中的可提示视觉分割问题。它能够根据用户的提示，在图像和视频中实现灵活的分割任务。更多信息请参考 SAM 2 论文。

官方代码已在这个仓库中公开。

🚀 快速开始

本项目提供了在图像和视频中进行分割预测的功能，以下是具体的使用方法。

💻 使用示例

基础用法

图像预测

import torch
from sam2.sam2_image_predictor import SAM2ImagePredictor

predictor = SAM2ImagePredictor.from_pretrained("facebook/sam2-hiera-base-plus")

with torch.inference_mode(), torch.autocast("cuda", dtype=torch.bfloat16):
    predictor.set_image(<your_image>)
    masks, _, _ = predictor.predict(<input_prompts>)

视频预测

import torch
from sam2.sam2_video_predictor import SAM2VideoPredictor

predictor = SAM2VideoPredictor.from_pretrained("facebook/sam2-hiera-base-plus")

with torch.inference_mode(), torch.autocast("cuda", dtype=torch.bfloat16):
    state = predictor.init_state(<your_video>)

    # add new prompts and instantly get the output on the same frame
    frame_idx, object_ids, masks = predictor.add_new_points_or_box(state, <your_prompts>):

    # propagate the prompts to get masklets throughout the video
    for frame_idx, object_ids, masks in predictor.propagate_in_video(state):
        ...

更多详细信息请参考演示笔记本。

引用

如果您想引用该论文、模型或软件，请使用以下 BibTeX 格式：

@article{ravi2024sam2,
  title={SAM 2: Segment Anything in Images and Videos},
  author={Ravi, Nikhila and Gabeur, Valentin and Hu, Yuan-Ting and Hu, Ronghang and Ryali, Chaitanya and Ma, Tengyu and Khedr, Haitham and R{\"a}dle, Roman and Rolland, Chloe and Gustafson, Laura and Mintun, Eric and Pan, Junting and Alwala, Kalyan Vasudev and Carion, Nicolas and Wu, Chao-Yuan and Girshick, Ross and Doll{\'a}r, Piotr and Feichtenhofer, Christoph},
  journal={arXiv preprint arXiv:2408.00714},
  url={https://arxiv.org/abs/2408.00714},
  year={2024}
}