sam2.1-hiera-small: a free, open-source model for efficient promptable image and video segmentation

Sam2.1 Hiera Small

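Developed by facebook

SAM 2 is a foundation model from FAIR for promptable visual segmentation in images and videos. It segments objects efficiently from user prompts.

Image segmentation | License: Apache-2.0 | #Promptable visual segmentation #Images and videos #Real-time mask generation

Downloads: 7,333

Released: 9/24/2024

Model overview

SAM 2 is a general-purpose visual segmentation model. Given user prompts such as points or boxes, it generates high-quality segmentation masks in both images and videos.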

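Model features

Multiple prompt types

Supports interactive segmentation driven by points, boxes, and other prompts

Video segmentation

An inference state carries context across frames, enabling temporally consistent segmentation throughout a video

Efficient inference

Supports mixed-precision (bfloat16) inference to reduce compute cost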
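
To make the bfloat16 trade-off concrete: bfloat16 keeps the sign bit and the 8 exponent bits of a float32 but only 7 mantissa bits, so each value takes 16 bits instead of 32 and keeps the same range at lower precision. The standard-library sketch below rounds a float32 to the nearest bfloat16 value. `to_bfloat16` is an illustrative helper, not part of SAM 2 or PyTorch, and it does not special-case NaN.

```python
import struct


def to_bfloat16(value: float) -> float:
    """Round a number to the nearest bfloat16-representable value."""
    # Reinterpret the float32 encoding as a 32-bit unsigned integer.
    bits = struct.unpack(">I", struct.pack(">f", value))[0]
    # Round to nearest, ties to even, on the 16 low bits that bfloat16 drops.
    bits += 0x7FFF + ((bits >> 16) & 1)
    bits &= 0xFFFF0000
    # Decode the surviving 16 high bits (padded with zeros) as a float32.
    return struct.unpack(">f", struct.pack(">I", bits))[0]


approx_pi = to_bfloat16(3.14159)
print(approx_pi)  # about 3 significant decimal digits survive
```

Mask logits are thresholded rather than read to many digits, which is one reason this reduced precision is usually acceptable for segmentation inference.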

Model capabilities

Image segmentation

Video segmentation

Interactive segmentation

Mask generation

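Use cases

Computer vision

Image editing

Quickly isolate objects in an image for editing

High-quality object masks

Video analysis

Track object motion through a video

Temporally consistent video object segmentation

Medical imaging

Medical image segmentation

Segment organs or lesions in CT/MRI scans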

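🚀 SAM 2: Segment Anything in Images and Videos

SAM 2 is a foundation model developed by FAIR for promptable visual segmentation in images and videos. For more information, see the SAM 2 paper (arXiv:2408.00714).

The official code is publicly available in the project's repository.

🚀 Quick start

💻 Usage examples

Basic usage

The following example runs prediction on a single image: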

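import torch
from sam2.sam2_image_predictor import SAM2ImagePredictor

# Load the SAM 2.1 small checkpoint that this page describes from the Hugging Face Hub
predictor = SAM2ImagePredictor.from_pretrained("facebook/sam2.1-hiera-small")

# Disable autograd and run in bfloat16 mixed precision on the GPU
with torch.inference_mode(), torch.autocast("cuda", dtype=torch.bfloat16):
    # Compute the image embedding once; it is reused for every prompt on this image
    predictor.set_image(<your_image>)
    # Returns the candidate masks, their quality scores, and low-resolution mask logits
    masks, _, _ = predictor.predict(<input_prompts>)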
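
The `<input_prompts>` placeholder above stands for keyword arguments to `predict`. Here is a minimal sketch of assembling them, assuming the `point_coords`, `point_labels`, and `box` keywords of `SAM2ImagePredictor.predict` and the convention that label 1 marks a foreground click and 0 a background click. `build_prompts` and `best_mask_index` are hypothetical helpers, not part of `sam2`, and the coordinates are made up for illustration.

```python
from typing import Optional, Sequence, Tuple


def build_prompts(
    points: Sequence[Tuple[float, float, int]] = (),
    box: Optional[Tuple[float, float, float, float]] = None,
) -> dict:
    """Turn clicks and an optional box into keyword arguments for predict().

    points: (x, y, label) triples in pixel coordinates; label 1 = foreground,
            0 = background (assumed convention).
    box:    (x_min, y_min, x_max, y_max) in pixel coordinates.
    """
    if not points and box is None:
        raise ValueError("provide at least one point or a box")
    prompts: dict = {}
    if points:
        for _, _, label in points:
            if label not in (0, 1):
                raise ValueError(f"label must be 0 or 1, got {label}")
        prompts["point_coords"] = [[float(x), float(y)] for x, y, _ in points]
        prompts["point_labels"] = [int(label) for _, _, label in points]
    if box is not None:
        x0, y0, x1, y1 = box
        if x1 <= x0 or y1 <= y0:
            raise ValueError("box must satisfy x_min < x_max and y_min < y_max")
        prompts["box"] = [float(x0), float(y0), float(x1), float(y1)]
    return prompts


def best_mask_index(scores: Sequence[float]) -> int:
    """Index of the highest-scoring candidate mask."""
    return max(range(len(scores)), key=lambda i: scores[i])


# One foreground click, one background click, and a bounding box (illustrative values)
prompts = build_prompts(points=[(500, 375, 1), (120, 80, 0)], box=(300, 200, 700, 550))
```

With the real predictor, the lists can be converted with `numpy.asarray` and passed as `predictor.predict(**prompts)`. The second return value of `predict` holds one quality score per candidate mask, which `best_mask_index` can rank.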

Advanced usage

The following example runs prediction on a video:

import torch
from sam2.sam2_video_predictor import SAM2VideoPredictor

# Load the SAM 2.1 small checkpoint that this page describes from the Hugging Face Hub
predictor = SAM2VideoPredictor.from_pretrained("facebook/sam2.1-hiera-small")

with torch.inference_mode(), torch.autocast("cuda", dtype=torch.bfloat16):
    # Build the per-video inference state that is shared by all later calls
    state = predictor.init_state(<your_video>)

    # Add new prompts and immediately get the output on the same frame
    frame_idx, object_ids, masks = predictor.add_new_points_or_box(state, <your_prompts>)

    # Propagate the prompts to get masklets throughout the video
    for frame_idx, object_ids, masks in predictor.propagate_in_video(state):
        ...

See the demo notebooks for details.
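
The propagation loop above yields one tuple per frame. Here is a minimal sketch of collecting those results into a per-frame, per-object dictionary, assuming the yielded masks are raw logits that become binary masks when thresholded at 0. `collect_masklets` is a hypothetical helper, and `fake_stream` stands in for `predictor.propagate_in_video(state)` with scalar logits so the sketch runs without a GPU or a checkpoint.

```python
from typing import Any, Dict, Iterable, Sequence, Tuple

FrameResult = Tuple[int, Sequence[int], Sequence[Any]]


def collect_masklets(
    frame_stream: Iterable[FrameResult], threshold: float = 0.0
) -> Dict[int, Dict[int, Any]]:
    """Group propagated masks as {frame_idx: {object_id: binary_mask}}."""
    segments: Dict[int, Dict[int, Any]] = {}
    for frame_idx, object_ids, mask_logits in frame_stream:
        segments[frame_idx] = {
            # `logits > threshold` is elementwise for tensors or arrays and
            # a plain bool for the scalar stand-ins used below.
            obj_id: mask_logits[i] > threshold
            for i, obj_id in enumerate(object_ids)
        }
    return segments


def fake_stream():
    """Stand-in for predictor.propagate_in_video(state): two frames, two objects."""
    yield 0, [1, 2], [3.2, -1.5]
    yield 1, [1, 2], [2.7, 0.4]


segments = collect_masklets(fake_stream())
```

Keying the result by frame index and then object ID keeps each object's masklet easy to look up when rendering or exporting individual frames.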

📄 License

This project is licensed under the Apache-2.0 license.

📚 Citation

To cite the paper, model, or software, use the following BibTeX entry:

@article{ravi2024sam2,
  title={SAM 2: Segment Anything in Images and Videos},
  author={Ravi, Nikhila and Gabeur, Valentin and Hu, Yuan-Ting and Hu, Ronghang and Ryali, Chaitanya and Ma, Tengyu and Khedr, Haitham and R{\"a}dle, Roman and Rolland, Chloe and Gustafson, Laura and Mintun, Eric and Pan, Junting and Alwala, Kalyan Vasudev and Carion, Nicolas and Wu, Chao-Yuan and Girshick, Ross and Doll{\'a}r, Piotr and Feichtenhofer, Christoph},
  journal={arXiv preprint arXiv:2408.00714},
  url={https://arxiv.org/abs/2408.00714},
  year={2024}
}