M

Mask2former Swin Tiny Coco Instance

Developed by facebook
A mini version of the Mask2Former instance segmentation model trained on the COCO dataset, utilizing the Swin backbone network to handle segmentation tasks uniformly
Downloads 149.85k
Release Time : 12/23/2022

Model Overview

Mask2Former is a universal image segmentation model that processes instance segmentation, semantic segmentation, and panoptic segmentation tasks by predicting a set of masks and corresponding labels. It offers improvements in both performance and efficiency compared to previous models.

Model Features

Unified Segmentation Paradigm
Treats instance segmentation, semantic segmentation, and panoptic segmentation uniformly as instance segmentation tasks
Efficient Attention Mechanism
Uses a multi-scale deformable attention Transformer to replace traditional pixel decoders
Masked Attention Decoder
Introduces a Transformer decoder with masked attention to enhance performance without increasing computational load
Efficient Training Method
Calculates loss via sampled points rather than entire masks, significantly improving training efficiency

Model Capabilities

Image Segmentation
Instance Recognition
Object Mask Generation

Use Cases

Computer Vision
Object Recognition and Segmentation
Identifies objects in images and generates precise pixel-level segmentation masks
Achieves high-precision instance segmentation on the COCO dataset
Scene Understanding
Analyzes multiple objects and their spatial relationships in complex scenes
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
Š 2025AIbase