M

Mask2former Swin Small Coco Instance

Developed by facebook
Mask2Former is a unified image segmentation model based on Transformer, fine-tuned on the COCO dataset for instance segmentation tasks
Downloads 17.51k
Release Time : 12/26/2022

Model Overview

Adopts a unified paradigm to handle instance/semantic/panoptic segmentation tasks by predicting mask groups and corresponding labels, with improved performance and efficiency compared to its predecessor MaskFormer

Model Features

Unified Segmentation Architecture
Treats instance/semantic/panoptic segmentation uniformly as instance segmentation, simplifying the task flow
Multi-scale Deformable Attention
Replaces traditional pixel decoders to improve feature extraction efficiency
Masked Attention Mechanism
Enhances model performance without increasing computational load
Efficient Training Strategy
Significantly reduces training resource consumption by computing loss from sampled points rather than entire masks

Model Capabilities

Image Instance Segmentation
Object Mask Prediction
Multi-category Object Recognition

Use Cases

Computer Vision
Object Recognition and Segmentation
Identifies objects in images and generates precise segmentation masks
Achieves SOTA performance on the COCO dataset
Scene Understanding
Analyzes object distribution and spatial relationships in complex scenes
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
Š 2025AIbase