M

Mask2former Swin Tiny Coco Panoptic

Developed by facebook
Mask2Former is a Transformer-based unified image segmentation model supporting instance segmentation, semantic segmentation, and panoptic segmentation tasks, utilizing masked attention mechanism to enhance performance
Downloads 4,538
Release Time : 1/2/2023

Model Overview

Mask2Former adopts a unified paradigm for image segmentation tasks, achieving instance/semantic/panoptic segmentation by predicting a set of masks and corresponding labels. Compared to previous models, its innovation lies in multi-scale deformable attention mechanism and masked-attention decoder

Model Features

Unified Segmentation Architecture
Unifies instance/semantic/panoptic segmentation as mask prediction problems, simplifying task processing workflow
Masked Attention Mechanism
Employs masked-attention Transformer decoder to improve performance without increasing computational cost
Efficient Training Strategy
Significantly improves training efficiency by calculating loss through sampled points rather than entire masks

Model Capabilities

Image Segmentation
Instance Segmentation
Semantic Segmentation
Panoptic Segmentation

Use Cases

Computer Vision
Scene Understanding
Pixel-level identification and classification of objects in complex scenes
Can output segmentation masks with semantic labels
Autonomous Driving
Identifying various objects and drivable areas in road scenes
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
Š 2025AIbase