Mask2former Swin Large Ade Panoptic
Mask2Former model trained on the ADE20k panoptic segmentation dataset using a Swin large backbone network, employing a unified paradigm to handle instance segmentation, semantic segmentation, and panoptic segmentation tasks.
Downloads 2,625
Release Time : 1/5/2023
Model Overview
Mask2Former is a universal image segmentation model that unifies instance segmentation, semantic segmentation, and panoptic segmentation by predicting a set of masks and their corresponding labels, treating all three tasks as instance segmentation problems.
Model Features
Unified Segmentation Paradigm
By predicting a set of masks and their corresponding labels, it unifies instance segmentation, semantic segmentation, and panoptic segmentation as instance segmentation problems.
Multi-scale Deformable Attention
Uses a multi-scale deformable attention Transformer to upgrade the pixel decoder, improving model performance.
Masked Attention Mechanism
Introduces a Transformer decoder with a masked attention mechanism, enhancing performance without increasing computational cost.
Efficient Training
Significantly improves training efficiency by computing losses on subsampled points.
Model Capabilities
Image Segmentation
Instance Segmentation
Semantic Segmentation
Panoptic Segmentation
Use Cases
Computer Vision
Scene Understanding
Used to understand objects and their relationships in complex scenes
Autonomous Driving
Used for object recognition and segmentation in road scenes
Featured Recommended AI Models
Š 2025AIbase