M

Mask2former Swin Large Ade Panoptic

Developed by facebook
Mask2Former model trained on the ADE20k panoptic segmentation dataset using a Swin large backbone network, employing a unified paradigm to handle instance segmentation, semantic segmentation, and panoptic segmentation tasks.
Downloads 2,625
Release Time : 1/5/2023

Model Overview

Mask2Former is a universal image segmentation model that unifies instance segmentation, semantic segmentation, and panoptic segmentation by predicting a set of masks and their corresponding labels, treating all three tasks as instance segmentation problems.

Model Features

Unified Segmentation Paradigm
By predicting a set of masks and their corresponding labels, it unifies instance segmentation, semantic segmentation, and panoptic segmentation as instance segmentation problems.
Multi-scale Deformable Attention
Uses a multi-scale deformable attention Transformer to upgrade the pixel decoder, improving model performance.
Masked Attention Mechanism
Introduces a Transformer decoder with a masked attention mechanism, enhancing performance without increasing computational cost.
Efficient Training
Significantly improves training efficiency by computing losses on subsampled points.

Model Capabilities

Image Segmentation
Instance Segmentation
Semantic Segmentation
Panoptic Segmentation

Use Cases

Computer Vision
Scene Understanding
Used to understand objects and their relationships in complex scenes
Autonomous Driving
Used for object recognition and segmentation in road scenes
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
Š 2025AIbase