M

Mask2former Swin Large Ade Semantic

Developed by facebook
A large-scale version based on the Swin backbone network, trained on the ADE20k semantic segmentation dataset, employing a unified paradigm for image segmentation tasks.
Downloads 238.92k
Release Time : 1/5/2023

Model Overview

Mask2Former is an advanced image segmentation model capable of handling instance segmentation, semantic segmentation, and panoptic segmentation tasks. It unifies the processing of different types of segmentation tasks by predicting a set of masks and their corresponding labels.

Model Features

Unified Segmentation Paradigm
Unifies instance segmentation, semantic segmentation, and panoptic segmentation as instance segmentation tasks.
Efficient Attention Mechanism
Uses multi-scale deformable attention Transformer to replace traditional pixel decoders.
Masked Attention Decoder
Introduces a Transformer decoder with masked attention to improve performance without increasing computational load.
Efficient Training Method
Significantly improves training efficiency by calculating loss via sampled points rather than entire masks.

Model Capabilities

Image Semantic Segmentation
Instance Segmentation
Panoptic Segmentation
Multi-scale Feature Extraction

Use Cases

Computer Vision
Scene Understanding
Accurate segmentation and classification of objects in complex scenes.
Achieves SOTA performance on standard datasets like ADE20k.
Autonomous Driving
Recognition and segmentation of various objects in road scenes.
Medical Image Analysis
Segmentation of organs or lesion areas in medical images.
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
Š 2025AIbase