M

Maskformer Swin Base Coco

Developed by facebook
A panoptic segmentation model based on the Swin backbone network, trained on the COCO dataset, unifying instance/semantic/panoptic segmentation tasks
Downloads 3,855
Release Time : 3/2/2022

Model Overview

MaskFormer unifies segmentation tasks by predicting a set of masks and their corresponding labels, treating all segmentation as instance segmentation. This checkpoint is optimized for semantic segmentation tasks.

Model Features

Unified Segmentation Paradigm
Unifies instance/semantic/panoptic segmentation as a mask prediction problem, simplifying task processing
Swin Backbone Network
Uses the efficient Swin Transformer as the feature extraction backbone, balancing global context and local details
End-to-End Training
Directly predicts binary masks and class labels without relying on ROI operations or post-processing grouping

Model Capabilities

Image Semantic Segmentation
Instance-Level Object Recognition
Panoptic Scene Parsing

Use Cases

Computer Vision
Scene Understanding
Performs pixel-level classification and segmentation of objects in complex scenes
Can output segmentation mask images with semantic labels
Autonomous Driving
Real-time parsing of drivable areas, vehicles, and pedestrians in road scenes
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
Š 2025AIbase