Maskformer Swin Base Coco
A panoptic segmentation model based on the Swin backbone network, trained on the COCO dataset, unifying instance/semantic/panoptic segmentation tasks
Downloads 3,855
Release Time : 3/2/2022
Model Overview
MaskFormer unifies segmentation tasks by predicting a set of masks and their corresponding labels, treating all segmentation as instance segmentation. This checkpoint is optimized for semantic segmentation tasks.
Model Features
Unified Segmentation Paradigm
Unifies instance/semantic/panoptic segmentation as a mask prediction problem, simplifying task processing
Swin Backbone Network
Uses the efficient Swin Transformer as the feature extraction backbone, balancing global context and local details
End-to-End Training
Directly predicts binary masks and class labels without relying on ROI operations or post-processing grouping
Model Capabilities
Image Semantic Segmentation
Instance-Level Object Recognition
Panoptic Scene Parsing
Use Cases
Computer Vision
Scene Understanding
Performs pixel-level classification and segmentation of objects in complex scenes
Can output segmentation mask images with semantic labels
Autonomous Driving
Real-time parsing of drivable areas, vehicles, and pedestrians in road scenes
Featured Recommended AI Models
Š 2025AIbase