# Swin Backbone Network
Test Mask2former Swin Large Cityscapes Semantic
Other
A large Mask2Former model with a Swin backbone, trained for semantic segmentation on Cityscapes using a unified architecture for image segmentation tasks
Image Segmentation
kroixy
22
0
Mask2former Deployment
Other
A fine-tuned semantic segmentation model based on the Mask2Former framework, suitable for road scene understanding and autonomous driving applications
Image Segmentation
Safetensors
saninmohammedn
229
1
Video Mask2former Swin Large Youtubevis 2021 Instance
MIT
A video instance segmentation model trained on the YouTubeVIS-2021 dataset, utilizing the Swin Transformer backbone and Mask2Former unified segmentation architecture
Image Segmentation
Transformers

shivalikasingh
52
1
Video Mask2former Swin Small Youtubevis 2021 Instance
MIT
Video Mask2Former model trained on the YouTubeVIS-2021 dataset for video instance segmentation tasks, using Swin Transformer as the backbone network.
Image Segmentation
Transformers

shivalikasingh
18
0
Video Mask2former Swin Tiny Youtubevis 2019 Instance
MIT
A tiny video instance segmentation model trained on the YouTubeVIS-2019 dataset, utilizing the Swin Transformer backbone and Mask2Former unified segmentation architecture
Image Segmentation
Transformers

shivalikasingh
19
0
Video Mask2former Swin Tiny Youtubevis 2021 Instance
MIT
A tiny video instance segmentation model trained on the YouTubeVIS-2021 dataset, utilizing a Swin Transformer backbone network
Image Segmentation
Transformers

shivalikasingh
22
2
Mask2former Swin Base IN21k Coco Instance
Other
Mask2Former is a Transformer-based universal image segmentation model, fine-tuned on the COCO dataset for instance segmentation tasks
Image Segmentation
Transformers

facebook
26
0
Mask2former Swin Base IN21k Cityscapes Semantic
Other
A general-purpose image segmentation model based on Swin Transformer, unifying instance/semantic/panoptic segmentation tasks
Image Segmentation
Transformers

facebook
329
0
Upernet Swin Small
MIT
UperNet is a framework for semantic segmentation, utilizing Swin Transformer as the backbone network to achieve pixel-level semantic label prediction.
Image Segmentation
Transformers, English

openmmlab
1,467
5
Upernet Swin Tiny
MIT
UperNet is a semantic segmentation framework that uses Swin Transformer as the backbone network, enabling pixel-level semantic label prediction.
Image Segmentation
Transformers, English

openmmlab
4,682
3
Mask2former Swin Tiny Cityscapes Semantic
Other
Mask2Former is a unified image segmentation framework capable of handling instance segmentation, semantic segmentation, and panoptic segmentation tasks. This model is based on the Swin-Tiny backbone network and has been fine-tuned for semantic segmentation on the Cityscapes dataset.
Image Segmentation
Transformers

facebook
55.98k
3
Mask2former Swin Small Cityscapes Semantic
Other
A small version of Mask2Former with a Swin backbone, trained for semantic segmentation on Cityscapes
Image Segmentation
Transformers

facebook
952
2
Mask2former Swin Base IN21k Cityscapes Panoptic
Other
Mask2Former is a general-purpose image segmentation model based on the Transformer architecture, capable of handling instance, semantic, and panoptic segmentation tasks.
Image Segmentation
Transformers

facebook
140
0
Mask2former Swin Base IN21k Cityscapes Instance
Other
Mask2Former is a Transformer-based general-purpose image segmentation model that unifies instance, semantic, and panoptic segmentation tasks.
Image Segmentation
Transformers

facebook
53
0
Mask2former Swin Tiny Ade Semantic
Other
Mask2Former is a unified image segmentation model based on the Transformer architecture, capable of handling instance, semantic, and panoptic segmentation tasks.
Image Segmentation
Transformers

facebook
7,834
1
Mask2former Swin Large Ade Semantic
Other
A large-scale version based on the Swin backbone network, trained on the ADE20k semantic segmentation dataset, employing a unified paradigm for image segmentation tasks.
Image Segmentation
Transformers

facebook
238.92k
15
Mask2former Swin Large Ade Panoptic
Other
Mask2Former model trained on the ADE20k panoptic segmentation dataset using a Swin large backbone network, employing a unified paradigm to handle instance segmentation, semantic segmentation, and panoptic segmentation tasks.
Image Segmentation
Transformers

facebook
2,625
4
Mask2former Swin Tiny Cityscapes Instance
Other
Mask2Former is a general-purpose image segmentation model based on the Transformer architecture; this version is fine-tuned for instance segmentation on the Cityscapes dataset.
Image Segmentation
Transformers

facebook
67
0
Mask2former Swin Small Cityscapes Instance
Other
Mask2Former is a unified image segmentation model based on the Transformer architecture, using a masked attention mechanism to improve performance
Image Segmentation
Transformers

facebook
43
1
Mask2former Swin Large Mapillary Vistas Semantic
Other
A large-scale Mask2Former model based on the Swin backbone network, designed for general image segmentation tasks, unifying instance segmentation, semantic segmentation, and panoptic segmentation.
Image Segmentation
Transformers

facebook
5,539
3
Mask2former Swin Large Cityscapes Semantic
Other
A large-scale Mask2Former model based on the Swin backbone network, specifically trained for Cityscapes semantic segmentation tasks, adopting a unified architecture for various image segmentation tasks.
Image Segmentation
Transformers

facebook
296.33k
24
Mask2former Swin Small Cityscapes Panoptic
Other
A compact Mask2Former model with a Swin backbone, optimized for panoptic segmentation on the Cityscapes dataset
Image Segmentation
Transformers

facebook
568
0
Mask2former Swin Large Cityscapes Panoptic
Other
A Mask2Former model with a Swin backbone, trained and optimized for panoptic segmentation on the Cityscapes dataset
Image Segmentation
Transformers

facebook
772
2
Mask2former Swin Tiny Cityscapes Panoptic
Other
A Mask2Former model with a Swin-Tiny backbone, optimized for panoptic segmentation on Cityscapes
Image Segmentation
Transformers

facebook
2,126
0
Mask2former Swin Tiny Coco Panoptic
Other
Mask2Former is a Transformer-based unified image segmentation model that supports instance, semantic, and panoptic segmentation tasks and uses a masked attention mechanism to improve performance
Image Segmentation
Transformers

facebook
4,538
8
Mask2former Swin Small Coco Panoptic
Other
A small version of Mask2Former with a Swin backbone, optimized for panoptic segmentation on the COCO dataset
Image Segmentation
Transformers

facebook
240
1
Mask2former Swin Large Coco Panoptic
Other
A large-scale version of Mask2Former based on the Swin backbone network, specifically trained for panoptic segmentation tasks on the COCO dataset
Image Segmentation
Transformers

facebook
37.67k
30
Mask2former Swin Tiny Coco Instance
Other
A tiny version of the Mask2Former instance segmentation model trained on the COCO dataset, using a Swin backbone and a unified approach to segmentation tasks
Image Segmentation
Transformers

facebook
149.85k
7
Oneformer Coco Swin Large
MIT
OneFormer is the first multi-task universal image segmentation framework, achieving semantic segmentation, instance segmentation, and panoptic segmentation tasks with a single model
Image Segmentation
Transformers

shi-labs
165.70k
3
Oneformer Cityscapes Swin Large
MIT
OneFormer is the first multi-task universal image segmentation framework, supporting semantic, instance, and panoptic segmentation tasks with a single model
Image Segmentation
Transformers

shi-labs
1,784
2
Maskformer Swin Large Coco
Other
A large MaskFormer model with a Swin backbone, unifying instance, semantic, and panoptic segmentation tasks
Image Segmentation
Transformers

facebook
849
24
Maskformer Swin Small Ade
Other
A semantic segmentation model trained on the ADE20k dataset, using a unified framework to handle instance/semantic/panoptic segmentation tasks
Image Segmentation
Transformers

facebook
205
2
Maskformer Swin Base Ade
Other
MaskFormer semantic segmentation model trained on the ADE20k dataset, using a Swin backbone network to unify instance/semantic/panoptic segmentation tasks
Image Segmentation
Transformers

facebook
5,670
11
Maskformer Swin Tiny Ade
Other
A semantic segmentation model trained on the ADE20k dataset, using a unified framework to handle instance/semantic/panoptic segmentation tasks
Image Segmentation
Transformers

facebook
5,196
5
Maskformer Swin Small Coco
Other
A small MaskFormer model based on the Swin backbone network, trained on the COCO dataset for panoptic segmentation tasks.
Image Segmentation
Transformers

facebook
2,293
3
Maskformer Swin Large Ade
Other
A semantic segmentation model trained on the ADE20k dataset, using a unified framework for instance, semantic, and panoptic segmentation tasks
Image Segmentation
Transformers

facebook
4,708
57
Maskformer Swin Base Coco
Other
A panoptic segmentation model based on the Swin backbone network, trained on the COCO dataset, unifying instance/semantic/panoptic segmentation tasks
Image Segmentation
Transformers

facebook
3,855
24
Maskformer Swin Tiny Coco
Other
A panoptic segmentation model trained on the COCO dataset, using a unified paradigm to handle instance/semantic/panoptic segmentation tasks
Image Segmentation
Transformers

facebook
301
6