# Swin Backbone Network

Test Mask2former Swin Large Cityscapes Semantic

A Mask2Former model with a Swin-Large backbone, trained for semantic segmentation on Cityscapes. Mask2Former uses one unified architecture across image segmentation tasks.

Image Segmentation

Mask2former Deployment

A semantic segmentation model fine-tuned within the Mask2Former framework, suited to road-scene understanding and autonomous-driving applications.

Image Segmentation

Video Mask2former Swin Large Youtubevis 2021 Instance

A video instance segmentation model trained on the YouTubeVIS-2021 dataset, using a Swin Transformer backbone and the unified Mask2Former segmentation architecture.

Image Segmentation

Video Mask2former Swin Small Youtubevis 2021 Instance

A Video Mask2Former model trained on the YouTubeVIS-2021 dataset for video instance segmentation, using Swin Transformer as the backbone.

Image Segmentation

Video Mask2former Swin Tiny Youtubevis 2019 Instance

A video instance segmentation model with a Swin-Tiny backbone, trained on the YouTubeVIS-2019 dataset and built on the unified Mask2Former segmentation architecture.

Image Segmentation

Video Mask2former Swin Tiny Youtubevis 2021 Instance

A video instance segmentation model with a Swin-Tiny backbone, trained on the YouTubeVIS-2021 dataset.

Image Segmentation

Mask2former Swin Base IN21k Coco Instance

Mask2Former is a Transformer-based universal image segmentation model; this checkpoint is fine-tuned on the COCO dataset for instance segmentation.

Image Segmentation

Mask2former Swin Base IN21k Cityscapes Semantic

A general-purpose image segmentation model based on Swin Transformer that unifies instance, semantic, and panoptic segmentation.

Image Segmentation

Upernet Swin Small

UperNet is a framework for semantic segmentation, utilizing Swin Transformer as the backbone network to achieve pixel-level semantic label prediction.

Image Segmentation

Transformers English

Upernet Swin Tiny

UperNet is a semantic segmentation framework that uses Swin Transformer as the backbone network, enabling pixel-level semantic label prediction.

Image Segmentation

Transformers English

Mask2former Swin Tiny Cityscapes Semantic

Mask2Former is a unified image segmentation framework capable of handling instance segmentation, semantic segmentation, and panoptic segmentation tasks. This model is based on the Swin-Tiny backbone network and has been fine-tuned for semantic segmentation on the Cityscapes dataset.

Image Segmentation
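Mask2Former-style models predict a set of (class, mask) pairs rather than labeling each pixel directly. The following is a minimal sketch, assuming NumPy and made-up logits, of how such predictions are commonly combined into a per-pixel semantic label map. The function name `semantic_map`, the array shapes, and the toy values are illustrative assumptions and not the API of any model listed on this page.

```python
import numpy as np

def softmax(x, axis=-1):
    # Numerically stable softmax.
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def semantic_map(class_logits, mask_logits):
    """Combine per-query predictions into a semantic label map.

    class_logits: (Q, C + 1) array; the last column is the "no object" class.
    mask_logits:  (Q, H, W) array of per-query mask logits.
    Returns an (H, W) integer array of class indices.
    """
    class_probs = softmax(class_logits, axis=-1)[:, :-1]  # drop "no object"
    mask_probs = sigmoid(mask_logits)
    # Each class score at a pixel is the sum over queries of
    # (probability the query is that class) * (probability the query covers the pixel).
    scores = np.einsum("qc,qhw->chw", class_probs, mask_probs)
    return scores.argmax(axis=0)

# Toy input (hypothetical values): 2 queries, 2 classes, a 2x2 image.
class_logits = np.array([[5.0, 0.0, 0.0],    # query 0 votes for class 0
                         [0.0, 5.0, 0.0]])   # query 1 votes for class 1
mask_logits = np.array([[[4.0, 4.0], [-4.0, -4.0]],    # query 0 covers the top row
                        [[-4.0, -4.0], [4.0, 4.0]]])   # query 1 covers the bottom row
labels = semantic_map(class_logits, mask_logits)
print(labels)
```

In this toy case the top row is assigned class 0 and the bottom row class 1. Real checkpoints add resizing to the input resolution and other post-processing that this sketch leaves out.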

Mask2former Swin Small Cityscapes Semantic

A Mask2Former model with a Swin-Small backbone, trained for semantic segmentation on Cityscapes.

Image Segmentation

Mask2former Swin Base IN21k Cityscapes Panoptic

Mask2Former is a general-purpose image segmentation model based on Transformer architecture, capable of handling instance segmentation, semantic segmentation, and panoptic segmentation tasks.

Image Segmentation

Mask2former Swin Base IN21k Cityscapes Instance

Mask2Former is a Transformer-based general-purpose image segmentation model that unifies instance, semantic, and panoptic segmentation tasks.

Image Segmentation

Mask2former Swin Tiny Ade Semantic

Mask2Former is a unified image segmentation model based on the Transformer architecture, capable of handling instance segmentation, semantic segmentation, and panoptic segmentation tasks.

Image Segmentation

Mask2former Swin Large Ade Semantic

A Mask2Former model with a Swin-Large backbone, trained on the ADE20k semantic segmentation dataset. It applies one unified approach across image segmentation tasks.

Image Segmentation

Mask2former Swin Large Ade Panoptic

A Mask2Former model with a Swin-Large backbone, trained on the ADE20k panoptic segmentation dataset. It uses one unified approach for instance, semantic, and panoptic segmentation.

Image Segmentation

Mask2former Swin Tiny Cityscapes Instance

Mask2Former is a general-purpose image segmentation model based on the Transformer architecture. This version is fine-tuned for instance segmentation on the Cityscapes dataset.

Image Segmentation

Mask2former Swin Small Cityscapes Instance

Mask2Former is a unified image segmentation model based on the Transformer architecture; it uses a masked-attention mechanism to improve performance.

Image Segmentation

Mask2former Swin Large Mapillary Vistas Semantic

A large-scale Mask2Former model based on the Swin backbone network, designed for general image segmentation tasks, unifying instance segmentation, semantic segmentation, and panoptic segmentation.

Image Segmentation

Mask2former Swin Large Cityscapes Semantic

A large-scale Mask2Former model based on the Swin backbone network, specifically trained for Cityscapes semantic segmentation tasks, adopting a unified architecture for various image segmentation tasks.

Image Segmentation

Mask2former Swin Small Cityscapes Panoptic

A compact Mask2Former model with a Swin-Small backbone, optimized for panoptic segmentation on the Cityscapes dataset.

Image Segmentation

Mask2former Swin Large Cityscapes Panoptic

A Mask2Former model with a Swin-Large backbone, trained for panoptic segmentation on the Cityscapes dataset.

Image Segmentation

Mask2former Swin Tiny Cityscapes Panoptic

A Mask2Former model with a Swin-Tiny backbone, optimized for panoptic segmentation on Cityscapes.

Image Segmentation

Mask2former Swin Tiny Coco Panoptic

Mask2Former is a Transformer-based unified image segmentation model that supports instance, semantic, and panoptic segmentation and uses a masked-attention mechanism to improve performance.

Image Segmentation
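The masked-attention mechanism mentioned above restricts each query's cross-attention to the region its mask covered in the previous decoder layer. The sketch below illustrates the idea with NumPy under several simplifying assumptions: a single attention head, no learned projections or residual connections, scaled dot-product scores, and random toy inputs. The function name `masked_cross_attention` and all shapes are hypothetical and are not taken from any library.

```python
import numpy as np

def masked_cross_attention(queries, keys, values, prev_mask_logits):
    """Single-head cross-attention restricted by a predicted mask.

    queries:          (Q, D) query embeddings.
    keys, values:     (N, D) flattened pixel features, N = H * W.
    prev_mask_logits: (Q, N) mask logits from the previous decoder layer.
    Returns (output, weights) with shapes (Q, D) and (Q, N).
    """
    d = queries.shape[-1]
    logits = queries @ keys.T / np.sqrt(d)

    # A pixel is kept when its previous mask probability exceeds 0.5,
    # which is the same as its logit being positive.
    keep = prev_mask_logits > 0.0
    # If a query's previous mask is empty, let it attend everywhere
    # so the softmax below stays well defined.
    empty = ~keep.any(axis=1, keepdims=True)
    keep = keep | empty

    logits = np.where(keep, logits, -np.inf)
    logits = logits - logits.max(axis=1, keepdims=True)
    weights = np.exp(logits)           # exp(-inf) is exactly 0
    weights /= weights.sum(axis=1, keepdims=True)
    return weights @ values, weights

# Toy input (hypothetical values): 2 queries, 4 pixels, 8 feature channels.
rng = np.random.default_rng(0)
queries = rng.normal(size=(2, 8))
pixels = rng.normal(size=(4, 8))
prev_mask_logits = np.array([[3.0, 3.0, -3.0, -3.0],     # query 0 covers pixels 0 and 1
                             [-3.0, -3.0, -3.0, -3.0]])  # query 1 has an empty mask
out, weights = masked_cross_attention(queries, pixels, pixels, prev_mask_logits)
print(weights.round(3))
```

Query 0 places zero weight on pixels 2 and 3, while query 1, whose previous mask is empty, falls back to attending over all pixels.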

Mask2former Swin Small Coco Panoptic

A Mask2Former model with a Swin-Small backbone, optimized for panoptic segmentation on the COCO dataset.

Image Segmentation

Mask2former Swin Large Coco Panoptic

A Mask2Former model with a Swin-Large backbone, trained for panoptic segmentation on the COCO dataset.

Image Segmentation

Mask2former Swin Tiny Coco Instance

A Mask2Former instance segmentation model with a Swin-Tiny backbone, trained on the COCO dataset. Like other Mask2Former variants, it handles segmentation tasks with one unified architecture.

Image Segmentation

Oneformer Coco Swin Large

OneFormer is presented by its authors as the first multi-task universal image segmentation framework, handling semantic, instance, and panoptic segmentation with a single model.

Image Segmentation

Oneformer Cityscapes Swin Large

OneFormer is presented by its authors as the first multi-task universal image segmentation framework, supporting semantic, instance, and panoptic segmentation with a single model.

Image Segmentation

Maskformer Swin Large Coco

A MaskFormer model with a Swin-Large backbone that unifies instance, semantic, and panoptic segmentation.

Image Segmentation

Maskformer Swin Small Ade

A MaskFormer semantic segmentation model trained on the ADE20k dataset. MaskFormer handles instance, semantic, and panoptic segmentation within one framework.

Image Segmentation

Maskformer Swin Base Ade

A MaskFormer semantic segmentation model trained on the ADE20k dataset, using a Swin backbone and one framework for instance, semantic, and panoptic segmentation.

Image Segmentation

Maskformer Swin Tiny Ade

A MaskFormer semantic segmentation model trained on the ADE20k dataset. MaskFormer handles instance, semantic, and panoptic segmentation within one framework.

Image Segmentation

Maskformer Swin Small Coco

A small MaskFormer model based on the Swin backbone network, trained on the COCO dataset for panoptic segmentation tasks.

Image Segmentation

Maskformer Swin Large Ade

A MaskFormer semantic segmentation model trained on the ADE20k dataset, using one framework for instance, semantic, and panoptic segmentation.

Image Segmentation

Maskformer Swin Base Coco

A MaskFormer panoptic segmentation model with a Swin backbone, trained on the COCO dataset. MaskFormer unifies instance, semantic, and panoptic segmentation.

Image Segmentation

Maskformer Swin Tiny Coco

A MaskFormer panoptic segmentation model trained on the COCO dataset, using one unified approach for instance, semantic, and panoptic segmentation.

Image Segmentation

© 2025 AIbase