O

Oneformer Cityscapes Swin Large

Developed by shi-labs
The first multi-task universal image segmentation framework, supporting semantic/instance/panoptic segmentation tasks with a single model
Downloads 1,784
Release Time : 11/15/2022

Model Overview

A unified image segmentation model based on Swin backbone network, enabling dynamic task switching through task tokens, trained on the Cityscapes dataset

Model Features

Unified Multi-task Architecture
A single model simultaneously supports semantic segmentation, instance segmentation, and panoptic segmentation tasks
Dynamic Task Switching
Task switching during inference via task tokens without retraining dedicated models
Outperforms Dedicated Models
Surpasses traditional dedicated models in performance across all three segmentation tasks

Model Capabilities

Semantic Segmentation
Instance Segmentation
Panoptic Segmentation
Street Scene Parsing

Use Cases

Autonomous Driving
Road Scene Understanding
Identifying elements such as vehicles, pedestrians, and traffic signs in urban road scenes
Provides pixel-level semantic labels and instance boundaries
Geographic Information Systems
Aerial Image Analysis
Segmenting features like buildings and roads in satellite/aerial images
Generates quantifiable geographic data
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
Š 2025AIbase