O

Oneformer Ade20k Dinat Large

Developed by shi-labs
The first multi-task universal image segmentation framework supporting semantic/instance/panoptic segmentation with a single model
Downloads 2,275
Release Time : 11/15/2022

Model Overview

OneFormer is a Transformer-based universal image segmentation model that achieves semantic segmentation, instance segmentation, and panoptic segmentation through a single architecture and training process, trained on the ADE20k dataset.

Model Features

Unified Multi-task Architecture
A single model simultaneously supports semantic segmentation, instance segmentation, and panoptic segmentation
Dynamic Task Adaptation
Implements task guidance during training and dynamic task switching during inference through task token mechanism
Outperforms Specialized Models
Surpasses performance of specially designed single-task models across multiple segmentation tasks

Model Capabilities

Semantic Segmentation
Instance Segmentation
Panoptic Segmentation
Scene Parsing
Object Recognition

Use Cases

Computer Vision
Scene Understanding
Pixel-level semantic parsing of indoor/outdoor scenes
Can recognize 150 categories of scene elements (based on ADE20k dataset)
Object Instance Segmentation
Identify and segment individual object instances in images
Capable of handling overlapping objects in complex scenes
Autonomous Driving
Road Scene Parsing
Real-time segmentation of road elements, vehicles, pedestrians, etc.
Suitable for environmental perception modules in autonomous driving systems
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
Š 2025AIbase