Oneformer Ade20k Dinat Large
The first multi-task universal image segmentation framework supporting semantic/instance/panoptic segmentation with a single model
Downloads 2,275
Release Time : 11/15/2022
Model Overview
OneFormer is a Transformer-based universal image segmentation model that achieves semantic segmentation, instance segmentation, and panoptic segmentation through a single architecture and training process, trained on the ADE20k dataset.
Model Features
Unified Multi-task Architecture
A single model simultaneously supports semantic segmentation, instance segmentation, and panoptic segmentation
Dynamic Task Adaptation
Implements task guidance during training and dynamic task switching during inference through task token mechanism
Outperforms Specialized Models
Surpasses performance of specially designed single-task models across multiple segmentation tasks
Model Capabilities
Semantic Segmentation
Instance Segmentation
Panoptic Segmentation
Scene Parsing
Object Recognition
Use Cases
Computer Vision
Scene Understanding
Pixel-level semantic parsing of indoor/outdoor scenes
Can recognize 150 categories of scene elements (based on ADE20k dataset)
Object Instance Segmentation
Identify and segment individual object instances in images
Capable of handling overlapping objects in complex scenes
Autonomous Driving
Road Scene Parsing
Real-time segmentation of road elements, vehicles, pedestrians, etc.
Suitable for environmental perception modules in autonomous driving systems
Featured Recommended AI Models
Š 2025AIbase