# Scene Parsing

Segformer B0 Scene Parse 150
Other
Lightweight image segmentation model based on MIT-B0 architecture, optimized for scene parsing tasks
Image Segmentation Transformers
S
univers1123
20
0
Upernet Swin Small
MIT
UperNet is a framework for semantic segmentation, utilizing Swin Transformer as the backbone network to achieve pixel-level semantic label prediction.
Image Segmentation Transformers English
U
openmmlab
1,467
5
Upernet Convnext Large
MIT
UperNet is a semantic segmentation framework combined with the ConvNeXt large backbone network for pixel-level semantic label prediction.
Image Segmentation Transformers English
U
openmmlab
23.09k
0
Upernet Convnext Small
MIT
UperNet is a framework for semantic segmentation that uses ConvNeXt as its backbone network, enabling pixel-level semantic label prediction.
Image Segmentation Transformers English
U
openmmlab
43.31k
31
Smallcap7m
A model capable of converting image content into textual descriptions, suitable for various vision-language tasks.
Image-to-Text Transformers English
S
Yova
977
5
Segformer B2 Finetuned Ade 512 512
Other
SegFormer is a Transformer-based semantic segmentation model fine-tuned on the ADE20k dataset, suitable for image segmentation tasks at 512x512 resolution.
Image Segmentation Transformers
S
nvidia
44.07k
3
Segformer B5 Finetuned Ade 640 640
Other
SegFormer is a Transformer-based semantic segmentation model fine-tuned on the ADE20k dataset, suitable for image segmentation tasks.
Image Segmentation Transformers
S
nvidia
42.32k
39
Maskformer Swin Large Ade
Other
Semantic segmentation model trained on the ADE20k dataset, using a unified framework for instance segmentation, semantic segmentation, and panoptic segmentation tasks
Image Segmentation Transformers
M
facebook
4,708
57
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase