S

Segformer B0 Finetuned Ade 512 512

Developed by nvidia
SegFormer is a Transformer-based semantic segmentation model fine-tuned on the ADE20k dataset, suitable for 512x512 resolution image segmentation tasks.
Downloads 179.04k
Release Time : 3/2/2022

Model Overview

This model adopts a hierarchical Transformer encoder with a lightweight all-MLP decoder head architecture, specifically designed for semantic segmentation tasks, demonstrating excellent performance on benchmarks like ADE20K.

Model Features

Hierarchical Transformer encoder
Adopts a hierarchical Transformer architecture to effectively capture multi-scale features
Lightweight MLP decoder head
Uses an all-MLP designed lightweight decoder head to improve inference efficiency
512x512 resolution support
Specifically optimized for 512x512 resolution images

Model Capabilities

Image semantic segmentation
Scene parsing
Pixel-level classification

Use Cases

Scene understanding
House scene parsing
Performs semantic segmentation on house images to identify different architectural elements
Castle scene parsing
Performs semantic segmentation on castle images to identify different architectural features
Urban planning
Urban landscape analysis
Analyzes urban landscape images to identify elements like roads, buildings, green spaces, etc.
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
Š 2025AIbase