Segformer B2 1024x1024 City 160k
A semantic segmentation model based on the Segformer architecture, specifically optimized for the Cityscapes dataset
Downloads 651
Release Time : 11/29/2024
Model Overview
This is a PyTorch-implemented Segformer model designed for semantic segmentation tasks in urban street scenes. The model uses MIT-B2 as the encoder, trained at 1024x1024 resolution, suitable for fine-grained segmentation in urban scenarios.
Model Features
Efficient Segmentation Architecture
Utilizes the Segformer architecture, combining the advantages of Transformers with efficient segmentation performance
High-Resolution Processing
Supports high-resolution inputs of 1024x1024, ideal for fine-grained segmentation in urban scenes
Pre-trained Model
Provides model weights pre-trained on the Cityscapes dataset, ready for direct inference
Model Capabilities
Urban scene semantic segmentation
Pixel-level classification
Urban scene understanding
Use Cases
Intelligent Transportation
Road Scene Parsing
Identifies traffic elements such as roads, vehicles, and pedestrians
Can be used for environmental perception in autonomous driving systems
Urban Planning
Urban Infrastructure Analysis
Identifies urban elements like buildings, roads, and green belts
Assists in urban planning decision-making
Featured Recommended AI Models
Š 2025AIbase