Segformer-b2 Open-Source Semantic Segmentation Model - Free Deployment, Optimized for Cityscapes Dataset

Home

Segformer B2 1024x1024 City 160k

Developed by smp-hub

A semantic segmentation model based on the Segformer architecture, specifically optimized for the Cityscapes dataset

Image Segmentation

Safetensors

Open Source License:Other #Urban Scene Segmentation #High-Resolution Image Processing #Semantic Segmentation

Downloads 651

Release Time : 11/29/2024

Model Overview

This is a PyTorch-implemented Segformer model designed for semantic segmentation tasks in urban street scenes. The model uses MIT-B2 as the encoder, trained at 1024x1024 resolution, suitable for fine-grained segmentation in urban scenarios.

Model Features

Efficient Segmentation Architecture

Utilizes the Segformer architecture, combining the advantages of Transformers with efficient segmentation performance

High-Resolution Processing

Supports high-resolution inputs of 1024x1024, ideal for fine-grained segmentation in urban scenes

Pre-trained Model

Provides model weights pre-trained on the Cityscapes dataset, ready for direct inference

Model Capabilities

Urban scene semantic segmentation

Pixel-level classification

Urban scene understanding

Use Cases

Intelligent Transportation

Road Scene Parsing

Identifies traffic elements such as roads, vehicles, and pedestrians

Can be used for environmental perception in autonomous driving systems

Urban Planning

Urban Infrastructure Analysis

Identifies urban elements like buildings, roads, and green belts

Assists in urban planning decision-making

🚀 Segformer Model Card

This model card provides details about the Segformer model for image segmentation, including how to load the trained model, its initialization parameters, and the dataset used.

🚀 Quick Start

You can quickly start using the Segformer model by following the steps below. Click the button to open the Colab notebook for inference with the pre - trained model.

📦 Installation

First, install the required libraries:

pip install -U segmentation_models_pytorch albumentations

💻 Usage Examples

Basic Usage

The following code shows how to load a pre - trained Segformer model and perform inference on an image:

import torch
import requests
import numpy as np
import albumentations as A
import segmentation_models_pytorch as smp

from PIL import Image

device = "cuda" if torch.cuda.is_available() else "cpu"

# Load pretrained model and preprocessing function
checkpoint = "smp-hub/segformer-b2-1024x1024-city-160k"
model = smp.from_pretrained(checkpoint).eval().to(device)
preprocessing = A.Compose.from_pretrained(checkpoint)

# Load image
url = "https://huggingface.co/datasets/hf-internal-testing/fixtures_ade20k/resolve/main/ADE_val_00000001.jpg"
image = Image.open(requests.get(url, stream=True).raw)

# Preprocess image
np_image = np.array(image)
normalized_image = preprocessing(image=np_image)["image"]
input_tensor = torch.as_tensor(normalized_image)
input_tensor = input_tensor.permute(2, 0, 1).unsqueeze(0)  # HWC -> BCHW
input_tensor = input_tensor.to(device)

# Perform inference
with torch.no_grad():
    output_mask = model(input_tensor)

# Postprocess mask
mask = torch.nn.functional.interpolate(
    output_mask, size=(image.height, image.width), mode="bilinear", align_corners=False
)
mask = mask.argmax(1).cpu().numpy()  # argmax over predicted classes (channels dim)

🔧 Technical Details

Model init parameters

The following are the initialization parameters for the model:

model_init_params = {
    "encoder_name": "mit_b2",
    "encoder_depth": 5,
    "encoder_weights": None,
    "decoder_segmentation_channels": 768,
    "in_channels": 3,
    "classes": 19,
    "activation": None,
    "aux_params": None
}

Dataset

The model is trained on the Cityscapes dataset.

📚 Documentation

Library: https://github.com/qubvel/segmentation_models.pytorch
Docs: https://smp.readthedocs.io/en/latest/

📄 License

The license information can be found at: https://github.com/NVlabs/SegFormer/blob/master/LICENSE

This model has been pushed to the Hub using the PytorchModelHubMixin

Featured Recommended AI Models

Empowering the Future, Your AI Solution Knowledge Base

English 简体中文繁體中文にほんご