S

Segformer B5 Finetuned IDD L2 V2

Developed by izzako
This model is an image segmentation model based on NVIDIA's MIT-B5 architecture, fine-tuned on the IDD 20K semantic segmentation dataset, suitable for road scene understanding tasks.
Downloads 29
Release Time : 3/25/2025

Model Overview

SegFormer-B5 is an efficient semantic segmentation model, fine-tuned on the IDD 20K dataset, capable of accurately identifying various objects and regions in road scenes, including roads, pedestrians, vehicles, buildings, etc.

Model Features

High-precision road scene segmentation
Fine-tuned on the IDD 20K dataset, it can accurately identify various scene elements such as roads, sidewalks, and vehicles.
Multi-category recognition capability
Supports recognition of over 20 different road scene categories, including static elements (e.g., roads, buildings) and dynamic elements (e.g., pedestrians, vehicles).
Optimized training parameters
Trained using the Adam optimizer and linear learning rate scheduler with a learning rate of 0.0006 for 50 epochs.

Model Capabilities

Image segmentation
Road scene understanding
Multi-category object recognition

Use Cases

Autonomous driving
Road scene parsing
Used for real-time understanding and segmentation of road environments in autonomous driving systems.
Achieved a mean Intersection over Union (mIoU) of 0.7180 on the IDD 20K evaluation set.
Intelligent transportation systems
Traffic element monitoring
Identifies and counts traffic participants such as vehicles and pedestrians on the road.
Cyclist recognition accuracy reached 0.8434, and pedestrian recognition accuracy reached 0.8057.
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase