S

Segformer B5 Finetuned Human Parsing

Developed by matei-dorian
A vision segmentation model based on MIT-B5 architecture, specialized in pixel-level parsing of human body parts and clothing
Downloads 4,066
Release Time : 5/2/2023

Model Overview

This model is a fine-tuned version of NVIDIA's MIT-B5 architecture for human parsing tasks, capable of accurately segmenting human body parts (e.g., face, limbs) and clothing accessories (e.g., hats, shoes, bags) in images

Model Features

High-precision part segmentation
Achieves over 90% accuracy in key areas such as face and hair
Multi-category recognition
Supports simultaneous recognition of 20+ human body parts and clothing accessories
Efficient fine-tuning architecture
Based on the pre-trained SegFormer-B5 model, excellent performance can be achieved with minimal fine-tuning data

Model Capabilities

Human body part segmentation
Clothing accessory recognition
Pixel-level image parsing
Multi-category semantic segmentation

Use Cases

Fashion analysis
Virtual fitting effect analysis
Identifies various components of the user's clothing
Top recognition accuracy 85.53%, pants 89.13%
Human-computer interaction
Augmented reality applications
Precisely locates human body parts for virtual overlays
Face recognition accuracy 90.94%, arm recognition 85%+
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
Š 2025AIbase