S

Sapiens Seg 1b Torchscript

Developed by facebook
Sapiens is a series of vision transformers pre-trained on 300 million 1024×1024 resolution human images, specifically designed for human-centric vision tasks with exceptional generalization capabilities.
Downloads 892
Release Time : 9/9/2024

Model Overview

This model is a 1.169 billion parameter vision transformer, fine-tuned for high-resolution image segmentation tasks across 28 human body part categories.

Model Features

High-resolution support
Natively supports 1K high-resolution inference (1024×768), ideal for precise human body part segmentation.
Strong generalization capability
Demonstrates exceptional generalization to real-world data even with scarce annotations or fully synthetic scenarios.
Large-scale pre-training
Pre-trained on 300 million 1024×1024 resolution human images, featuring rich visual representation capabilities.

Model Capabilities

Human image segmentation
28 body part recognition
High-resolution image processing

Use Cases

Medical imaging
Surgical planning assistance
Used for precise segmentation and visualization of human body parts pre-surgery
Improves surgical planning accuracy
Virtual try-on
Virtual garment fitting
Accurate body part segmentation for more realistic virtual try-on effects
Enhances e-commerce user experience
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
Š 2025AIbase