Sapiens Seg 1b Bfloat16
Sapiens is a Vision Transformer model pre-trained on 300 million high-resolution human images and specialized for human-centric vision tasks
Release Time: 9/10/2024
Model Overview
This model segments images of people into 28 body-part classes, supports inference at 1K resolution, and generalizes well to real-world images
Model Features
High-Resolution Support
Natively supports 1024x1024 resolution input, ideal for high-precision segmentation tasks
Large-Scale Pre-training
Pre-trained on 300 million human images, learning rich visual features
Real-World Generalization
Maintains strong performance on real-world data even when labeled data is scarce or entirely synthetic
Efficient Inference
Uses the bfloat16 number format to balance accuracy and computational efficiency
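To make the bfloat16 trade-off concrete, the sketch below converts ordinary floats to bfloat16 and back using only the Python standard library. It relies on the general definition of the format (the top 16 bits of an IEEE 754 float32: 1 sign bit, 8 exponent bits, 7 mantissa bits) and is not taken from the Sapiens code. The function names are hypothetical and chosen for illustration only.

```python
import struct


def float_to_bfloat16_bits(x: float) -> int:
    """Return the 16-bit bfloat16 pattern for x (round to nearest, ties to even)."""
    # Reinterpret the value as the 32 bits of an IEEE 754 float32.
    (bits,) = struct.unpack("<I", struct.pack("<f", x))
    # NaN guard: keep the result a NaN instead of rounding it into infinity.
    if (bits & 0x7F800000) == 0x7F800000 and (bits & 0x007FFFFF) != 0:
        return ((bits >> 16) | 0x0040) & 0xFFFF
    # Add a rounding bias so that dropping the low 16 bits rounds to nearest even.
    bits += 0x7FFF + ((bits >> 16) & 1)
    return (bits >> 16) & 0xFFFF


def bfloat16_bits_to_float(b: int) -> float:
    """Expand a 16-bit bfloat16 pattern back to a Python float."""
    # bfloat16 is the upper half of a float32, so pad the lower half with zeros.
    (x,) = struct.unpack("<f", struct.pack("<I", (b & 0xFFFF) << 16))
    return x


def round_to_bfloat16(x: float) -> float:
    """Round x to the nearest value representable in bfloat16."""
    return bfloat16_bits_to_float(float_to_bfloat16_bits(x))


if __name__ == "__main__":
    for value in (1.0, 3.14159, 0.1):
        rounded = round_to_bfloat16(value)
        print(f"{value!r} -> {rounded!r} (abs error {abs(value - rounded):.2e})")
```

With only 7 mantissa bits, bfloat16 keeps the full float32 exponent range but only about two to three significant decimal digits, which is the precision given up in exchange for halving memory use relative to float32.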
Model Capabilities
Human body part segmentation
High-resolution image processing
Multi-class semantic segmentation
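Multi-class semantic segmentation typically ends with a per-pixel argmax over class scores. The sketch below shows that step in plain Python, assuming the model produces one score map per class laid out as `scores[class][row][column]`; that layout, the helper names, and the toy 3-class input are assumptions for illustration and are not taken from the Sapiens code. In the real model there would be 28 score maps.

```python
from typing import List, Sequence

NUM_CLASSES = 28  # number of body-part classes stated in the model description


def argmax_label_map(scores: Sequence[Sequence[Sequence[float]]]) -> List[List[int]]:
    """Turn per-class score maps scores[c][y][x] into a label map labels[y][x]."""
    num_classes = len(scores)
    height = len(scores[0])
    width = len(scores[0][0])
    labels = []
    for y in range(height):
        row = []
        for x in range(width):
            # Pick the class with the highest score; ties go to the lowest index.
            best = max(range(num_classes), key=lambda c: scores[c][y][x])
            row.append(best)
        labels.append(row)
    return labels


def class_pixel_counts(labels: Sequence[Sequence[int]], num_classes: int) -> List[int]:
    """Count how many pixels were assigned to each class."""
    counts = [0] * num_classes
    for row in labels:
        for label in row:
            counts[label] += 1
    return counts


if __name__ == "__main__":
    # Toy example: 3 classes on a 2x2 image (hypothetical scores).
    toy_scores = [
        [[0.9, 0.1], [0.2, 0.3]],  # class 0
        [[0.05, 0.7], [0.2, 0.3]],  # class 1
        [[0.05, 0.2], [0.6, 0.4]],  # class 2
    ]
    labels = argmax_label_map(toy_scores)
    print(labels)
    print(class_pixel_counts(labels, 3))
```

In practice this step would run on tensors rather than nested lists; the loop form is used here only to keep the logic explicit and dependency-free.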
Use Cases
Medical Imaging
Surgical Planning Assistance
Segments human anatomy precisely before surgery
Provides accurate segmentation results for 28 body parts
Virtual Reality
Virtual Avatar Creation
Generates high-fidelity body-part segmentation for virtual characters
Supports realistic body-part recognition for virtual avatars