Sapiens Depth 1b Torchscript
Sapiens is a vision transformer series model pre-trained on 300 million 1024 x 1024 resolution human images, focusing on human-centric vision tasks.
Downloads 160
Release Time : 9/9/2024
Model Overview
This model is used to estimate relative depth in human images, supports 1K high-resolution inference, and demonstrates outstanding generalization capabilities on real-world data.
Model Features
High-resolution support
Natively supports 1K high-resolution inference, suitable for high-quality image processing.
Outstanding generalization capability
Demonstrates excellent generalization performance on real-world data even with scarce or completely synthetic labeled data.
Large-scale pre-training
Pre-trained on 300 million human images, equipped with powerful feature extraction capabilities.
Model Capabilities
Human image depth estimation
High-resolution image processing
Visual feature extraction
Use Cases
Computer vision
Human depth perception
Used to estimate relative depth information of various body parts in human images
Can generate precise depth maps
Virtual reality applications
Provides depth information support for character modeling in VR/AR systems
Featured Recommended AI Models
Š 2025AIbase