Sapiens Depth 1b Bfloat16
Sapiens is a vision Transformer model pre-trained on 300 million 1024x1024 resolution portrait images, focusing on human-centric vision tasks.
Downloads 37
Release Time : 9/10/2024
Model Overview
This model is used for relative depth estimation of portrait images, supports 1K high-resolution inference, and demonstrates exceptional generalization capabilities on real data even when labeled data is scarce or entirely synthetic.
Model Features
High-Resolution Support
Native support for 1K high-resolution inference, with image sizes up to 1024x768.
Large-Scale Pre-Training
Pre-trained on 300 million 1024x1024 resolution portrait images.
Exceptional Generalization
Demonstrates exceptional generalization capabilities on real data even when labeled data is scarce or entirely synthetic.
Model Capabilities
Portrait Image Depth Estimation
High-Resolution Image Processing
Use Cases
Computer Vision
Portrait Depth Estimation
Used to estimate the relative depth information of portrait images.
Demonstrates exceptional generalization capabilities on real data.
Featured Recommended AI Models
Š 2025AIbase