
Sapiens Depth 1b Bfloat16

Developed by facebook
Sapiens is a vision Transformer model pre-trained on 300 million portrait images at 1024x1024 resolution and designed for human-centric vision tasks.
Downloads 37
Release Time: 9/10/2024

Model Overview

This model estimates relative depth in portrait images. It supports 1K high-resolution inference and generalizes well to real data, even when labeled data is scarce or entirely synthetic.
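
Relative depth means the predicted values are only meaningful in relation to each other, not as metric distances. As an illustration, the sketch below min-max scales a raw depth map to the range [0, 1], which is a common step before visualization. It does not load or call the Sapiens model; the function name `normalize_relative_depth`, the optional foreground `mask` argument, and the sample array are hypothetical and are not taken from the model's own tooling.

```python
import numpy as np


def normalize_relative_depth(depth, mask=None):
    """Min-max scale a raw relative depth map to [0, 1].

    `mask` (hypothetical) marks the pixels to include, e.g. the person
    in the image. Pixels outside the mask are set to 0.
    """
    depth = np.asarray(depth, dtype=np.float32)
    if mask is None:
        mask = np.ones(depth.shape, dtype=bool)
    else:
        mask = np.asarray(mask, dtype=bool)

    out = np.zeros_like(depth)
    if not mask.any():
        return out  # nothing to normalize

    valid = depth[mask]
    lo, hi = float(valid.min()), float(valid.max())
    if hi > lo:  # avoid dividing by zero on a constant map
        out[mask] = (valid - lo) / (hi - lo)
    return out


# Stand-in for a model output; real predictions would come from the model.
raw = np.array([[2.0, 4.0], [6.0, 10.0]])
print(normalize_relative_depth(raw))
```

Because only the ordering and spacing of values carry information, the same normalized map results if the raw output is shifted or multiplied by a positive constant.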

Model Features

High-Resolution Support
Native support for 1K high-resolution inference, with image sizes up to 1024x768.
Large-Scale Pre-Training
Pre-trained on 300 million 1024x1024 resolution portrait images.
Exceptional Generalization
Generalizes well to real data even when labeled data is scarce or entirely synthetic.
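
To make the size limit concrete, here is a minimal sketch that computes the dimensions an image would need to be scaled to in order to fit within 1024x768 while keeping its aspect ratio. It assumes that 1024 is the height and 768 is the width, which this page does not state, and the helper name `fit_within` is hypothetical. How the model's own preprocessing handles resizing is not described here.

```python
def fit_within(width, height, max_width=768, max_height=1024):
    """Return (width, height) scaled down to fit the given limits.

    Assumption: the 1024x768 limit is height x width. Images that
    already fit are returned unchanged (no upscaling).
    """
    scale = min(max_width / width, max_height / height, 1.0)
    new_width = max(1, round(width * scale))
    new_height = max(1, round(height * scale))
    return new_width, new_height


print(fit_within(3000, 4000))  # a 3:4 photo larger than the limit
print(fit_within(600, 800))    # already within the limit
```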

Model Capabilities

Portrait Image Depth Estimation
High-Resolution Image Processing

Use Cases

Computer Vision
Portrait Depth Estimation
Estimates the relative depth of portrait images.
Demonstrates exceptional generalization capabilities on real data.