Sapiens Seg Foreground 1b Torchscript
Sapiens is a vision transformer model pre-trained on 300 million high-resolution human images, specifically designed for foreground person segmentation tasks.
Downloads 25
Release Time : 9/9/2024
Model Overview
This model is used to segment foreground figures from images, supports 1K high-resolution inference, and demonstrates outstanding generalization capabilities in real-world scenarios.
Model Features
High-resolution support
Natively supports 1K high-resolution inference with image dimensions up to 1024 x 768.
Large-scale pre-training
Pre-trained on 300 million human images at 1024 x 1024 resolution.
Exceptional generalization
Demonstrates excellent generalization on real data even with scarce annotations or fully synthetic conditions.
Model Capabilities
Foreground person segmentation
High-resolution image processing
Use Cases
Image editing
Person-background separation
Precisely separates foreground figures from the background in images.
Generates high-quality foreground segmentation results
Virtual reality
Avatar creation
Used to create virtual avatars based on real people.
Featured Recommended AI Models
Š 2025AIbase