Sapiens Pose 0.6b
Sapiens is a family of vision Transformer models pre-trained on 300 million high-resolution human images, focusing on human-centric vision tasks.
Downloads 19
Release Time : 9/18/2024
Model Overview
Pose-Sapiens-0.6B is a vision Transformer model for pose estimation, supporting the estimation of 308 keypoints (body + face + hands + feet) on a single image.
Model Features
High-resolution support
Native support for 1K high-resolution inference, with image sizes up to 1024 x 768.
Outstanding generalization capability
Demonstrates excellent generalization to real-world data even with scarce labeled data or fully synthetic scenarios.
Multi-keypoint detection
Supports estimation of 308 keypoints across body, face, hands, and feet.
Model Capabilities
Human pose estimation
Facial keypoint detection
Hand keypoint detection
Foot keypoint detection
Use Cases
Computer vision
Human pose analysis
Used for human pose estimation in scenarios such as sports analysis and fitness coaching.
Virtual reality
Provides precise human pose data for virtual reality applications.
Featured Recommended AI Models
Š 2025AIbase