Sapiens Pose 0.6b Torchscript
Sapiens is a vision transformer model pre-trained on 300 million high-resolution human images. This 0.6B-parameter TorchScript export is designed for pose estimation and detects 308 keypoints.
Downloads 29
Release Time: 9/18/2024
Model Overview
This high-precision pose estimation model detects 308 keypoints across the body, face, hands, and feet, making it suitable for a range of human-centric vision tasks.
Model Features
High-resolution support
Natively supports 1024x1024 input resolution, which suits applications that require high-precision pose estimation.
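Feeding a fixed-size input usually means fitting an arbitrary photo into that frame and later mapping predictions back to the original pixels. The sketch below shows one common way to do that (an aspect-preserving "letterbox" fit). It is an illustration only: the function names are hypothetical, and whether this export expects letterboxing, a plain resize, or a person crop from a detector is an assumption that should be checked against the checkpoint's own documentation.

```python
def letterbox_params(src_w, src_h, dst_w=1024, dst_h=1024):
    """Compute an aspect-preserving fit of a src_w x src_h image into dst_w x dst_h.

    Returns (scale, pad_x, pad_y, new_w, new_h). The 1024x1024 default mirrors
    the resolution quoted above; it is an assumption, not a verified input spec.
    """
    if src_w <= 0 or src_h <= 0:
        raise ValueError("source dimensions must be positive")
    # Use the smaller ratio so the whole image fits inside the target frame.
    scale = min(dst_w / src_w, dst_h / src_h)
    new_w = round(src_w * scale)
    new_h = round(src_h * scale)
    # Split the leftover space evenly so the image is centred.
    pad_x = (dst_w - new_w) // 2
    pad_y = (dst_h - new_h) // 2
    return scale, pad_x, pad_y, new_w, new_h


def to_source_coords(x, y, scale, pad_x, pad_y):
    """Map a point from the padded model input back to original image pixels."""
    return (x - pad_x) / scale, (y - pad_y) / scale


# Example: a 1920x1080 frame fitted into a 1024x1024 input.
scale, pad_x, pad_y, new_w, new_h = letterbox_params(1920, 1080)
centre = to_source_coords(512, 512, scale, pad_x, pad_y)
```

Keeping the scale and padding alongside each frame makes it straightforward to report keypoints in the coordinate system of the original image rather than the model input.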
Multi-part keypoint detection
Simultaneously detects 308 keypoints across the body, face, hands, and feet.
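Pose models of this kind commonly emit one heatmap per keypoint, with the brightest cell marking the predicted location. The following is a minimal sketch of that decoding step, assuming the output is a list of 2-D heatmaps (one per keypoint) represented as nested Python lists. The output format of this particular export is not described here, so treat the layout and the hypothetical `decode_heatmaps` helper as assumptions.

```python
def decode_heatmaps(heatmaps, input_w, input_h):
    """Turn K heatmaps into K (x, y, score) tuples in input-pixel coordinates.

    heatmaps: list of K non-empty 2-D lists (rows of floats). The assumed
    layout (one map per keypoint) is illustrative, not a documented contract.
    """
    keypoints = []
    for hm in heatmaps:
        rows, cols = len(hm), len(hm[0])
        best_score, best_r, best_c = hm[0][0], 0, 0
        # Plain argmax over the grid.
        for r, row in enumerate(hm):
            for c, value in enumerate(row):
                if value > best_score:
                    best_score, best_r, best_c = value, r, c
        # Map the centre of the winning cell back to input pixels.
        x = (best_c + 0.5) * input_w / cols
        y = (best_r + 0.5) * input_h / rows
        keypoints.append((x, y, best_score))
    return keypoints


# Example: a single 4x4 heatmap whose peak sits at row 1, column 2.
demo = [[
    [0.0, 0.1, 0.0, 0.0],
    [0.0, 0.2, 0.9, 0.1],
    [0.0, 0.0, 0.1, 0.0],
    [0.0, 0.0, 0.0, 0.0],
]]
points = decode_heatmaps(demo, 1024, 1024)
```

With 308 keypoints the same loop would simply run over 308 maps; the score can be thresholded to discard low-confidence points.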
Strong generalization capability
Generalizes well to real-world data even when labeled training data is scarce or entirely synthetic.
Model Capabilities
Human pose estimation
Facial keypoint detection
Hand keypoint detection
Foot keypoint detection
Use Cases
Human-computer interaction
Virtual reality control
Captures precise human motion in VR environments, enabling high-precision full-body motion tracking.
Sports analysis
Athlete movement analysis
Analyzes athletes' postures and techniques, using the 308 detected keypoints for detailed analysis.
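As an illustration of how detected keypoints might feed a movement analysis, the sketch below computes the angle at a joint from three 2-D points, for example hip, knee, and ankle. The helper name `joint_angle` is hypothetical, and which of the 308 keypoint indices correspond to which joints is not given here, so the example uses made-up coordinates.

```python
import math


def joint_angle(a, b, c):
    """Return the angle at point b, in degrees, formed by the segments b-a and b-c."""
    v1 = (a[0] - b[0], a[1] - b[1])
    v2 = (c[0] - b[0], c[1] - b[1])
    n1 = math.hypot(v1[0], v1[1])
    n2 = math.hypot(v2[0], v2[1])
    if n1 == 0 or n2 == 0:
        raise ValueError("joint angle is undefined for coincident points")
    cos_theta = (v1[0] * v2[0] + v1[1] * v2[1]) / (n1 * n2)
    # Clamp to guard against tiny floating-point overshoots outside [-1, 1].
    cos_theta = max(-1.0, min(1.0, cos_theta))
    return math.degrees(math.acos(cos_theta))


# Made-up pixel coordinates standing in for hip, knee, and ankle.
hip, knee, ankle = (0.0, 1.0), (0.0, 0.0), (1.0, 0.0)
knee_angle = joint_angle(hip, knee, ankle)
```

Tracking such angles frame by frame gives a simple, interpretable signal for comparing technique across repetitions or athletes.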