Open-source Sapiens-pose-1b-torchscript Model - A Great Helper for High-precision Human Pose Estimation

Sapiens Pose 1b Torchscript

Developed by facebook

Sapiens is a vision Transformer model pre-trained on 300 million 1024x1024 resolution human images, specifically designed for high-precision pose estimation tasks.

Pose Estimation English#High-resolution pose estimation #Full-body multi-part keypoints #Billion-parameter large model

Downloads 1,245

Release Time : 9/9/2024

Model Overview

This model is used for estimating 308 keypoints on a single image, covering the body, face, hands, and feet, supporting 1K high-resolution inference with excellent generalization capabilities.

Model Features

High-resolution support

Natively supports 1K high-resolution inference with an input image size of 1024x768.

Multi-part keypoint detection

Can simultaneously detect 308 keypoints covering the body, face, hands, and feet.

Strong generalization capability

Demonstrates excellent generalization to real data even with scarce labeled data or fully synthetic scenarios.

Efficient computation

Computational cost of 4.647 trillion floating-point operations, balancing accuracy and efficiency.

Model Capabilities

Human pose estimation

Facial keypoint detection

Hand keypoint detection

Foot keypoint detection

High-resolution image processing

Use Cases

Sports analysis

Athlete motion analysis

Used to analyze athletes' motion postures to help improve technical movements.

Can accurately capture 308 full-body keypoints

Human-computer interaction

Gesture recognition

Used to recognize complex hand gestures for natural human-computer interaction.

High-precision hand keypoint detection

Virtual reality

Virtual avatar driving

Used for real-time driving of virtual avatars to achieve realistic motion capture.

Real-time full-body pose estimation

Property	Details
Image Size	1024 x 768 (H x W)
Num Parameters	1.169 B
FLOPs	4.647 TFLOPs
Patch Size	16 x 16
Embedding Dimensions	1536
Num Layers	40
Num Heads	24
Feedforward Channels	6144

Featured Recommended AI Models

Empowering the Future, Your AI Solution Knowledge Base

English 简体中文繁體中文にほんご

Sapiens Pose 1b Torchscript

Model Overview

Model Features

Model Capabilities

Use Cases

🚀 Pose-Sapiens-1B-Torchscript

🚀 Quick Start

✨ Features

📚 Documentation

Model Details

Model Card

More Resources

📄 License