Sapiens-pose-1b-bfloat16 Open-source Vision Model - Free Deployment to Boost Human-centered Vision Tasks

Sapiens Pose 1b Bfloat16

Developed by facebook

Sapiens is a family of vision transformer models pre-trained on 300 million human images at 1024x1024 resolution and designed for human-centric vision tasks.

Pose Estimation | English | #High-resolution pose estimation #Full-body keypoint detection #ViT large model

Downloads 31

Release Time: 9/10/2024

Model Overview

This model estimates 308 keypoints (body, face, hands, and feet) in a single image, supports inference at 1K resolution, and generalizes well to real-world data.

Model Features

High-resolution support

Natively supports 1K high-resolution inference at an input size of 1024 x 768 (H x W).

Large-scale pre-training

Pre-trained on 300 million human images, which gives the model strong feature extraction capabilities.

Multi-keypoint detection

Capable of detecting 308 keypoints simultaneously for the body, face, hands, and feet.
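As an illustration of what multi-keypoint detection involves downstream, the sketch below assumes the model emits one 2D score map (heatmap) per keypoint, which is a common design for pose estimators but is not stated on this page. The function name `decode_heatmaps` and the toy input are hypothetical; the real model would yield 308 maps rather than 2.

```python
from typing import List, Tuple

Heatmap = List[List[float]]  # one 2D score map, indexed [row][col]


def decode_heatmaps(heatmaps: List[Heatmap]) -> List[Tuple[int, int, float]]:
    """Return the (x, y, score) of the highest-scoring cell in each heatmap.

    Assumption: one heatmap per keypoint; the peak marks the keypoint location.
    """
    keypoints = []
    for hm in heatmaps:
        best = (0, 0, float("-inf"))
        for y, row in enumerate(hm):
            for x, score in enumerate(row):
                if score > best[2]:
                    best = (x, y, score)
        keypoints.append(best)
    return keypoints


# Toy input: 2 keypoints on a 3x4 grid (hypothetical values).
toy = [
    [[0.0, 0.1, 0.0, 0.0],
     [0.0, 0.9, 0.2, 0.0],
     [0.0, 0.0, 0.0, 0.0]],
    [[0.0, 0.0, 0.0, 0.0],
     [0.0, 0.0, 0.0, 0.0],
     [0.0, 0.0, 0.3, 0.7]],
]
decoded = decode_heatmaps(toy)
print(decoded)
```

The score can serve as a per-keypoint confidence, so low-scoring points (for example occluded fingers) can be filtered before further analysis.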

Exceptional generalization

Generalizes well to real-world data even when labeled data is scarce or the training data is fully synthetic.

Model Capabilities

Human pose estimation

Facial keypoint detection

Hand keypoint detection

Foot keypoint detection

Use Cases

Computer vision

Human pose analysis

Used for human pose estimation in scenarios like motion analysis and fitness guidance.

Detects 308 keypoints, providing detailed human pose information.

Virtual reality

Enables precise human motion capture in VR/AR applications.

High-precision keypoint detection enhances virtual reality experiences.

Healthcare

Rehabilitation training monitoring

Monitors whether patients perform their rehabilitation exercises correctly.
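One simple way to check a movement from detected keypoints is to measure the angle at a joint. The sketch below is a minimal illustration, not part of the model: the function `joint_angle`, the hip/knee/ankle coordinates, and the 80 to 100 degree target range are all hypothetical, and the keypoints are assumed to be (x, y) pixel positions.

```python
import math
from typing import Tuple

Point = Tuple[float, float]  # (x, y) pixel coordinates of a keypoint


def joint_angle(a: Point, b: Point, c: Point) -> float:
    """Return the angle in degrees at point b formed by segments b->a and b->c."""
    v1 = (a[0] - b[0], a[1] - b[1])
    v2 = (c[0] - b[0], c[1] - b[1])
    norm = math.hypot(*v1) * math.hypot(*v2)
    if norm == 0:
        raise ValueError("joint angle is undefined for coincident points")
    cos_theta = (v1[0] * v2[0] + v1[1] * v2[1]) / norm
    # Clamp to guard against floating-point drift outside [-1, 1].
    cos_theta = max(-1.0, min(1.0, cos_theta))
    return math.degrees(math.acos(cos_theta))


# Hypothetical keypoints for one leg.
hip, knee, ankle = (100.0, 100.0), (100.0, 200.0), (200.0, 200.0)
angle = joint_angle(hip, knee, ankle)

# Hypothetical target range for the exercise being monitored.
within_target = 80.0 <= angle <= 100.0
print(round(angle, 1), within_target)
```

Angles computed from 2D keypoints depend on the camera viewpoint, so a real monitoring setup would need a consistent camera position or 3D information.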

Property	Details
Image Size	1024 x 768 (H x W)
Num Parameters	1.169 B
FLOPs	4.647 TFLOPs
Patch Size	16 x 16
Embedding Dimensions	1536
Num Layers	40
Num Heads	24
Feedforward Channels	6144
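The figures in the table can be cross-checked with a little arithmetic. The sketch below derives the token count, per-head dimension, and a rough encoder weight count from the listed values. It assumes a standard transformer block (four d x d attention projections plus a two-layer feedforward network, biases and norms ignored) and 2 bytes per parameter for bfloat16; neither assumption is stated in the table.

```python
# Values copied from the table above.
image_h, image_w = 1024, 768
patch = 16
embed_dim = 1536
num_layers = 40
num_heads = 24
ffn_dim = 6144
num_params = 1.169e9

# Patch grid: each 16 x 16 patch becomes one token.
tokens_h = image_h // patch          # 64
tokens_w = image_w // patch          # 48
num_tokens = tokens_h * tokens_w     # 3072

head_dim = embed_dim // num_heads    # 64
ffn_ratio = ffn_dim // embed_dim     # 4

# Assumed standard block: Q, K, V, and output projections (4 * d * d)
# plus two feedforward matrices (2 * d * ffn_dim); biases and norms ignored.
per_layer = 4 * embed_dim**2 + 2 * embed_dim * ffn_dim
encoder_weights = per_layer * num_layers  # 1_132_462_080

# bfloat16 stores each parameter in 2 bytes.
bf16_gb = num_params * 2 / 1e9       # about 2.34 GB of weights

print(num_tokens, head_dim, ffn_ratio, encoder_weights, round(bf16_gb, 3))
```

The estimate of about 1.13 billion encoder weights is close to the listed 1.169 B; the gap is plausibly accounted for by embeddings, biases, normalization layers, and the pose head, though this page does not break the total down.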
