
Sapiens Pretrain 1B (bfloat16)

Developed by: facebook (Meta)
Sapiens is a vision Transformer pre-trained on 300 million human images at 1024×1024 resolution, supporting native high-resolution inference and strong generalization to real-world scenarios.
Downloads: 23
Release date: 9/10/2024

Model Overview

This model is a pre-trained vision Transformer designed for human-centric visual tasks. It generalizes remarkably well to real data even when annotations are scarce or the training conditions are fully synthetic.

Model Features

High-resolution support: natively processes 1024×1024 images with a 16×16 patch size (see the patch-grid sketch below).
Large-scale pre-training: pre-trained on 300 million human images, giving it strong feature-extraction capabilities.
Real-world generalization: performs exceptionally well on real data even when annotations are scarce or the training data is fully synthetic.
Efficient computation: distributed in the bfloat16 format, with approximately 4.647 TFLOPs (trillion floating-point operations) per forward pass.
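As a quick check on the figures above, the short PyTorch sketch below works out the patch grid implied by 1024×1024 inputs and 16×16 patches and allocates a bfloat16 input tensor at the native resolution. It does not load the actual checkpoint, and PyTorch itself is an assumed dependency.

```python
import torch

# Patch-grid arithmetic implied by the numbers above: a 1024x1024 input split
# into 16x16 patches gives a 64x64 grid, i.e. 4096 patch tokens per image.
IMG_SIZE, PATCH_SIZE = 1024, 16
grid = IMG_SIZE // PATCH_SIZE            # 64 patches per side
num_tokens = grid * grid                 # 4096 tokens seen by the encoder
print(f"{grid}x{grid} patch grid -> {num_tokens} tokens")

# A bfloat16 input tensor at the native resolution, matching the published
# weight format; no checkpoint is loaded here.
dummy = torch.randn(1, 3, IMG_SIZE, IMG_SIZE, dtype=torch.bfloat16)
print(dummy.dtype, tuple(dummy.shape))
```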

Model Capabilities

High-resolution image processing
Human image feature extraction
Visual representation learning
Transfer learning
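For the feature-extraction and representation-learning capabilities listed above, loading and inference might look roughly like the sketch below. It assumes the encoder is available as a TorchScript export saved locally; the file name, the image path, the ImageNet normalization statistics, and the shape of the returned features are all illustrative assumptions rather than details taken from this card.

```python
import torch
from PIL import Image
from torchvision import transforms

# Hypothetical local path to a TorchScript export of the Sapiens-1B encoder;
# adjust to wherever your checkpoint actually lives.
CHECKPOINT = "sapiens_pretrain_1b_bfloat16_torchscript.pt"

# 1024x1024 is the native pretraining resolution; the normalization statistics
# below are standard ImageNet values and are an assumption, not from the card.
preprocess = transforms.Compose([
    transforms.Resize((1024, 1024)),
    transforms.ToTensor(),
    transforms.Normalize(mean=[0.485, 0.456, 0.406],
                         std=[0.229, 0.224, 0.225]),
])

model = torch.jit.load(CHECKPOINT).eval()

image = Image.open("person.jpg").convert("RGB")
x = preprocess(image).unsqueeze(0).to(torch.bfloat16)

with torch.inference_mode():
    features = model(x)    # expected: dense patch embeddings / feature map

print(tuple(features.shape))
```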

Use Cases

Computer vision
Human pose estimation: leverages pre-trained features for human pose recognition and maintains high accuracy even with limited annotated data (see the transfer-learning sketch after this section).
Virtual avatar generation: used to generate realistic virtual human avatars, improving the realism and detail of the results.
Medical imaging
Medical image analysis: applied to feature extraction from X-ray, MRI, and other medical images, providing useful feature representations even with limited data.
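To illustrate the transfer-learning use case referenced above, the sketch below trains a small keypoint head on top of frozen encoder features (a linear probe). The embedding width of 1536, the 17-keypoint target format, and the mean-pooled readout are illustrative assumptions, not specifications from this card.

```python
import torch
import torch.nn as nn

# Linear-probe sketch: keep the pretrained Sapiens encoder frozen and train only
# a small keypoint head on its features. `encoder` stands in for the loaded
# backbone (see the loading sketch above). The embedding width (1536), the
# 17-keypoint format, and the mean-pooled readout are illustrative assumptions.
EMBED_DIM, NUM_KEYPOINTS = 1536, 17

class KeypointHead(nn.Module):
    def __init__(self, dim: int, num_keypoints: int):
        super().__init__()
        self.proj = nn.Linear(dim, num_keypoints * 2)    # (x, y) per keypoint

    def forward(self, tokens: torch.Tensor) -> torch.Tensor:
        pooled = tokens.mean(dim=1)                      # average over patch tokens
        return self.proj(pooled).view(-1, NUM_KEYPOINTS, 2)

head = KeypointHead(EMBED_DIM, NUM_KEYPOINTS)
optimizer = torch.optim.AdamW(head.parameters(), lr=1e-4)
criterion = nn.MSELoss()

def train_step(encoder: nn.Module, images: torch.Tensor, targets: torch.Tensor) -> float:
    """One optimization step; `targets` has shape (batch, NUM_KEYPOINTS, 2)."""
    with torch.no_grad():            # the encoder stays frozen
        tokens = encoder(images)     # assumed shape: (batch, num_tokens, EMBED_DIM)
    preds = head(tokens.float())
    loss = criterion(preds, targets)
    optimizer.zero_grad()
    loss.backward()
    optimizer.step()
    return loss.item()
```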