D

Dpt Swinv2 Tiny 256

Developed by Intel
DPT model based on SwinV2 backbone network for monocular depth estimation, trained on 1.4 million images.
Downloads 2,285
Release Time : 12/10/2023

Model Overview

This model is part of MiDaS version 3.1, using SwinV2 transformer as the backbone network, focusing on estimating depth information from a single image. Suitable for generative AI, 3D reconstruction, autonomous driving and other fields.

Model Features

Based on SwinV2 backbone network
Uses SwinV2 transformer as the backbone network, combining the advantages of hierarchical transformers to improve the efficiency and accuracy of depth estimation.
Large-scale training data
Trained on 1.4 million images, covering various scenarios, enhancing the model's generalization capability.
Zero-shot transfer capability
Supports zero-shot transfer, allowing application in new scenarios without fine-tuning.

Model Capabilities

Monocular depth estimation
Zero-shot transfer
Image depth analysis

Use Cases

Computer vision
3D reconstruction
Estimates depth information from a single image for 3D scene reconstruction.
Autonomous driving
Used for environmental perception and obstacle detection in autonomous driving systems.
Generative AI
Provides depth information for generative AI, enhancing the realism of image generation.
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase