# Monocular Depth Estimation

Distill Any Depth Small Hf
MIT
Distill-Any-Depth is a state-of-the-art monocular depth estimation model trained with knowledge distillation, capable of efficient and accurate depth estimation.
3D Vision Transformers
xingyang1
1,214
3
Depthpro ONNX
DepthPro is a vision model for depth estimation, capable of predicting scene depth information from a single image.
3D Vision Transformers
onnx-community
146
10
Marigold E2e Ft Depth
Apache-2.0
A monocular depth estimation model released under the Apache-2.0 license, suitable for zero-shot depth estimation on in-the-wild scenes.
3D Vision
GonzaloMG
1,467
6
Depth
Depth Anything V2 is described by its authors as their most capable monocular depth estimation model to date, trained on 595,000 labeled synthetic images and over 62 million unlabeled real images, with finer details and greater robustness than V1.
3D Vision Transformers
scenario-labs
75
0
Depth Anything V2 Large Hf
Depth Anything V2 is described by its authors as their most capable monocular depth estimation (MDE) model to date, trained on 595,000 labeled synthetic images and over 62 million unlabeled real images, with finer details and greater robustness than V1.
3D Vision Transformers
depth-anything
83.99k
19
Depth Anything V2 Small Hf
Apache-2.0
Depth Anything V2 is described by its authors as their most capable monocular depth estimation model to date, trained on 595,000 labeled synthetic images and over 62 million unlabeled real images, with finer details and greater robustness than V1.
3D Vision Transformers
depth-anything
438.72k
15
Depth Anything V2 Base
Depth-Anything-V2-Base is an ONNX-format depth estimation model adapted for Transformers.js, designed for image depth estimation on the web.
3D Vision Transformers
onnx-community
56
0
Depth Anything V2 Base
Depth Anything V2 is described by its authors as their most capable monocular depth estimation (MDE) model to date, trained on 595,000 labeled synthetic images and over 62 million unlabeled real images.
3D Vision English
depth-anything
66.95k
17
Zoedepth Kitti
MIT
ZoeDepth is a vision model for monocular depth estimation, fine-tuned on the KITTI dataset, capable of achieving zero-shot transfer for metric depth estimation.
3D Vision Transformers
Intel
7,037
2
Marigold Depth Lcm V1 0
Apache-2.0
A monocular depth estimation model fine-tuned using latent consistency distillation to generate depth maps from single images.
3D Vision English
prs-eth
22.45k
55
Depth Anything Large Hf
An ONNX version of the Depth Anything Large depth estimation model, packaged for use with Transformers.js and suitable for web applications.
3D Vision Transformers
Xenova
19
3
Depth Anything Vitb14
Depth Anything is a depth estimation model trained on large-scale unlabeled data, capable of predicting depth information from a single image.
3D Vision Transformers
LiheYoung
7,152
3
Dpt Beit Base 384
MIT
DPT is a dense prediction transformer model based on the BEiT backbone network, designed for monocular depth estimation and trained on 1.4 million images.
3D Vision Transformers
Intel
25.98k
1