Model Selection

Monocular Depth Estimation

# Monocular Depth Estimation

Distill Any Depth Small Hf

Distill-Any-Depth is a SOTA monocular depth estimation model trained based on knowledge distillation algorithms, capable of efficient and accurate depth estimation.

DepthPro is a vision model for depth estimation, capable of predicting scene depth information from a single image.

Marigold E2e Ft Depth

A monocular depth estimation model based on the Apache-2.0 license, suitable for zero-shot depth estimation tasks in wild scenes.

Depth Anything V2 is currently the most powerful monocular depth estimation model, trained on 595,000 synthetically annotated images and over 62 million real unlabeled images, offering finer details and stronger robustness.

Depth Anything V2 Large Hf

Depth Anything V2 is currently the most powerful Monocular Depth Estimation (MDE) model, trained on 595,000 synthetically annotated images and over 62 million real unlabeled images, offering finer details and stronger robustness.

Depth Anything V2 Small Hf

Depth Anything V2 is currently the most powerful monocular depth estimation model, trained on 595,000 synthetically annotated images and over 62 million real unlabeled images, featuring fine details and robustness.

Depth Anything V2 Base

Depth-Anything-V2-Base is an ONNX-format depth estimation model adapted for Transformers.js, designed for image depth estimation on the web.

Depth Anything V2 Base

Depth Anything V2 is currently the most powerful monocular depth estimation (MDE) model, trained on 595,000 synthetically annotated images and over 62 million real unannotated images.

3D Vision English

ZoeDepth is a vision model for monocular depth estimation, fine-tuned on the KITTI dataset, capable of achieving zero-shot transfer for metric depth estimation.

Marigold Depth Lcm V1 0

A monocular depth estimation model fine-tuned using latent consistency distillation for generating depth maps from single images

3D Vision English

Depth Anything Large Hf

ONNX version of depth estimation model based on Transformers.js, suitable for web applications

Depth Anything Vitb14

Depth Anything is a depth estimation model trained on large-scale unlabeled data, capable of predicting depth information from a single image.

Dpt Beit Base 384

DPT is a dense prediction transformer model based on the BEiT backbone network, designed for monocular depth estimation and trained on 1.4 million images.

Featured Recommended AI Models

AIbase

Empowering the Future, Your AI Solution Knowledge Base

English 简体中文繁體中文にほんご

© 2025AIbase