🚀 Distill Any Depth Large - Transformers Version
Distill-Any-Depth is a new monocular depth estimation model trained with our proposed knowledge-distillation algorithm, and it achieves state-of-the-art performance. The model was introduced in the paper Distill Any Depth: Distillation Creates a Stronger Monocular Depth Estimator. This checkpoint is compatible with the transformers library, and an online demo is also available.
🚀 Quick Start
The model can be used for zero-shot depth estimation; the examples below show how.
💻 Usage Examples
Basic Usage
Depth estimation with the pipeline API:
from transformers import pipeline
from PIL import Image
import requests

# load the depth-estimation pipeline with the distilled checkpoint
pipe = pipeline(task="depth-estimation", model="xingyang1/Distill-Any-Depth-Large-hf")

# fetch an example image from the COCO validation set
url = "http://images.cocodataset.org/val2017/000000039769.jpg"
image = Image.open(requests.get(url, stream=True).raw)

# run inference; the result dict holds the depth map as a PIL image under "depth"
depth = pipe(image)["depth"]
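Continuing from the snippet above, the "depth" entry is a standard PIL image, so it can be saved or inspected directly. A minimal follow-up sketch (the output filename is just an illustrative choice):

# "depth" is a PIL image, so it can be written straight to disk
depth.save("depth.png")
print(depth.size)  # (width, height) of the predicted depth map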
Advanced Usage
Depth estimation with the model and processor classes:
from transformers import AutoImageProcessor, AutoModelForDepthEstimation
import torch
import numpy as np
from PIL import Image
import requests

# fetch an example image from the COCO validation set
url = "http://images.cocodataset.org/val2017/000000039769.jpg"
image = Image.open(requests.get(url, stream=True).raw)

# load the image processor and the depth estimation model
image_processor = AutoImageProcessor.from_pretrained("xingyang1/Distill-Any-Depth-Large-hf")
model = AutoModelForDepthEstimation.from_pretrained("xingyang1/Distill-Any-Depth-Large-hf")

# preprocess the image and run inference without tracking gradients
inputs = image_processor(images=image, return_tensors="pt")
with torch.no_grad():
    outputs = model(**inputs)

# interpolate the raw prediction back to the original image resolution
post_processed_output = image_processor.post_process_depth_estimation(
    outputs,
    target_sizes=[(image.height, image.width)],
)

# normalize the depth map to [0, 255] and convert it to an 8-bit PIL image
predicted_depth = post_processed_output[0]["predicted_depth"]
depth = (predicted_depth - predicted_depth.min()) / (predicted_depth.max() - predicted_depth.min())
depth = depth.detach().cpu().numpy() * 255
depth = Image.fromarray(depth.astype("uint8"))
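For faster inference, the same code can run on a GPU. A minimal sketch, continuing from the snippet above and assuming a CUDA-enabled PyTorch build; the `device` variable is introduced here for illustration:

# move the model and the preprocessed inputs to the GPU when one is available
device = torch.device("cuda" if torch.cuda.is_available() else "cpu")
model = model.to(device)
inputs = {k: v.to(device) for k, v in inputs.items()}
with torch.no_grad():
    outputs = model(**inputs)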
📄 License
This project is released under the MIT License.
📚 Detailed Documentation
If you find this project useful, please consider citing:
@article{he2025distill,
  title   = {Distill Any Depth: Distillation Creates a Stronger Monocular Depth Estimator},
  author  = {Xiankang He and Dongyan Guo and Hongji Li and Ruibo Li and Ying Cui and Chi Zhang},
  year    = {2025},
  journal = {arXiv preprint arXiv:2502.19204}
}
👨‍💻 Model Card Author
Parteek Kamboj
📋 Model Information