detr-resnet-50-sku110k開源目標檢測模型 - 免費部署助力商品貨架檢測

首頁

Detr Resnet 50 Sku110k

由isalia99開發

該DETR模型在SKU110K目標檢測數據集上進行了端到端訓練，查詢數設置為400，適用於商品貨架檢測等場景。

目標檢測

Transformers

開源協議:Apache-2.0 #密集商品檢測 #400查詢數優化 #零售場景專用

下載量 4,066

發布時間 : 3/14/2024

模型概述

這是一個基於DETR架構的目標檢測模型，使用ResNet-50作為骨幹網絡，在SKU110K數據集上進行了微調訓練，專門用於零售場景中的商品檢測。

模型特點

400查詢數設計

相比原始DETR模型，該版本將查詢數設置為400，可能更適合處理密集場景的目標檢測任務。

SKU110K數據集優化

專門針對零售商品檢測場景進行優化，在SKU110K數據集上表現良好。

兩階段訓練策略

採用先微調解碼器再微調整個網絡的訓練策略，可能有助於提高模型性能。

模型能力

商品目標檢測

零售場景圖像分析

密集物體識別

使用案例

零售行業

貨架商品檢測

自動識別和定位零售貨架上的商品

在SKU110K驗證集上達到58.9 mAP

庫存管理

通過圖像分析自動統計商品數量和位置

🚀 DETR（端到端目標檢測）模型

本項目是一個基於ResNet - 50骨幹網絡的DETR（端到端目標檢測）模型，在SKU110K數據集上進行訓練，設置了400個查詢數（num_queries）。該模型能夠有效解決目標檢測問題，為相關領域的應用提供了強大的支持。

🚀 快速開始

DETR（Detection Transformer）模型在SKU110K目標檢測數據集（包含8000張標註圖像）上進行了端到端的訓練。與原始模型相比，主要區別在於設置了400個查詢數（num_queries），並且在SKU110K數據集上進行了預訓練。

模型使用方法

以下是使用該模型的示例代碼：

from transformers import DetrImageProcessor, DetrForObjectDetection
import torch
from PIL import Image, ImageOps
import requests

url = "https://github.com/Isalia20/DETR-finetune/blob/main/IMG_3507.jpg?raw=true"
image = Image.open(requests.get(url, stream=True).raw)
image = ImageOps.exif_transpose(image)

# you can specify the revision tag if you don't want the timm dependency
processor = DetrImageProcessor.from_pretrained("facebook/detr-resnet-50", revision="no_timm")
model = DetrForObjectDetection.from_pretrained("isalia99/detr-resnet-50-sku110k")
model = model.eval()
inputs = processor(images=image, return_tensors="pt")
outputs = model(**inputs)

# convert outputs (bounding boxes and class logits) to COCO API
# let's only keep detections with score > 0.8
target_sizes = torch.tensor([image.size[::-1]])
results = processor.post_process_object_detection(outputs, target_sizes=target_sizes, threshold=0.8)[0]

for score, label, box in zip(results["scores"], results["labels"], results["boxes"]):
        box = [round(i, 2) for i in box.tolist()]
        print(
                f"Detected {model.config.id2label[label.item()]} with confidence "
                f"{round(score.item(), 3)} at location {box}"
        )

代碼運行後，預期輸出如下：

Detected LABEL_1 with confidence 0.983 at location [665.49, 480.05, 708.15, 650.11]
Detected LABEL_1 with confidence 0.938 at location [204.99, 1405.9, 239.9, 1546.5]
...
Detected LABEL_1 with confidence 0.998 at location [772.85, 169.49, 829.67, 372.18]
Detected LABEL_1 with confidence 0.999 at location [828.28, 1475.16, 874.37, 1593.43]

目前，特徵提取器和模型均支持PyTorch。