detr-resnet-101-dc5-sku110k: Open-Source Object Detection Model

Detr Resnet 101 Dc5 Sku110k

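Developed by isalia99

An object detection model based on the DETR architecture, with a ResNet-101-DC5 backbone, trained on the SKU110K dataset with the number of queries set to 400.

Object Detection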

Transformers

License: Apache-2.0 #RetailShelfDetection #400QueryOptimization #DETRArchitecture

Downloads: 129

Released: 3/18/2024

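Model Overview

This model is built for object detection and is particularly suited to detecting products in retail scenes.

Model Features

400-query design

The original DETR model uses 100 object queries; this model sets the number of queries to 400, which may improve detection of densely packed small objects.

Pre-training on the SKU110K dataset

Optimized specifically for retail product detection, with end-to-end training on the SKU110K dataset.

End-to-end training

Follows DETR's end-to-end training approach, so no complex post-processing pipeline (such as non-maximum suppression) is needed.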
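
To make the "no complex post-processing" point concrete, here is a minimal sketch of what decoding DETR-style outputs typically involves: a softmax over each query's class logits (with the last entry treated as the "no object" class), a score threshold, and a conversion from normalized center-format boxes to pixel corners. The function names, the three-logit layout, and the sample values are illustrative assumptions, not code from this model or from the transformers library.

```python
import math


def softmax(logits):
    """Numerically stable softmax over a list of floats."""
    peak = max(logits)
    exps = [math.exp(x - peak) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def cxcywh_to_xyxy(box, img_w, img_h):
    """Convert a normalized (cx, cy, w, h) box to pixel (x1, y1, x2, y2)."""
    cx, cy, w, h = box
    return [
        (cx - w / 2) * img_w,
        (cy - h / 2) * img_h,
        (cx + w / 2) * img_w,
        (cy + h / 2) * img_h,
    ]


def decode_queries(logits_per_query, boxes_per_query, img_w, img_h, threshold=0.8):
    """Keep one detection per query whose best object score beats the threshold."""
    detections = []
    for logits, box in zip(logits_per_query, boxes_per_query):
        probs = softmax(logits)
        object_probs = probs[:-1]  # assumption: last logit is the "no object" class
        score = max(object_probs)
        label = object_probs.index(score)
        if score > threshold:
            detections.append(
                {"score": score, "label": label, "box": cxcywh_to_xyxy(box, img_w, img_h)}
            )
    return detections


# Two made-up queries: the first strongly predicts class 1, the second "no object".
sample_logits = [[0.0, 5.0, 0.0], [0.0, 0.0, 5.0]]
sample_boxes = [(0.5, 0.5, 0.2, 0.4), (0.1, 0.1, 0.05, 0.05)]
detections = decode_queries(sample_logits, sample_boxes, img_w=1000, img_h=500)
for det in detections:
    print(det)
```

Because every query yields at most one detection, the number of queries (400 here) is also an upper bound on how many objects can be reported per image.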

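Model Capabilities

Object detection

Retail product recognition

Dense small-object detection

Use Cases

Retail

Shelf product detection

Automatically detects and identifies products on retail shelves.

Achieves an mAP of 59.8 on the SKU110K validation set.

Inventory management

Helps retail stores automate inventory counts.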
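
As one illustration of how detections might feed an inventory count, the sketch below groups detected boxes into shelf rows by their vertical centers and counts the items in each row. The helper `group_into_shelves`, the `gap_factor` heuristic, and the grouping rule are hypothetical choices made for this example; they are not part of the model or its repository, and a real shelf layout may need a more robust method.

```python
from statistics import median


def group_into_shelves(boxes, gap_factor=0.5):
    """Group (x1, y1, x2, y2) boxes into rows, ordered top to bottom.

    A new row starts when the vertical center of a box is more than
    gap_factor * median box height below the previous box's center.
    """
    if not boxes:
        return []
    median_height = median(y2 - y1 for _, y1, _, y2 in boxes)
    ordered = sorted(boxes, key=lambda b: (b[1] + b[3]) / 2)
    shelves = [[ordered[0]]]
    for box in ordered[1:]:
        previous = shelves[-1][-1]
        previous_center = (previous[1] + previous[3]) / 2
        center = (box[1] + box[3]) / 2
        if center - previous_center > gap_factor * median_height:
            shelves.append([box])
        else:
            shelves[-1].append(box)
    return shelves


# Example boxes in pixel coordinates (x1, y1, x2, y2).
sample_boxes = [
    [665.49, 480.05, 708.15, 650.11],
    [204.99, 1405.9, 239.9, 1546.5],
    [772.85, 169.49, 829.67, 372.18],
    [828.28, 1475.16, 874.37, 1593.43],
]
shelves = group_into_shelves(sample_boxes)
for index, shelf in enumerate(shelves, start=1):
    print(f"Shelf row {index}: {len(shelf)} item(s)")
```

The per-row counts can then be compared against an expected planogram or a previous count; that comparison step is left out here because it depends on store-specific data.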

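🚀 DETR (End-to-End Object Detection) Model: ResNet-101-DC5 Backbone, Trained on SKU110K, num_queries = 400

The DETR (Detection Transformer) model in this project was trained end to end on the SKU110K object detection dataset (8,000 annotated images). The main differences from the original model are that num_queries is set to 400 and that it was pre-trained on the SKU110K dataset.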

🚀 Quick Start

How to Use the Model

The following example shows how to use the model:

from transformers import DetrImageProcessor, DetrForObjectDetection
import torch
from PIL import Image, ImageOps
import requests

# Download the sample image and apply its EXIF orientation
url = "https://github.com/Isalia20/DETR-finetune/blob/main/IMG_3507.jpg?raw=true"
image = Image.open(requests.get(url, stream=True).raw)
image = ImageOps.exif_transpose(image)

# The image processor comes from the base DETR checkpoint;
# the weights come from the SKU110K-trained model
processor = DetrImageProcessor.from_pretrained("facebook/detr-resnet-101-dc5")
model = DetrForObjectDetection.from_pretrained("isalia99/detr-resnet-101-dc5-sku110k")
model = model.eval()

inputs = processor(images=image, return_tensors="pt")
# Inference only, so skip gradient tracking
with torch.no_grad():
    outputs = model(**inputs)

# Convert outputs (bounding boxes and class logits) to absolute pixel boxes
# and keep only detections with score > 0.8
# image.size is (width, height); target_sizes expects (height, width)
target_sizes = torch.tensor([image.size[::-1]])
results = processor.post_process_object_detection(outputs, target_sizes=target_sizes, threshold=0.8)[0]

for score, label, box in zip(results["scores"], results["labels"], results["boxes"]):
    box = [round(i, 2) for i in box.tolist()]
    print(
        f"Detected {model.config.id2label[label.item()]} with confidence "
        f"{round(score.item(), 3)} at location {box}"
    )

Running the code above prints output like the following:

Detected LABEL_1 with confidence 0.983 at location [665.49, 480.05, 708.15, 650.11]
Detected LABEL_1 with confidence 0.938 at location [204.99, 1405.9, 239.9, 1546.5]
...
Detected LABEL_1 with confidence 0.998 at location [772.85, 169.49, 829.67, 372.18]
Detected LABEL_1 with confidence 0.999 at location [828.28, 1475.16, 874.37, 1593.43]
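
If the detections need to be stored or evaluated with COCO-style tooling, the corner-format boxes printed above can be converted to the COCO results layout, where `bbox` is `[x, y, width, height]`. The sketch below is a minimal example under assumptions: the helper `to_coco_records` is hypothetical, the detections are hard-coded from the sample output rather than read from the model, and `image_id=0` is a placeholder.

```python
import json


def to_coco_records(detections, image_id):
    """Convert detections with (x1, y1, x2, y2) boxes to COCO-style result records."""
    records = []
    for det in detections:
        x_min, y_min, x_max, y_max = det["box"]
        records.append(
            {
                "image_id": image_id,  # placeholder id chosen by the caller
                "category_id": det["label"],
                "bbox": [
                    round(x_min, 2),
                    round(y_min, 2),
                    round(x_max - x_min, 2),  # width
                    round(y_max - y_min, 2),  # height
                ],
                "score": round(det["score"], 3),
            }
        )
    return records


# Two detections copied from the sample output (LABEL_1 corresponds to label id 1).
sample = [
    {"score": 0.983, "label": 1, "box": [665.49, 480.05, 708.15, 650.11]},
    {"score": 0.999, "label": 1, "box": [828.28, 1475.16, 874.37, 1593.43]},
]
records = to_coco_records(sample, image_id=0)
print(json.dumps(records, indent=2))
```

With real model output, each entry of `results["scores"]`, `results["labels"]`, and `results["boxes"]` would first be converted to plain Python values (for example with `.item()` and `.tolist()`) before being passed in.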

Currently, both the feature extractor and the model support PyTorch.