D

Detr Resnet 50 Dc5

Developed by facebook
DETR is an end-to-end object detection model based on Transformer architecture, using ResNet-50 as the backbone network and trained on the COCO dataset.
Downloads 4,038
Release Time : 3/2/2022

Model Overview

This model is used for object detection tasks, capable of identifying objects in images and annotating their bounding boxes. It employs a Transformer encoder-decoder structure combined with a convolutional backbone network, using an object query mechanism to detect objects in images.

Model Features

End-to-end object detection
Directly outputs detection results without complex post-processing steps (e.g., non-maximum suppression).
Transformer-based architecture
Utilizes the self-attention mechanism of Transformers to process image features, enabling global context understanding.
Object query mechanism
Uses 100 object queries to detect objects in images, with each query responsible for finding a specific object.
Bipartite matching loss
Employs the Hungarian algorithm to match predictions with ground truth annotations, optimizing the model training process.

Model Capabilities

Object detection
Bounding box prediction
Multi-class recognition

Use Cases

Computer vision
General object detection
Detects common objects in natural scene images, such as people, animals, vehicles, etc.
Achieves 43.3 AP on the COCO validation set
Scene understanding
Analyzes object distribution and spatial relationships in complex scenes.
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
Š 2025AIbase