D

Detr Resnet 50 Panoptic

Developed by facebook
DETR is an end-to-end object detection model based on Transformer architecture, using ResNet-50 as the backbone network, trained on the COCO dataset, and supports object detection and panoptic segmentation tasks.
Downloads 9,586
Release Time : 3/2/2022

Model Overview

DEtection TRansformer (DETR) is an end-to-end model trained on the COCO 2017 panoptic dataset, employing an encoder-decoder Transformer architecture combined with a convolutional backbone network, achieving object detection and segmentation through an object query mechanism.

Model Features

End-to-End Object Detection
Directly outputs detection results without complex post-processing steps like anchor boxes and Non-Maximum Suppression (NMS) required in traditional object detection models.
Transformer Architecture
Uses an encoder-decoder Transformer structure, leveraging self-attention mechanisms to process global contextual information.
Object Query Mechanism
Predicts objects in an image through 100 fixed object queries, with each query corresponding to a specific object.
Bipartite Matching Loss
Uses the Hungarian algorithm to establish optimal one-to-one mappings between predictions and ground truth annotations, optimizing model training.

Model Capabilities

Object Detection
Panoptic Segmentation
Image Analysis

Use Cases

Computer Vision
Sports Event Analysis
Detect and segment players, the ball, and field elements in football matches
Animal Recognition
Identify and segment different animal species in images
Scene Understanding
Analyze various objects in complex scenes such as construction sites
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
Š 2025AIbase