D

Deformable Detr Box Supervised

Developed by facebook
Deformable DETR is an object detection model based on Transformer architecture, trained on the LVIS dataset, supporting detection of 1203 object categories.
Downloads 193
Release Time : 2/27/2023

Model Overview

This model adopts the Deformable DETR architecture, combining a convolutional backbone network with a Transformer encoder-decoder, achieving efficient object detection through object query mechanisms.

Model Features

Large-scale Category Detection
Supports detection of 1203 object categories from the LVIS dataset, including rare categories.
Efficient Transformer Architecture
Utilizes the Deformable DETR architecture, improving computational efficiency through deformable attention mechanisms.
End-to-End Training
Directly outputs detection results without complex post-processing.

Model Capabilities

Multi-class Object Detection
Bounding Box Prediction
Large-scale Visual Recognition

Use Cases

General Object Detection
Scene Understanding
Detecting multiple objects in complex scenes
Achieves 31.7 mAP on the LVIS dataset
Rare Object Detection
Identifying uncommon object categories
Rare category mAP reaches 21.4
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
Š 2025AIbase