D

Deformable Detr DocLayNet

Developed by Aryn
A deformable DETR model trained on the DocLayNet dataset for object detection tasks in document layout analysis, achieving a box mAP of 57.1.
Downloads 18.50k
Release Time : 3/19/2024

Model Overview

Deformable DEtection TRansformer (DETR) model, specifically designed for document layout analysis, capable of detecting various elements in documents.

Model Features

Deformable Attention Mechanism
Utilizes deformable attention modules to more efficiently handle irregular layouts and elements of varying sizes in documents.
End-to-End Training
Eliminates the need for anchor box design and NMS post-processing in traditional object detection, enabling end-to-end training and inference.
Document Layout Analysis Optimization
Specifically optimized for the DocLayNet dataset, accurately identifying 11 types of elements in documents such as tables, text paragraphs, and images.

Model Capabilities

Document Element Detection
Bounding Box Prediction
Category Classification

Use Cases

Document Processing
Document Layout Analysis
Automatically identifies various elements and their positional relationships in documents
57.1 box mAP
Document Digitization
Assists in converting paper documents into structured digital formats
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
Š 2025AIbase