P

PP DocBlockLayout

Developed by PaddlePaddle
PP-DocBlockLayout is a document layout block positioning model trained based on RT-DETR-L, which can effectively identify layout regions in various document types.
Downloads 1,039
Release Time : 6/6/2025

Model Overview

This model focuses on document layout analysis and can identify layout regions in various documents such as Chinese and English papers, PPTs, magazines, contracts, and books. It is suitable for document structuring and information extraction tasks.

Model Features

Support for multiple document types
The training data covers various document types such as Chinese and English papers, PPTs, magazines, contracts, and books, with wide applicability.
High-precision detection
It achieves an mAP(0.5) accuracy of 95.9% on the self-built dataset and can accurately identify layout regions in documents.
Easy integration
It provides a simple installation and usage method and supports quick integration into existing projects.

Model Capabilities

Document layout detection
Multi-document type recognition
Layout region positioning

Use Cases

Document processing
Paper structure analysis
Identify regions such as titles, main texts, and charts in papers to assist in paper structure analysis.
Accurately divide each part of the paper.
Contract information extraction
Locate key clause regions in contracts to facilitate subsequent information extraction.
Accurately identify contract clause regions
Education
Test paper analysis
Identify regions such as questions and options in test papers to assist the automatic grading system.
Accurately divide each question region of the test paper
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
Š 2025AIbase