Udop Large 512
U
Udop Large 512
Developed by microsoft
UDOP is a universal document processing model that unifies vision, text, and layout, based on the T5 architecture, suitable for tasks such as document image classification, parsing, and visual question answering.
Downloads 193
Release Time : 2/26/2024
Model Overview
UDOP employs a T5-based encoder-decoder Transformer architecture, integrating visual, textual, and layout information for document AI tasks.
Model Features
Multimodal Unified Processing
Integrates visual, textual, and layout information for joint processing
Universal Document Processing
Supports various document AI tasks, including classification, parsing, and question answering
Based on T5 Architecture
Utilizes the proven T5 encoder-decoder Transformer architecture
Model Capabilities
Document image classification
Document structure parsing
Document visual question answering
Document semantic understanding
Use Cases
Document Processing
Table Information Extraction
Extract table data from document images
Example output: 9/30/92
Document Classification
Classify document images
Featured Recommended AI Models
Š 2025AIbase