U

Udop Large 512

Developed by microsoft
UDOP is a universal document processing model that unifies vision, text, and layout, based on the T5 architecture, suitable for tasks such as document image classification, parsing, and visual question answering.
Downloads 193
Release Time : 2/26/2024

Model Overview

UDOP employs a T5-based encoder-decoder Transformer architecture, integrating visual, textual, and layout information for document AI tasks.

Model Features

Multimodal Unified Processing
Integrates visual, textual, and layout information for joint processing
Universal Document Processing
Supports various document AI tasks, including classification, parsing, and question answering
Based on T5 Architecture
Utilizes the proven T5 encoder-decoder Transformer architecture

Model Capabilities

Document image classification
Document structure parsing
Document visual question answering
Document semantic understanding

Use Cases

Document Processing
Table Information Extraction
Extract table data from document images
Example output: 9/30/92
Document Classification
Classify document images
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
Š 2025AIbase