AIbase
Home
AI Tools
AI Models
MCP
AI NEWS
EN
Model Selection
Tags
Document Image Processing

# Document Image Processing

Table Transformer Page Segmentation Floorplan
This is an image segmentation model based on the Transformer architecture, specifically designed for page layout and floor plan segmentation tasks.
Image Segmentation Transformers
T
digscom
22
0
Monkey Chat
The Monkey Model is a large multimodal model that excels in various visual tasks by enhancing image resolution and improving text labeling methods.
Image-to-Text Transformers
M
echo840
179
16
Dof Passport 1
MIT
A model fine-tuned based on naver-clova-ix/donut-base, specific purpose not explicitly stated
Image-to-Text Transformers
D
Sebabrata
16
0
Dof Receipts 1
MIT
Model fine-tuned based on naver-clova-ix/donut-base for processing image data
Text Recognition Transformers
D
Sebabrata
31
0
Donut Proto
MIT
Donut is an OCR-free document understanding Transformer model that combines a visual encoder and text decoder for image-to-text conversion
Image-to-Text Transformers
D
naver-clova-ix
30
7
Donut Base
MIT
Donut is an OCR-free document understanding Transformer model composed of a visual encoder (Swin Transformer) and a text decoder (BART).
Image-to-Text Transformers
D
naver-clova-ix
50.34k
207
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
English简体中文繁體中文にほんご
© 2025AIbase