# Document Image Processing
Table Transformer Page Segmentation Floorplan
This is an image segmentation model based on the Transformer architecture, specifically designed for page layout and floor plan segmentation tasks.
Image Segmentation
Transformers

T
digscom
22
0
Monkey Chat
The Monkey Model is a large multimodal model that excels in various visual tasks by enhancing image resolution and improving text labeling methods.
Image-to-Text
Transformers

M
echo840
179
16
Dof Passport 1
MIT
A model fine-tuned based on naver-clova-ix/donut-base, specific purpose not explicitly stated
Image-to-Text
Transformers

D
Sebabrata
16
0
Dof Receipts 1
MIT
Model fine-tuned based on naver-clova-ix/donut-base for processing image data
Text Recognition
Transformers

D
Sebabrata
31
0
Donut Proto
MIT
Donut is an OCR-free document understanding Transformer model that combines a visual encoder and text decoder for image-to-text conversion
Image-to-Text
Transformers

D
naver-clova-ix
30
7
Donut Base
MIT
Donut is an OCR-free document understanding Transformer model composed of a visual encoder (Swin Transformer) and a text decoder (BART).
Image-to-Text
Transformers

D
naver-clova-ix
50.34k
207
Featured Recommended AI Models