# Visual Encoder-Decoder
Donut Demo
MIT
A VisionEncoderDecoder model fine-tuned on the CORD-v2 dataset for document understanding tasks.
Text Recognition
Transformers

nielsr
56
1
Vit2distilgpt2
MIT
An image-to-text generation model that takes an image as input and outputs descriptive text.
Image-to-Text
Transformers, English

sachin
49
8