# Document Information Extraction
Vintern-1B-v3.5
MIT
Vintern-1B-v3.5 is a multimodal large language model fine-tuned from InternVL2.5-1B. It specializes in Vietnamese text processing and performs well at OCR and at understanding Vietnamese-specific documents.
Image-to-Text
Transformers
Supports Multiple Languages

5CD-AI
6,875
35
Table Transformer Structure Recognition
MIT
A Table Transformer model trained on the PubTables-1M dataset to extract table structures from unstructured documents.
Text Recognition
Transformers

microsoft
1.2M
186
Lmv2 G Aadhaar 236doc 06 14
A fine-tuned version of microsoft/layoutlmv2-base-uncased for document information extraction. It extracts fields such as Aadhaar card number, date of birth, gender, and name.
Sequence Labeling
Transformers

Sebabrata
52
0
LayoutLMv3 Finetuned SROIE
A document understanding model based on Microsoft's LayoutLMv3-base and fine-tuned on the SROIE dataset to extract structured information from scanned documents.
Text Recognition
Transformers

Theivaprakasham
409
0
LayoutLMv2 Finetuned SROIE
A document information extraction model based on the LayoutLMv2 architecture and fine-tuned on the SROIE dataset to extract key fields from receipt documents.
Sequence Labeling
Transformers

Theivaprakasham
71
2