# Multilingual Document Understanding
Erax VL 7B V2.0 Preview
Apache-2.0
EraX-VL-7B-V2.0-Preview is a powerful multimodal model designed for OCR and visual question answering, excelling in processing multiple languages including Vietnamese, with outstanding performance in recognizing medical forms, invoices, and other documents.
Image-to-Text
Transformers Supports Multiple Languages

E
erax-ai
476
22
Layout Xlm Base Finetuned With DocLayNet Base At Paragraphlevel Ml512
MIT
This model is a fine-tuned version of the LayoutXLM base model on the DocLayNet dataset, specifically designed for document layout analysis and paragraph-level content understanding.
Text Recognition
Transformers Supports Multiple Languages

L
pierreguillou
79
9
Layout Xlm Base Finetuned With DocLayNet Base At Linelevel Ml384
MIT
A line-level document understanding model fine-tuned on the DocLayNet dataset based on the LayoutXLM base model, supporting multilingual document layout analysis and token classification.
Text Recognition
Transformers Supports Multiple Languages

L
pierreguillou
103
3
Lilt Xlm Roberta Base Finetuned With DocLayNet Base At Paragraphlevel Ml512
MIT
This is a document understanding model specifically designed for analyzing document layout and content, performing token classification tasks at the paragraph level.
Text Recognition
Transformers Supports Multiple Languages

L
pierreguillou
126
3
Featured Recommended AI Models