Layoutlmv2 Large Uncased Finetuned Infovqa
Document understanding model based on the LayoutLMv2 architecture, fine-tuned for InfoVQA tasks
Downloads 16
Release Time : 3/2/2022
Model Overview
This model is a document understanding model based on the LayoutLMv2 architecture, specifically fine-tuned for Information Visual Question Answering (InfoVQA) tasks. It can process documents containing text and layout information to answer questions related to the document content.
Model Features
Multimodal Understanding
Capable of processing both textual content and visual layout information simultaneously
Document Question Answering
Optimized specifically for document information question answering tasks
Large-scale Pretraining
Fine-tuned based on the large LayoutLMv2 model, with powerful document understanding capabilities
Model Capabilities
Document Understanding
Visual Question Answering
Text Layout Analysis
Information Extraction
Use Cases
Document Processing
Form Information Extraction
Extract specific information from structured documents and answer questions
Document Content Question Answering
Answer user questions based on document content
Featured Recommended AI Models
Š 2025AIbase