Layoutlmv2 Base Uncased Finetuned Docvqa
A document visual question answering model based on the LayoutLMv2 architecture, fine-tuned specifically for document understanding tasks
Downloads 14
Release Time : 6/22/2023
Model Overview
This model is a fine-tuned version of LayoutLMv2 base on the DocVQA task, capable of understanding document layouts and text content to answer questions about documents.
Model Features
Multimodal Understanding Capability
Processes both textual content and document layout information simultaneously
Document-Specific Optimization
Specially fine-tuned for document visual question answering tasks
End-to-End Training
Learns text and visual features directly from raw document images
Model Capabilities
Document Understanding
Visual Question Answering
Text Localization
Layout Analysis
Use Cases
Document Processing
Form Information Extraction
Extracts specific field information from structured documents
Document Q&A System
Answers natural language questions about document content
Enterprise Automation
Invoice Processing
Automatically identifies and extracts key information from invoices
Featured Recommended AI Models
Š 2025AIbase