Layoutlmv2 Large Uncased Finetuned Vi Infovqa
A document visual question answering model fine-tuned based on microsoft/layoutlmv2-large-uncased, suitable for Vietnamese information extraction tasks
Downloads 16
Release Time : 3/2/2022
Model Overview
This model is a LayoutLMv2 model optimized for document visual question answering (VQA) tasks, specifically adapted for Vietnamese information extraction scenarios, capable of understanding document layout and visual information for question answering
Model Features
Multimodal Understanding Capability
Combines text, layout, and visual information for comprehensive understanding
Vietnamese Optimization
Specifically fine-tuned for Vietnamese document information extraction tasks
Document Structure Awareness
Capable of understanding document layout and structural information
Model Capabilities
Document Visual Question Answering
Vietnamese Information Extraction
Document Layout Analysis
Multimodal Understanding
Use Cases
Document Processing
Vietnamese Form Information Extraction
Automatically extracts key information from Vietnamese form documents
Document Visual Question Answering System
Answers natural language questions about document content
Featured Recommended AI Models
Š 2025AIbase