LayoutLMv2 Large Uncased Finetuned InfoVQA

Developed by tiennvcs
Document understanding model based on the LayoutLMv2 architecture, fine-tuned for InfoVQA tasks
Downloads 16
Release Time: 3/2/2022

Model Overview

This is a document understanding model based on the LayoutLMv2 architecture, fine-tuned for Infographic Visual Question Answering (InfoVQA). It takes a document's text, the layout positions of that text, and the page image as input, and answers questions about the document's content.
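To make the "layout information" input concrete, here is a minimal sketch of how word bounding boxes are commonly prepared for LayoutLM-family models: pixel coordinates are rescaled to an integer grid from 0 to 1000, independent of page size. The function name normalize_box, the sample words, and the page dimensions are hypothetical and chosen only for illustration; in practice a preprocessing component such as the LayoutLMv2Processor in Hugging Face Transformers typically handles this step.

```python
from typing import List, Sequence


def normalize_box(box: Sequence[float], page_width: float, page_height: float) -> List[int]:
    """Scale a pixel-space (x0, y0, x1, y1) box to the 0-1000 integer grid.

    Hypothetical helper for illustration; not part of the model's own code.
    """
    x0, y0, x1, y1 = box
    return [
        int(1000 * x0 / page_width),
        int(1000 * y0 / page_height),
        int(1000 * x1 / page_width),
        int(1000 * y1 / page_height),
    ]


# Made-up OCR output: three words with pixel boxes on an 800 x 1000 page.
words = ["Total", "revenue", "2021"]
pixel_boxes = [(50, 100, 150, 130), (160, 100, 300, 130), (310, 100, 400, 130)]
page_width, page_height = 800, 1000

# One normalized box per word, in the same order as `words`.
normalized = [normalize_box(b, page_width, page_height) for b in pixel_boxes]
for word, box in zip(words, normalized):
    print(word, box)
```

Each word is then paired with its normalized box, so the model sees where the text sits on the page as well as what it says.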

Model Features

Multimodal Understanding
Processes a document's textual content together with its visual layout
Document Question Answering
Optimized for answering questions about the information in a document
Large-scale Pretraining
Fine-tuned from the large uncased LayoutLMv2 model, so it inherits that model's pretrained document understanding capabilities

Model Capabilities

Document Understanding
Visual Question Answering
Text Layout Analysis
Information Extraction

Use Cases

Document Processing
Form Information Extraction
Extract specific information from structured documents and answer questions
Document Content Question Answering
Answer user questions based on document content
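As an illustration of how an answer might be read off such a model, the sketch below assumes the model is used as an extractive question answering head, that is, one that scores each token as a possible answer start and answer end. This is an assumption; the listing does not describe the output format. The helper best_span, the token list, and the logit values are all made up for the example and are not real model output.

```python
from typing import List, Tuple


def best_span(start_logits: List[float], end_logits: List[float],
              max_answer_len: int = 10) -> Tuple[int, int]:
    """Return the (start, end) token indices with the highest combined score.

    Only spans with start <= end and at most `max_answer_len` tokens are considered.
    Hypothetical helper for illustration.
    """
    best = (0, 0)
    best_score = float("-inf")
    for s, s_score in enumerate(start_logits):
        last = min(s + max_answer_len, len(end_logits))
        for e in range(s, last):
            score = s_score + end_logits[e]
            if score > best_score:
                best_score = score
                best = (s, e)
    return best


# Toy tokens and made-up scores standing in for model output.
tokens = ["what", "year", "?", "revenue", "grew", "in", "2021", "."]
start_logits = [0.1, 0.0, -1.0, 0.3, 0.2, 0.5, 4.0, -0.5]
end_logits = [0.0, 0.1, -1.0, 0.2, 0.1, 0.3, 3.5, 0.4]

start, end = best_span(start_logits, end_logits)
answer = " ".join(tokens[start:end + 1])
print(answer)
```

A real pipeline would also restrict the search to tokens from the document rather than the question, and would map token indices back to the original words.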