Open-source layoutlmv2-large-uncased-finetuned-infovqa Model - Empowering Document Information Understanding and Extraction

Home

Layoutlmv2 Large Uncased Finetuned Infovqa

Developed by tiennvcs

Document understanding model based on the LayoutLMv2 architecture, fine-tuned for InfoVQA tasks

Question Answering System

Transformers

#Document Visual Question Answering #Multimodal Understanding #Form Information Extraction

Downloads 16

Release Time : 3/2/2022

Model Overview

This model is a document understanding model based on the LayoutLMv2 architecture, specifically fine-tuned for Information Visual Question Answering (InfoVQA) tasks. It can process documents containing text and layout information to answer questions related to the document content.

Model Features

Multimodal Understanding

Capable of processing both textual content and visual layout information simultaneously

Document Question Answering

Optimized specifically for document information question answering tasks

Large-scale Pretraining

Fine-tuned based on the large LayoutLMv2 model, with powerful document understanding capabilities

Model Capabilities

Document Understanding

Visual Question Answering

Text Layout Analysis

Information Extraction

Use Cases

Document Processing

Form Information Extraction

Extract specific information from structured documents and answer questions

Document Content Question Answering

Answer user questions based on document content

Training Loss	Epoch	Step	Validation Loss
4.1829	0.08	500	3.6339
3.5002	0.16	1000	3.0721
2.9556	0.24	1500	2.8731
2.8939	0.33	2000	3.1566
2.6986	0.41	2500	3.1023
2.7569	0.49	3000	2.7743
2.6391	0.57	3500	2.5023
2.4277	0.65	4000	2.5465
2.4242	0.73	4500	2.4709
2.3978	0.82	5000	2.4019
2.2653	0.9	5500	2.3383
2.3916	0.98	6000	2.4765
1.9423	1.06	6500	2.3798
1.8538	1.14	7000	2.3628
1.8136	1.22	7500	2.3671
1.7808	1.31	8000	2.5585
1.7772	1.39	8500	2.5862
1.755	1.47	9000	2.3105
1.6529	1.55	9500	2.2417
1.6956	1.63	10000	2.1755
1.5713	1.71	10500	2.2917
1.565	1.79	11000	2.0838
1.615	1.88	11500	2.2111
1.5249	1.96	12000	2.2207

Featured Recommended AI Models

Empowering the Future, Your AI Solution Knowledge Base

English 简体中文繁體中文にほんご

Layoutlmv2 Large Uncased Finetuned Infovqa

Model Overview

Model Features

Model Capabilities

Use Cases

🚀 layoutlmv2-large-uncased-finetuned-infovqa

📚 Documentation

Model description

Intended uses & limitations

Training and evaluation data

🔧 Technical Details

Training procedure

Training hyperparameters

Training results

Framework versions

📄 License