Layoutlm Invoices
A document QA model fine-tuned based on the LayoutLM architecture, specifically designed for processing discontinuous text recognition in invoices and other documents
Downloads 145
Release Time : 6/16/2023
Model Overview
This model is a multimodal model optimized for QA tasks on invoices and other documents, capable of recognizing cross-region discontinuous text, addressing the shortcomings of traditional models in recognizing multi-line text such as addresses
Model Features
Discontinuous Text Recognition
Can recognize cross-region discontinuous text through an additional classification head, overcoming the limitation of traditional models that can only predict continuous text segments
Multimodal Processing Capability
Combines text and visual information for document understanding, suitable for structured documents like invoices
Domain-Specific Optimization
Specifically optimized for invoice processing scenarios, excelling in financial document processing
Model Capabilities
Invoice Information Extraction
Document Visual QA
Cross-line Text Recognition
Structured Document Understanding
Use Cases
Financial Document Processing
Invoice Number Recognition
Accurately extract invoice number information from invoice documents
Successfully recognized cross-line discontinuous address text
Purchase Amount Extraction
Extract purchase amount information from contracts or invoices
Accurately identified numerical information in documents
Document Automation
Document Information Extraction
Automatically process key information in large volumes of documents
Improved document processing efficiency
Featured Recommended AI Models
Š 2025AIbase