L

Layoutlm Invoices

Developed by magorshunov
A document QA model fine-tuned based on the LayoutLM architecture, specifically designed for processing discontinuous text recognition in invoices and other documents
Downloads 145
Release Time : 6/16/2023

Model Overview

This model is a multimodal model optimized for QA tasks on invoices and other documents, capable of recognizing cross-region discontinuous text, addressing the shortcomings of traditional models in recognizing multi-line text such as addresses

Model Features

Discontinuous Text Recognition
Can recognize cross-region discontinuous text through an additional classification head, overcoming the limitation of traditional models that can only predict continuous text segments
Multimodal Processing Capability
Combines text and visual information for document understanding, suitable for structured documents like invoices
Domain-Specific Optimization
Specifically optimized for invoice processing scenarios, excelling in financial document processing

Model Capabilities

Invoice Information Extraction
Document Visual QA
Cross-line Text Recognition
Structured Document Understanding

Use Cases

Financial Document Processing
Invoice Number Recognition
Accurately extract invoice number information from invoice documents
Successfully recognized cross-line discontinuous address text
Purchase Amount Extraction
Extract purchase amount information from contracts or invoices
Accurately identified numerical information in documents
Document Automation
Document Information Extraction
Automatically process key information in large volumes of documents
Improved document processing efficiency
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
Š 2025AIbase