Layoutlmv3 Finetuned Cord
A document understanding model fine-tuned on the CORD dataset based on LayoutLMv3, excelling in document token classification tasks
Downloads 617
Release Time : 5/2/2022
Model Overview
This model is a fine-tuned version of Microsoft's LayoutLMv3 architecture, specifically designed for document token classification tasks in the CORD dataset, capable of accurately identifying and classifying text elements in documents
Model Features
High-precision Document Understanding
Achieves over 96% F1 score on the CORD dataset, accurately identifying various text elements in documents
Multimodal Processing Capability
Combines textual content and visual layout information for comprehensive analysis
End-to-End Training
Supports complete processing from raw document images to final classification results
Model Capabilities
Document Token Classification
Document Layout Analysis
Text Element Recognition
Structured Document Understanding
Use Cases
Document Processing
Receipt Information Extraction
Automatically extracts merchant, date, amount and other information from scanned receipts
96.8% accuracy
Table Data Recognition
Identifies table structures in documents and extracts content
Financial Automation
Invoice Processing
Automates enterprise invoice processing to extract key financial information
Featured Recommended AI Models