Donut Base Sroie
A document understanding model fine-tuned from naver-clova-ix/donut-base for extracting structured information from documents
Downloads 15
Release Time: 3/23/2023
Model Overview
This is a vision-language model based on the Donut architecture, designed to extract structured information from scanned documents. It is suited to automated processing of receipts, invoices, and similar documents.
Model Features
Document Understanding Capability
Interprets both the text and the layout of scanned documents
End-to-End Processing
Goes directly from image input to structured output, with no separate OCR preprocessing step
Fine-Tuning Adaptation
Optimized for specific document types (e.g., receipts)
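The end-to-end flow described above can be sketched with the Hugging Face transformers library, which provides DonutProcessor and VisionEncoderDecoderModel for Donut checkpoints. This is a minimal sketch, not the model author's documented usage: the repository ID "your-namespace/donut-base-sroie" is a hypothetical placeholder, and the task prompt "<s>" is an assumption that may differ for this checkpoint.

```python
import re


def clean_sequence(sequence, eos_token="</s>", pad_token="<pad>"):
    """Strip EOS/PAD tokens and the leading task-prompt tag from decoded text."""
    sequence = sequence.replace(eos_token, "").replace(pad_token, "")
    # Remove only the first tag, which is the task start token.
    return re.sub(r"<.*?>", "", sequence, count=1).strip()


def extract_fields(image_path, model_id="your-namespace/donut-base-sroie",
                   task_prompt="<s>"):
    """Run image -> structured output with no OCR step.

    model_id is a hypothetical placeholder and task_prompt is an assumption;
    replace both with the values that match the actual checkpoint.
    """
    # Heavy dependencies are imported lazily so the helper above stays usable
    # without them installed.
    import torch
    from PIL import Image
    from transformers import DonutProcessor, VisionEncoderDecoderModel

    processor = DonutProcessor.from_pretrained(model_id)
    model = VisionEncoderDecoderModel.from_pretrained(model_id)
    model.eval()

    image = Image.open(image_path).convert("RGB")
    pixel_values = processor(image, return_tensors="pt").pixel_values
    decoder_input_ids = processor.tokenizer(
        task_prompt, add_special_tokens=False, return_tensors="pt"
    ).input_ids

    with torch.no_grad():
        outputs = model.generate(
            pixel_values,
            decoder_input_ids=decoder_input_ids,
            max_length=model.decoder.config.max_position_embeddings,
            pad_token_id=processor.tokenizer.pad_token_id,
            eos_token_id=processor.tokenizer.eos_token_id,
            use_cache=True,
            return_dict_in_generate=True,
        )

    decoded = processor.batch_decode(outputs.sequences)[0]
    cleaned = clean_sequence(
        decoded,
        eos_token=processor.tokenizer.eos_token,
        pad_token=processor.tokenizer.pad_token,
    )
    # token2json converts Donut's tag-delimited output into a Python dict.
    return processor.token2json(cleaned)
```

Calling extract_fields("receipt.jpg") would return a dictionary of extracted fields; the exact keys depend on how the checkpoint was fine-tuned.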
Model Capabilities
Document Image Understanding
Structured Information Extraction
Receipt Data Processing
Invoice Information Recognition
Use Cases
Document Automation
Receipt Information Extraction
Automatically extracts merchant, date, amount, and other information from scanned receipts
Automated financial record processing
Invoice Processing
Identifies key fields in invoices and stores them in a structured format
Simplifies corporate financial processes
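Storing extracted fields in a structured format usually involves normalizing the raw strings first. The sketch below assumes the model returns a dictionary with the keys "company", "date", and "total" (an assumption modeled on SROIE-style labels, not confirmed for this checkpoint); the ReceiptRecord class and helper names are hypothetical.

```python
import json
import re
from dataclasses import asdict, dataclass
from datetime import datetime
from decimal import Decimal, InvalidOperation
from typing import Optional


@dataclass
class ReceiptRecord:
    merchant: Optional[str]
    date: Optional[str]    # ISO 8601 date, or None if unparseable
    amount: Optional[str]  # decimal string, or None if unparseable


def parse_date(raw: str) -> Optional[str]:
    """Try a few common receipt date formats; return an ISO date or None."""
    for fmt in ("%d/%m/%Y", "%d-%m-%Y", "%Y-%m-%d"):
        try:
            return datetime.strptime(raw.strip(), fmt).date().isoformat()
        except ValueError:
            continue
    return None


def parse_amount(raw: str) -> Optional[str]:
    """Drop currency symbols and thousands separators, keep digits and dot."""
    cleaned = re.sub(r"[^\d.\-]", "", raw)
    try:
        return str(Decimal(cleaned))
    except InvalidOperation:
        return None


def to_record(fields: dict) -> ReceiptRecord:
    """Map assumed model output keys onto a normalized record."""
    return ReceiptRecord(
        merchant=(fields.get("company") or "").strip() or None,
        date=parse_date(fields.get("date", "")),
        amount=parse_amount(fields.get("total", "")),
    )


if __name__ == "__main__":
    record = to_record(
        {"company": "ABC MART", "date": "25/12/2018", "total": "RM 9.00"}
    )
    print(json.dumps(asdict(record)))
```

Keeping the amount as a decimal string rather than a float avoids rounding surprises when the record is later loaded into accounting software.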