Donut Base SROIE
A document understanding model fine-tuned from naver-clova-ix/donut-base for extracting text from document images
Downloads 185
Release Time: 9/2/2022
Model Overview
This model builds on the Donut architecture and is fine-tuned to extract text from document images, such as receipts and invoices.
Model Features
Document Image Understanding
Optimized for text extraction tasks in document images (e.g., receipts, invoices)
Transformer-based Architecture
Uses the Donut architecture, which combines vision and language processing
End-to-End Processing
Maps an input image directly to text output, with no intermediate OCR step
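Donut-style models typically emit their text output as a single sequence in which each field is wrapped in paired tags, and that sequence is then converted into structured data. The sketch below shows one minimal way to do that conversion in plain Python. The tag names (`company`, `date`, `total`) and the sample string are assumptions for illustration only; the fields this particular checkpoint actually produces are not stated here.

```python
import re


def parse_tagged_sequence(seq: str) -> dict:
    """Convert a Donut-style tagged string into a flat dict of field -> text.

    Handles only flat (non-nested) fields of the form <s_name>value</s_name>.
    """
    fields = {}
    # The backreference \1 requires the closing tag to match the opening tag.
    for match in re.finditer(r"<s_(\w+)>(.*?)</s_\1>", seq, flags=re.DOTALL):
        name = match.group(1)
        value = match.group(2).strip()
        fields[name] = value
    return fields


# Hypothetical model output; real output depends on the fine-tuning data.
example = (
    "<s_company>ACME STORE</s_company>"
    "<s_date>25/12/2018</s_date>"
    "<s_total>9.00</s_total>"
)
result = parse_tagged_sequence(example)
print(result)
```

A regex is enough for flat fields like these; nested or repeated fields would need a recursive parser instead.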
Model Capabilities
Document image text extraction
Receipt information recognition
Invoice data extraction
Use Cases
Business Document Processing
Receipt Information Extraction
Automatically extracts key information from scanned or photographed receipts
Invoice Data Processing
Automatically identifies information such as amount, date, and supplier in invoices
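Extracted values such as amounts and dates arrive as raw strings, so downstream systems usually normalize them before use. The following is a minimal sketch of that post-processing step, not part of the model itself. The function names, the accepted date formats, and the sample inputs are all assumptions chosen for illustration.

```python
import re
from datetime import date, datetime
from decimal import Decimal, InvalidOperation
from typing import Optional

# Assumed date layouts; extend this tuple for the documents you actually see.
DATE_FORMATS = ("%d/%m/%Y", "%Y-%m-%d", "%d-%m-%Y")


def normalize_amount(raw: str) -> Optional[Decimal]:
    """Strip currency symbols, spaces, and thousands commas, then parse."""
    cleaned = re.sub(r"[^\d.\-]", "", raw)
    try:
        return Decimal(cleaned)
    except InvalidOperation:
        return None  # nothing numeric was left to parse


def normalize_date(raw: str) -> Optional[date]:
    """Try each assumed format in turn; return None if none of them match."""
    for fmt in DATE_FORMATS:
        try:
            return datetime.strptime(raw.strip(), fmt).date()
        except ValueError:
            continue
    return None


amount = normalize_amount("RM 1,234.50")
parsed_date = normalize_date("25/12/2018")
print(amount, parsed_date)
```

Returning `None` rather than raising keeps a single unreadable field from discarding the rest of a receipt; callers can flag such records for manual review.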