D

Donut Base Sroie

Developed by enoreyes
A document understanding model fine-tuned from naver-clova-ix/donut-base, specialized in structured document information extraction tasks
Downloads 15
Release Time : 3/23/2023

Model Overview

This model is a vision-language model based on the Donut architecture, specifically designed for extracting structured information from scanned documents. Suitable for automated processing of receipts, invoices, and similar documents.

Model Features

Document Understanding Capability
Can comprehend text and layout information in scanned documents
End-to-End Processing
Directly processes from image input to structured output without OCR preprocessing
Fine-Tuning Adaptation
Optimized for specific document types (e.g., receipts)

Model Capabilities

Document Image Understanding
Structured Information Extraction
Receipt Data Processing
Invoice Information Recognition

Use Cases

Document Automation
Receipt Information Extraction
Automatically extracts merchant, date, amount, and other information from scanned receipts
Automated financial record processing
Invoice Processing
Identifies key fields in invoices and stores them in a structured format
Simplifies corporate financial processes
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
Š 2025AIbase