Donut Base Sroie

Developed by philschmid
A document understanding model fine-tuned from naver-clova-ix/donut-base for extracting text from document images
Downloads 185
Release Time: 9/2/2022

Model Overview

This document understanding model is based on the Donut architecture and fine-tuned to extract text from images. It is suited to image documents that contain text, such as receipts and invoices.

Model Features

Document Image Understanding
Optimized for text extraction tasks in document images (e.g., receipts, invoices)
Transformer-based Architecture
Utilizes the Donut architecture, combining vision and language processing capabilities
End-to-End Processing
Maps image input directly to text output, with no intermediate OCR step
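
The end-to-end flow described above can be sketched with the Hugging Face transformers library. This is a minimal sketch, not the author's reference code: the Hub identifier `philschmid/donut-base-sroie`, the `<s>` task prompt, and the helper names `clean_donut_sequence` and `extract_fields` are assumptions made for illustration, so check them against the actual model repository before relying on them.

```python
import re


def clean_donut_sequence(sequence: str, eos_token: str = "</s>", pad_token: str = "<pad>") -> str:
    """Strip EOS/pad tokens and the leading task-start token from decoded output."""
    sequence = sequence.replace(eos_token, "").replace(pad_token, "")
    # Remove only the first tag, which is the task-start prompt.
    return re.sub(r"<.*?>", "", sequence, count=1).strip()


def extract_fields(
    image_path: str,
    model_id: str = "philschmid/donut-base-sroie",  # assumed Hub id
    task_prompt: str = "<s>",  # assumed decoder start prompt
) -> dict:
    """Run one document image through the model and return the parsed fields."""
    # Heavy dependencies are imported lazily so the helper above stays usable alone.
    import torch
    from PIL import Image
    from transformers import DonutProcessor, VisionEncoderDecoderModel

    processor = DonutProcessor.from_pretrained(model_id)
    model = VisionEncoderDecoderModel.from_pretrained(model_id)
    model.eval()

    # The image goes straight to the vision encoder; no OCR engine is involved.
    image = Image.open(image_path).convert("RGB")
    pixel_values = processor(image, return_tensors="pt").pixel_values
    decoder_input_ids = processor.tokenizer(
        task_prompt, add_special_tokens=False, return_tensors="pt"
    ).input_ids

    with torch.no_grad():
        output_ids = model.generate(
            pixel_values,
            decoder_input_ids=decoder_input_ids,
            max_length=model.decoder.config.max_position_embeddings,
            pad_token_id=processor.tokenizer.pad_token_id,
            eos_token_id=processor.tokenizer.eos_token_id,
            bad_words_ids=[[processor.tokenizer.unk_token_id]],
        )

    decoded = processor.batch_decode(output_ids)[0]
    cleaned = clean_donut_sequence(
        decoded, processor.tokenizer.eos_token, processor.tokenizer.pad_token
    )
    # token2json converts the tagged sequence into a nested dict.
    return processor.token2json(cleaned)
```

Calling `extract_fields("receipt.jpg")` with a hypothetical local image would download the weights on first use and return a dictionary of whatever fields the model emits for that image.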

Model Capabilities

Document image text extraction
Receipt information recognition
Invoice data extraction

Use Cases

Business Document Processing
Receipt Information Extraction
Automatically extracts key information from scanned or photographed receipts
Invoice Data Processing
Automatically identifies fields such as amount, date, and supplier on invoices
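
Extracted fields usually arrive as raw strings, so a downstream step often normalizes them before they are stored. The sketch below shows one way to do that, assuming the model output is a dictionary with `company`, `date`, and `total` keys; those key names, the accepted date formats, and the function names are illustrative assumptions rather than a documented output schema.

```python
import re
from datetime import date, datetime
from decimal import Decimal, InvalidOperation
from typing import Optional

# Assumed date layouts; extend this tuple for the documents you actually see.
DATE_FORMATS = ("%d/%m/%Y", "%d-%m-%Y", "%Y-%m-%d")


def parse_amount(text: str) -> Optional[Decimal]:
    """Drop currency symbols and thousands separators, then parse as a decimal."""
    digits = re.sub(r"[^\d.]", "", text)
    try:
        return Decimal(digits)
    except InvalidOperation:
        return None


def parse_date(text: str) -> Optional[date]:
    """Try each assumed format in turn and return the first one that matches."""
    for fmt in DATE_FORMATS:
        try:
            return datetime.strptime(text.strip(), fmt).date()
        except ValueError:
            continue
    return None


def normalize_receipt(fields: dict) -> dict:
    """Convert raw string fields into typed values; unparseable values become None."""
    company = str(fields.get("company", "")).strip()
    return {
        "company": company or None,
        "date": parse_date(str(fields.get("date", ""))),
        "total": parse_amount(str(fields.get("total", ""))),
    }
```

Returning `None` for values that fail to parse keeps bad extractions visible, so they can be routed to manual review instead of being silently stored.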