Donut-base-sroie Open-source Document Understanding Model - Free for Image Text Extraction Tasks

Home

Donut Base Sroie

Developed by philschmid

A document understanding model fine-tuned from naver-clova-ix/donut-base, suitable for image text extraction tasks

Text Recognition

Transformers

Open Source License:MIT #Document Understanding #Image to Text #Structured Data Extraction

Downloads 185

Release Time : 9/2/2022

Model Overview

This model is a document understanding model based on the Donut architecture, specifically fine-tuned for text extraction tasks in images. It is suitable for processing image documents containing text, such as receipts and invoices.

Model Features

Document Image Understanding

Optimized for text extraction tasks in document images (e.g., receipts, invoices)

Transformer-based Architecture

Utilizes the Donut architecture, combining vision and language processing capabilities

End-to-End Processing

Directly processes from image input to text output without intermediate OCR steps

Model Capabilities

Document image text extraction

Receipt information recognition

Invoice data extraction

Use Cases

Business Document Processing

Receipt Information Extraction

Automatically extracts key information from scanned or photographed receipts

Invoice Data Processing

Automatically identifies information such as amount, date, and supplier in invoices

Property	Details
Model Type	donut - base - sroie
Training Data	imagefolder

Featured Recommended AI Models

Empowering the Future, Your AI Solution Knowledge Base

English 简体中文繁體中文にほんご

Donut Base Sroie

Model Overview

Model Features

Model Capabilities

Use Cases

🚀 donut-base-sroie

🚀 Quick Start

📚 Documentation

Model description

Intended uses & limitations

Training and evaluation data

Training procedure

Training hyperparameters

Training results

Framework versions

📄 License