Donut-demo Open-source Model - Free Deployment to Facilitate Document Understanding Tasks

Home

Donut Demo

Developed by nielsr

VisionEncoderDecoder model fine-tuned on the CORD-v2 dataset for document understanding tasks

Text Recognition

Transformers

Open Source License:MIT #Document Understanding #Visual Encoder-Decoder #Receipt Recognition

Downloads 56

Release Time : 8/15/2022

Model Overview

This model is a document understanding model based on the Donut architecture, specifically fine-tuned for the CORD-v2 receipt dataset, capable of extracting structured information from document images.

Model Features

Document Understanding Capability

Able to extract structured information from complex document layouts

End-to-End Training

Uses VisionEncoderDecoder architecture for end-to-end training

Receipt Parsing

Optimized information extraction capability specifically for receipt documents

Model Capabilities

Document Image Understanding

Structured Information Extraction

Receipt Data Parsing

End-to-End Document Processing

Use Cases

Business Automation

Automated Receipt Processing

Automatically extracts product, price, and other information from receipt images

Replaces manual data entry, improving financial processing efficiency

Document Digitization

Document Information Extraction

Converts unstructured documents into structured data

Facilitates subsequent data analysis and processing

Featured Recommended AI Models

Empowering the Future, Your AI Solution Knowledge Base

English 简体中文繁體中文にほんご

Donut Demo

Model Overview

Model Features

Model Capabilities

Use Cases

🚀 Donut Demo

🚀 Quick Start

📄 License