donut_pdf_ocr Open-source OCR Model - Free and Efficient Text Recognition for PDF Documents

Home

Donut Pdf Ocr

Developed by shubh1608

OCR model trained on image folder datasets for text recognition in PDF documents

Text Recognition

Transformers

#PDF Document OCR #High-precision Text Recognition #Image to Text Conversion

Downloads 67

Release Time : 4/17/2023

Model Overview

This model is an Optical Character Recognition (OCR) model specifically designed to extract text content from PDF document images. It achieves high-precision text recognition through deep learning technology.

Model Features

High-precision OCR

Achieved a low loss value of 0.0443 on the evaluation set, indicating high recognition accuracy.

End-to-End Training

The model adopts an end-to-end training approach, directly outputting text from images.

PDF Document Optimization

Specifically optimized for training on PDF document images.

Model Capabilities

PDF Document Image Text Recognition

Multi-format Text Output

Document Structure Analysis

Use Cases

Document Digitization

PDF Document Conversion

Convert scanned PDF documents into editable text formats.

Highly accurate text conversion.

Office Automation

Document Information Extraction

Automatically extract key information from contracts, invoices, and other documents.

Improved data processing efficiency.

Training Loss	Epoch	Step	Validation Loss
0.0829	1.0	47	0.1157
0.0184	2.0	94	0.1660
0.0533	3.0	141	0.0765
0.0765	4.0	188	0.0530
0.101	5.0	235	0.0481
0.0936	6.0	282	0.0494
0.1032	7.0	329	0.0524
0.0033	8.0	376	0.0460
0.0185	9.0	423	0.0440
0.0044	10.0	470	0.0443

Featured Recommended AI Models

Empowering the Future, Your AI Solution Knowledge Base

English 简体中文繁體中文にほんご

Donut Pdf Ocr

Model Overview

Model Features

Model Capabilities

Use Cases

🚀 donut_pdf_ocr

🚀 Quick Start

🔧 Technical Details

Training hyperparameters

Training results

Framework versions