T

Trocr Base Printed

Developed by microsoft
TrOCR is a Transformer-based optical character recognition model designed for single-line text image recognition, employing an encoder-decoder architecture
Downloads 184.84k
Release Time : 3/2/2022

Model Overview

This model combines an image Transformer encoder and a text Transformer decoder, suitable for optical character recognition tasks on printed text, specifically fine-tuned for the SROIE dataset

Model Features

Transformer Architecture
Utilizes advanced Transformer architecture to process image and text sequences, enabling end-to-end OCR
Pre-trained Weight Initialization
Image encoder uses BEiT pre-trained weights, text decoder uses RoBERTa pre-trained weights
Printed Text Optimization
Specifically optimized for printed text recognition, performs well on the SROIE dataset

Model Capabilities

Single-line text image recognition
Printed character recognition
End-to-end OCR processing

Use Cases

Document Digitization
Receipt Recognition
Automatically recognize text information in scanned receipts
Performs well on the SROIE dataset
Invoice Processing
Extract key field information from invoice images
Suitable for structured document processing
Office Automation
Form Recognition
Convert printed forms into editable text
Ideal for processing well-formatted documents
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase