T

Trocr Large Stage1

Developed by microsoft
TrOCR is a Transformer-based pre-trained model for Optical Character Recognition (OCR) tasks.
Downloads 3,700
Release Time : 3/2/2022

Model Overview

TrOCR is an encoder-decoder model composed of an image Transformer encoder and a text Transformer decoder, specifically designed for optical character recognition of single-line text images.

Model Features

Transformer-based architecture
Utilizes advanced Transformer architecture, combining image and text processing capabilities.
Pre-trained model
Model weights are pre-trained and can be used directly or fine-tuned.
Single-line text recognition
Specifically optimized for optical character recognition of single-line text images.

Model Capabilities

Image-to-text
Optical Character Recognition
Single-line text recognition

Use Cases

Document digitization
Scanned document recognition
Convert scanned document images into editable text.
Automated processing
Form processing
Automatically recognize and extract text information from forms.
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase