T

Trocr Small Korean

Developed by team-lucid
TrOCR is a Korean image-to-text model based on a vision encoder-decoder architecture, using DeiT as the image encoder and RoBERTa as the text decoder.
Downloads 342
Release Time : 6/30/2023

Model Overview

This model is specifically designed to convert text in Korean images into editable text format, suitable for scenarios such as document digitization.

Model Features

Synthetic Data Training
Trained on 6 million synthetic images generated by synthtiger, covering diverse text scenarios.
Hybrid Architecture
Combines the strengths of DeiT vision encoder and RoBERTa text decoder for efficient image-to-text conversion.
TPU-Optimized Training
Training process supported by Google's TPU Research Cloud (TRC), ensuring efficiency for large-scale training.

Model Capabilities

Korean text recognition
Image-to-text conversion
Document digitization processing

Use Cases

Document Processing
Korean Document Digitization
Convert scanned Korean documents or images into editable text
Improves document processing efficiency and supports subsequent text analysis
Mobile Applications
Korean OCR Application
Integrated into mobile apps for real-time Korean text recognition
Enables users to quickly extract Korean text from images
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
Š 2025AIbase