Trocr Small Korean
T
Trocr Small Korean
Developed by team-lucid
TrOCR is a Korean image-to-text model based on a vision encoder-decoder architecture, using DeiT as the image encoder and RoBERTa as the text decoder.
Downloads 342
Release Time : 6/30/2023
Model Overview
This model is specifically designed to convert text in Korean images into editable text format, suitable for scenarios such as document digitization.
Model Features
Synthetic Data Training
Trained on 6 million synthetic images generated by synthtiger, covering diverse text scenarios.
Hybrid Architecture
Combines the strengths of DeiT vision encoder and RoBERTa text decoder for efficient image-to-text conversion.
TPU-Optimized Training
Training process supported by Google's TPU Research Cloud (TRC), ensuring efficiency for large-scale training.
Model Capabilities
Korean text recognition
Image-to-text conversion
Document digitization processing
Use Cases
Document Processing
Korean Document Digitization
Convert scanned Korean documents or images into editable text
Improves document processing efficiency and supports subsequent text analysis
Mobile Applications
Korean OCR Application
Integrated into mobile apps for real-time Korean text recognition
Enables users to quickly extract Korean text from images
Featured Recommended AI Models
Š 2025AIbase