TrOCR Base Handwritten Hist Swe 2

Developed by Riksarkivet
A historical handwriting recognition model developed jointly by the Swedish National Archives (Riksarkivet) and other institutions, designed specifically for Swedish handwritten texts from 1600 to 1900.
Downloads 5,765
Release Time: 8/15/2024

Model Overview

A handwritten text recognition (HTR) model based on the TrOCR architecture, suited to recognizing running Swedish handwriting from the early 17th to the late 19th century.

Model Features

Historical Handwriting Optimization
Specially optimized for Swedish historical handwriting from 1600-1900.
Multi-domain Dataset
Trained on multiple historical document datasets from the Swedish National Archives, covering various fields such as law, administration, and mining.
Line-level Text Recognition
Works on line-level input and is intended to be used together with a text-line segmentation step.
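
Because recognition is line-level, a typical workflow segments each page into line images first and then passes those images to the model in batches. The following is a minimal sketch of that second step using the Hugging Face transformers library, not an official usage recipe. The Hub ids in MODEL_ID and PROCESSOR_ID are assumptions that should be checked against the model card, and the function names batched and transcribe_lines are hypothetical.

```python
from typing import Iterator, List, Sequence

# Assumed Hugging Face Hub ids; verify both against the model card before use.
MODEL_ID = "Riksarkivet/trocr-base-handwritten-hist-swe-2"
PROCESSOR_ID = "microsoft/trocr-base-handwritten"


def batched(items: Sequence, size: int) -> Iterator[List]:
    """Yield consecutive chunks of at most `size` items."""
    if size < 1:
        raise ValueError("size must be at least 1")
    for start in range(0, len(items), size):
        yield list(items[start:start + size])


def transcribe_lines(line_image_paths: Sequence[str], batch_size: int = 8) -> List[str]:
    """Transcribe pre-segmented text-line images, returning one string per image."""
    # Heavy dependencies are imported lazily so `batched` works without them.
    import torch
    from PIL import Image
    from transformers import TrOCRProcessor, VisionEncoderDecoderModel

    processor = TrOCRProcessor.from_pretrained(PROCESSOR_ID)
    model = VisionEncoderDecoderModel.from_pretrained(MODEL_ID)
    model.eval()

    texts: List[str] = []
    for chunk in batched(line_image_paths, batch_size):
        # Each input must already be a single cropped text line, not a full page.
        images = [Image.open(path).convert("RGB") for path in chunk]
        pixel_values = processor(images=images, return_tensors="pt").pixel_values
        with torch.no_grad():
            generated_ids = model.generate(pixel_values)
        texts.extend(processor.batch_decode(generated_ids, skip_special_tokens=True))
    return texts
```

A call such as transcribe_lines(["line_001.png", "line_002.png"]) would return the transcriptions in input order; the file names here are placeholders. Line segmentation itself is outside the scope of this sketch and has to be handled by a separate tool.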

Model Capabilities

Handwritten text recognition
Historical document transcription
Swedish text extraction

Use Cases

Historical Document Digitization
Court Judgment Transcription
Transcription of Swedish court judgments from the 17th-19th centuries.
Character error rate (CER) as low as 0.0075 on the War Appeal Court dataset.
Historical Letter Processing
Batch processing of historical letter collections.
Fine-tuning on 20-50 annotated documents is recommended before processing.
Archive Management
Police Archive Transcription
Processing records from the Gothenburg Police Department archives (1850-1900).
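
The CER quoted for the court judgment use case is a character error rate. The sketch below shows the conventional way such a figure is computed, as the Levenshtein edit distance between a model transcription and a reference transcription, divided by the reference length. This is the standard definition and an assumption about how the reported number was obtained; the page does not describe the evaluation procedure. The function names are illustrative.

```python
def levenshtein(a: str, b: str) -> int:
    """Minimum number of character insertions, deletions and substitutions turning a into b."""
    if len(a) < len(b):
        a, b = b, a  # keep the inner row as short as possible
    previous = list(range(len(b) + 1))
    for i, char_a in enumerate(a, start=1):
        current = [i]
        for j, char_b in enumerate(b, start=1):
            substitution_cost = 0 if char_a == char_b else 1
            current.append(min(
                previous[j] + 1,                      # deletion
                current[j - 1] + 1,                   # insertion
                previous[j - 1] + substitution_cost,  # substitution or match
            ))
        previous = current
    return previous[-1]


def cer(reference: str, hypothesis: str) -> float:
    """Character error rate: edit distance divided by the reference length."""
    if not reference:
        raise ValueError("reference must not be empty")
    return levenshtein(reference, hypothesis) / len(reference)
```

Under this definition, and reading the reported value as a fraction, a CER of 0.0075 corresponds to roughly 7 or 8 character errors per 1,000 reference characters. Comparing cer(reference, hypothesis) on a small hand-transcribed sample is one way to judge whether fine-tuning is needed for a new collection.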