court-records-htr: Open-Source Handwritten Text Recognition Model for 19th-Century Finnish and Swedish Court Records

Court Records Htr

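Developed by Kansallisarkisto

A handwritten text recognition model fine-tuned from Microsoft's TrOCR, built specifically for 19th-century court records in Finnish and Swedish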

Text recognition

PyTorch

License: MIT #historical-handwriting-recognition #finnish-swedish-ocr #court-archive-digitization

Downloads: 24

Published: 9/12/2024

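Model Overview

This model recognizes handwritten text in text line images. It was trained specifically on digitized 19th-century court records written in Finnish and Swedish.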

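Model Features

Optimized for historical documents

Trained specifically on the characteristics of 19th-century handwriting, and performs well on historical document recognition

Multilingual support

Recognizes handwriting in both Finnish and Swedish

High-accuracy recognition

Achieves a character error rate (CER) of 2.4% and a word error rate (WER) of 11.3% on the validation set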
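To make the two error rates concrete, here is a minimal sketch of how CER and WER are commonly defined: the Levenshtein edit distance between a reference transcription and the model output, divided by the reference length in characters or words. This is a generic illustration, not necessarily the exact evaluation procedure used for this model; the function names and the sample strings are illustrative assumptions.

```python
def edit_distance(ref, hyp):
    """Levenshtein distance between two sequences (strings or lists of words)."""
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        curr = [i]
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(
                prev[j] + 1,         # delete an element of the reference
                curr[j - 1] + 1,     # insert an element of the hypothesis
                prev[j - 1] + cost,  # substitute, or match at no cost
            ))
        prev = curr
    return prev[-1]


def cer(reference, hypothesis):
    """Character error rate: character-level edits per reference character."""
    if not reference:
        raise ValueError("reference must not be empty")
    return edit_distance(reference, hypothesis) / len(reference)


def wer(reference, hypothesis):
    """Word error rate: word-level edits per reference word."""
    ref_words = reference.split()
    if not ref_words:
        raise ValueError("reference must contain at least one word")
    return edit_distance(ref_words, hypothesis.split()) / len(ref_words)


# Illustrative strings only, not taken from the training or validation data.
reference = "käräjät pidettiin"
hypothesis = "käräjät pidetiin"  # one missing "t"
print(f"CER: {cer(reference, hypothesis):.3f}")
print(f"WER: {wer(reference, hypothesis):.3f}")
```

A single wrong character makes the whole word count as an error, which is why WER is typically several times higher than CER, as in the figures reported above.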

Model Capabilities

Handwritten text recognition

Historical document processing

Multilingual text extraction

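Use Cases

Digitizing historical archives

Court record transcription

Converts 19th-century handwritten court records into searchable digital text

Provides automatic transcription with a character error rate of 2.4% on the validation set

Genealogy research

Processing historical population records

Automatically recognizes handwritten entries in historical population registers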

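🚀 Handwritten Text Recognition Model for 19th-Century Finnish Court Records

This model performs handwritten text recognition on text line images. It was trained by fine-tuning Microsoft's TrOCR model on digitized 19th-century court records in Finnish and Swedish.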

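🚀 Quick Start

The following code predicts the text content of a text line image. Running inference on a GPU is recommended when one is available.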

from transformers import TrOCRProcessor, VisionEncoderDecoderModel
from PIL import Image
import torch

# Use GPU if available
device = torch.device("cuda" if torch.cuda.is_available() else "cpu")

# Model location in Huggingface Hub
model_checkpoint = "Kansallisarkisto/court-records-htr"
# Path to textline image
line_image_path = "/path/to/textline_image.jpg"

# Initialize processor and model
processor = TrOCRProcessor.from_pretrained(model_checkpoint)
model = VisionEncoderDecoderModel.from_pretrained(model_checkpoint).to(device)

# Open image file and extract pixel values
image = Image.open(line_image_path).convert("RGB")
pixel_values = processor(image, return_tensors="pt").pixel_values

# Use the model to generate predictions 
generated_ids = model.generate(pixel_values.to(device))
# Use the processor to decode ids to text
generated_text = processor.batch_decode(generated_ids, skip_special_tokens=True)[0]
print(generated_text)
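
A page usually contains many text lines, so in practice it can help to transcribe several line images per forward pass. The following is a minimal sketch of batched inference built on the same calls as the snippet above. The helper names `chunked` and `transcribe_lines`, the default batch size of 8, and the example file names are assumptions made for illustration; they are not part of the model or of the Transformers library.

```python
def chunked(items, size):
    """Yield consecutive slices of `items` with at most `size` elements each."""
    if size < 1:
        raise ValueError("size must be at least 1")
    for start in range(0, len(items), size):
        yield items[start:start + size]


def transcribe_lines(image_paths, batch_size=8,
                     model_checkpoint="Kansallisarkisto/court-records-htr"):
    """Return one predicted text string per text line image path."""
    # Heavy dependencies are imported here so that `chunked` works without them.
    import torch
    from PIL import Image
    from transformers import TrOCRProcessor, VisionEncoderDecoderModel

    device = torch.device("cuda" if torch.cuda.is_available() else "cpu")
    processor = TrOCRProcessor.from_pretrained(model_checkpoint)
    model = VisionEncoderDecoderModel.from_pretrained(model_checkpoint).to(device)
    model.eval()

    texts = []
    for batch in chunked(list(image_paths), batch_size):
        images = [Image.open(path).convert("RGB") for path in batch]
        # The processor resizes every image to the same size, so they stack into one tensor.
        pixel_values = processor(images=images, return_tensors="pt").pixel_values
        with torch.no_grad():  # gradients are not needed for inference
            generated_ids = model.generate(pixel_values.to(device))
        texts.extend(processor.batch_decode(generated_ids, skip_special_tokens=True))
    return texts


if __name__ == "__main__":
    # Hypothetical file names; replace them with real text line images.
    for line in transcribe_lines(["line_001.jpg", "line_002.jpg"]):
        print(line)
```

The processor and model are loaded once and reused across batches. A suitable batch size depends on the available GPU memory, so the default here is only a starting point.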