D

DA BERT Old News V1

Developed by CALDISS-AAU
The first transformer model trained on historical texts from Denmark's Absolute Monarchy period (1660-1849), developed by researchers at Aalborg University for processing historical texts with significant differences from modern Danish.
Downloads 48
Release Time : 4/1/2025

Model Overview

A BERT model pre-trained on MLM tasks, specifically optimized for historical texts from Denmark's Absolute Monarchy period, enabling better understanding and processing of texts that differ significantly from modern Danish.

Model Features

Historical text optimization
Specifically trained on historical texts from Denmark's Absolute Monarchy period (1660-1849), better capturing semantics that differ significantly from modern Danish.
Custom tokenizer
Uses a custom WordPiece tokenizer optimized for tokenizing historical texts.
High-quality training data
Training data sourced from the ENO corpus, containing news, announcements, and advertisements from Danish and Norwegian newspapers between 1762 and 1848, with a word-level error rate of approximately 5%.

Model Capabilities

Masked language modeling
Historical text semantic understanding

Use Cases

Historical research
Historical text analysis
Used to analyze historical texts from Denmark's Absolute Monarchy period, helping researchers understand language usage and social context of the time.
Historical document translation assistance
Assists in translating historical documents by providing more accurate semantic understanding.
Linguistics
Language evolution research
Used to study the evolution of Danish from the Absolute Monarchy period to modern times.
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase