Nel Mgenre Multilingual
A multilingual generative entity retrieval model based on mGENRE, optimized for historical texts, supporting 100+ languages with special adaptation for French, German, and English historical document entity linking.
Knowledge Graph
Transformers Supports Multiple Languages#Historical Text Entity Linking#Multilingual Entity Disambiguation#Wikidata Mapping

Downloads 17.13k
Release Time : 4/9/2024
Model Overview
This model employs the mBART architecture, using constrained generation techniques to link named entities in text to Wikidata entities, particularly suited for handling OCR noise and variant names in historical documents.
Model Features
Multilingual Support
Supports entity linking in 100+ languages, with special optimization for historical text processing in French, German, and English.
Historical Text Adaptation
Specifically optimized for OCR noise and name variants in historical documents.
Constrained Generation Technique
Uses constrained beam search to directly output entity names mapped to Wikidata/QIDs.
Cross-Era Linking
Accurately links historical names to modern Wikidata entities.
Model Capabilities
Multilingual entity recognition
Named entity disambiguation
Historical name linking
Text-to-entity generation
Entity recognition in OCR noise environments
Use Cases
Historical Archive Processing
Historical Newspaper Analysis
Extract and link entities such as people and places from historical newspapers.
Accurately identifies and links entity names affected by OCR noise.
Biography Generation Assistance
Assists in generating biographies by linking historical figures to entities.
Establishes connections between historical figures and modern knowledge bases.
Cross-Era Knowledge Association
Historical Event Analysis
Links participants in historical events to a unified knowledge base.
Integrates historical events with modern knowledge graphs.
Featured Recommended AI Models