NEL mGENRE Multilingual

Developed by impresso-project
A multilingual generative entity retrieval model based on mGENRE and optimized for historical texts. It supports 100+ languages and is specially adapted for entity linking in French, German, and English historical documents.
Downloads 17.13k
Release Time: 4/9/2024

Model Overview

This model uses the mBART architecture with constrained generation to link named entities in text to Wikidata entities. It is particularly suited to handling OCR noise and variant names in historical documents.
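
To make the input and output side of generative entity linking more concrete, here is a minimal sketch. It assumes the conventions of the upstream mGENRE project: the mention is wrapped in `[START]` and `[END]` markers, and the generated string has the form `Title >> lang`. Neither convention is confirmed by this page for this particular model, and `mark_mention` and `parse_generation` are hypothetical helper names introduced only for illustration.

```python
from typing import NamedTuple, Optional


class Generation(NamedTuple):
    title: str
    lang: Optional[str]


def mark_mention(text: str, start: int, end: int) -> str:
    """Wrap text[start:end] in [START] / [END] markers (assumed mGENRE-style input)."""
    if not (0 <= start < end <= len(text)):
        raise ValueError("mention span out of range")
    return f"{text[:start]}[START] {text[start:end]} [END]{text[end:]}"


def parse_generation(output: str) -> Generation:
    """Split an assumed 'Title >> lang' string into title and language code."""
    title, sep, lang = output.rpartition(" >> ")
    if not sep:  # no language suffix present
        return Generation(output.strip(), None)
    return Generation(title.strip(), lang.strip())


# Illustrative sentence with a historical name variant as the mention.
sentence = "Le général Buonaparte arriva à Paris."
start = sentence.index("Buonaparte")
marked = mark_mention(sentence, start, start + len("Buonaparte"))
print(marked)

# Illustrative generated string, not real model output.
print(parse_generation("Napoléon Ier >> fr"))
```

The marked sentence is what would be passed to the sequence-to-sequence model. Mapping the parsed title to a Wikidata QID would require a separate lookup table, which is not shown here.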

Model Features

Multilingual Support
Supports entity linking in 100+ languages, with special optimization for historical text processing in French, German, and English.
Historical Text Adaptation
Specifically optimized for OCR noise and name variants in historical documents.
Constrained Generation Technique
Uses constrained beam search to directly output entity names that map to Wikidata QIDs.
Cross-Era Linking
Accurately links historical names to modern Wikidata entities.
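
The constrained generation feature above can be illustrated with a toy example. The sketch below shows one common way to constrain beam search: a prefix trie of valid entity names restricts which token may follow each partial output, so the decoder can only produce names that exist in the candidate set. It is an assumption-laden illustration, not this model's implementation. The word-level tokenization, the `<eos>` token, and the `toy_log_prob` scorer (a stand-in for the real model) are all hypothetical.

```python
import math
from typing import Callable, List, Sequence, Tuple

END = "<eos>"  # hypothetical end-of-sequence token

Hypothesis = Tuple[Tuple[str, ...], float]


def build_trie(names: Sequence[Sequence[str]]) -> dict:
    """Build a nested-dict prefix trie over tokenized entity names."""
    root: dict = {}
    for tokens in names:
        node = root
        for tok in list(tokens) + [END]:
            node = node.setdefault(tok, {})
    return root


def allowed_next(trie: dict, prefix: Sequence[str]) -> List[str]:
    """Return the tokens that keep `prefix` inside the trie."""
    node = trie
    for tok in prefix:
        node = node.get(tok)
        if node is None:
            return []
    return list(node)


def constrained_beam_search(
    trie: dict,
    log_prob: Callable[[Tuple[str, ...], str], float],
    beam_size: int = 2,
    max_len: int = 10,
) -> List[Hypothesis]:
    """Beam search that only expands tokens permitted by the trie."""
    beams: List[Hypothesis] = [((), 0.0)]
    finished: List[Hypothesis] = []
    for _ in range(max_len):
        candidates: List[Hypothesis] = []
        for prefix, score in beams:
            for tok in allowed_next(trie, prefix):
                new_score = score + log_prob(prefix, tok)
                if tok == END:
                    finished.append((prefix, new_score))
                else:
                    candidates.append((prefix + (tok,), new_score))
        if not candidates:
            break
        candidates.sort(key=lambda c: c[1], reverse=True)
        beams = candidates[:beam_size]
    return sorted(finished, key=lambda c: c[1], reverse=True)


def toy_log_prob(prefix: Tuple[str, ...], tok: str) -> float:
    """Toy scorer with fixed token preferences; a real system would query the model."""
    return math.log({"Berlin": 0.7, "Bern": 0.2, END: 0.6}.get(tok, 0.1))


# Hypothetical candidate set, tokenized into words for readability.
names = [["Berlin"], ["Berlin", ",", "New", "Hampshire"], ["Bern"]]
trie = build_trie(names)
results = constrained_beam_search(trie, toy_log_prob)
for tokens, score in results:
    print(" ".join(tokens), round(score, 3))
```

Every returned hypothesis is a complete name from the candidate set, ranked by total log-probability without length normalization. In Hugging Face `transformers`, the `prefix_allowed_tokens_fn` argument of `generate` plays a role similar to `allowed_next`. Whether this model is served that way is not stated on this page.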

Model Capabilities

Multilingual entity recognition
Named entity disambiguation
Historical name linking
Text-to-entity generation
Entity recognition in OCR-noisy text

Use Cases

Historical Archive Processing
Historical Newspaper Analysis
Extracts and links entities such as people and places from historical newspapers.
Accurately identifies and links entity names affected by OCR noise.
Biography Generation Assistance
Assists in generating biographies by linking historical figures to entities.
Establishes connections between historical figures and modern knowledge bases.
Cross-Era Knowledge Association
Historical Event Analysis
Links participants in historical events to a unified knowledge base.
Integrates historical events with modern knowledge graphs.
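
As a small illustration of what linking to a unified knowledge base makes possible downstream, the sketch below groups different surface forms under the QID they were linked to. The `links` list is hypothetical linker output written by hand for this example, and `group_by_qid` is an illustrative helper, not part of the model.

```python
from collections import defaultdict
from typing import Dict, Iterable, List, Tuple


def group_by_qid(links: Iterable[Tuple[str, str]]) -> Dict[str, List[str]]:
    """Map each QID to the distinct surface forms linked to it, in first-seen order."""
    grouped: Dict[str, List[str]] = defaultdict(list)
    for surface, qid in links:
        if surface not in grouped[qid]:
            grouped[qid].append(surface)
    return dict(grouped)


# Hypothetical linker output: (surface form as printed, predicted Wikidata QID).
links = [
    ("Buonaparte", "Q517"),
    ("Napoléon", "Q517"),
    ("Napo1éon", "Q517"),  # OCR confusion of 'l' and '1'
    ("Paris", "Q90"),
    ("Buonaparte", "Q517"),  # repeated mention, kept only once
]

for qid, surfaces in group_by_qid(links).items():
    print(qid, surfaces)
```

Once variant spellings and OCR-damaged forms share one identifier, mentions from different documents and periods can be counted or joined against other Wikidata-keyed resources.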