X

Xlm Roberta Large Masakhaner

Developed by Davlan
The first named entity recognition model for 10 African languages, fine-tuned on XLM-RoBERTa large, supporting the recognition of four types of entities: dates, locations, organizations, and persons.
Downloads 104
Release Time : 3/2/2022

Model Overview

This model is a named entity recognition (NER) model optimized for African languages, fine-tuned on the MasakhaNER dataset, capable of handling text entity recognition tasks for 10 African languages.

Model Features

Multilingual Support
Supports named entity recognition for 10 African languages, filling the gap in African language NLP tools.
State-of-the-art Performance
Achieved the current optimal NER performance on the MasakhaNER dataset, with a maximum F1 score of 91.75 (Hausa).
Fine-grained Entity Classification
Can identify four types of entities (DATE/LOC/ORG/PER) and their starting positions (B-/I- tags).

Model Capabilities

African Language Text Processing
Named Entity Recognition
Multilingual NLP

Use Cases

News Analysis
African News Entity Extraction
Automatically extract key information such as persons, organizations, and locations from news articles in various African languages
F1 scores range from 70.70 to 91.75 (depending on the language)
Cross-lingual Information Processing
Multilingual Document Analysis
Process entity information in mixed texts containing multiple African languages
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase