🚀 mLUKE
mLUKE (multilingual LUKE) is a multilingual extension of LUKE, which can be applied to tasks like named entity recognition, relation classification, and question answering.
Please check the official repository for more details and updates.
✨ Features
- Multilingual Support: It supports 24 languages: Arabic, Bengali, German, Greek, English, Spanish, Finnish, French, Hindi, Indonesian, Italian, Japanese, Korean, Dutch, Polish, Portuguese, Russian, Swedish, Swahili, Telugu, Thai, Turkish, Vietnamese, and Chinese.
- Model Architecture: This is the mLUKE base model with 12 hidden layers and a hidden size of 768. The total number of parameters is 279M.
- Initialization and Training: The model was initialized with the weights of XLM-RoBERTa (base) and trained using the December 2020 version of Wikipedia in 24 languages.
- Lightweight Version: It is a lightweight version of studio-ousia/mluke-base that omits the Wikipedia entity embeddings and keeps only special entities such as `[MASK]`.
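
A minimal usage sketch, assuming the Hugging Face Transformers and SentencePiece packages are installed (the sentence and entity span below are purely illustrative):

```python
from transformers import MLukeTokenizer, LukeModel

# Load the lite checkpoint; it ships word embeddings plus special entities such as [MASK].
tokenizer = MLukeTokenizer.from_pretrained("studio-ousia/mluke-base-lite")
model = LukeModel.from_pretrained("studio-ousia/mluke-base-lite")

# Illustrative input: mark a character-level span to be treated as an entity mention.
# When entity_spans is given without explicit entities, each span is filled
# with the special [MASK] entity.
text = "Tokyo is the capital of Japan."
entity_spans = [(0, 5)]  # character span covering "Tokyo"

inputs = tokenizer(text, entity_spans=entity_spans, return_tensors="pt")
outputs = model(**inputs)

print(outputs.last_hidden_state.shape)         # contextual word representations
print(outputs.entity_last_hidden_state.shape)  # contextual entity representations
```

Because the lite checkpoint has no Wikipedia entity vocabulary, each marked span is represented with the special `[MASK]` entity, which is the intended input format for downstream fine-tuning.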
📚 Documentation
Note
When you load the model with `AutoModel.from_pretrained` using the default configuration, you will see the following warning:
```
Some weights of the model checkpoint at studio-ousia/mluke-base-lite were not used when initializing LukeModel: [
    'luke.encoder.layer.0.attention.self.w2e_query.weight', 'luke.encoder.layer.0.attention.self.w2e_query.bias',
    'luke.encoder.layer.0.attention.self.e2w_query.weight', 'luke.encoder.layer.0.attention.self.e2w_query.bias',
    'luke.encoder.layer.0.attention.self.e2e_query.weight', 'luke.encoder.layer.0.attention.self.e2e_query.bias',
    ...]
```
These weights are the weights for entity-aware attention (as described in the LUKE paper). This is expected because `use_entity_aware_attention` is set to `false` by default, but the pretrained checkpoint contains these weights in case you enable `use_entity_aware_attention` and want them loaded into the model.
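
If you do want entity-aware attention, a minimal sketch of enabling it (assuming the Hugging Face Transformers library; the flag name matches the one in the warning above):

```python
from transformers import LukeConfig, LukeModel

# Override the checkpoint's default so the w2e/e2w/e2e query weights
# are loaded instead of being discarded.
config = LukeConfig.from_pretrained(
    "studio-ousia/mluke-base-lite",
    use_entity_aware_attention=True,
)
model = LukeModel.from_pretrained("studio-ousia/mluke-base-lite", config=config)
```

With this configuration, the warning about unused `w2e_query`/`e2w_query`/`e2e_query` weights should no longer appear.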
Citation
If you find mLUKE useful for your work, please cite the following paper:
```bibtex
@inproceedings{ri-etal-2022-mluke,
    title = "m{LUKE}: {T}he Power of Entity Representations in Multilingual Pretrained Language Models",
    author = "Ri, Ryokan and
      Yamada, Ikuya and
      Tsuruoka, Yoshimasa",
    booktitle = "Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)",
    year = "2022",
    url = "https://aclanthology.org/2022.acl-long.505",
}
```
📄 License
This model is licensed under the Apache-2.0 license.