Afro-xlmr-miniオープンソース言語モデル - 17種のアフリカ言語と3種の高リソース言語のアプリケーションをサポート

ホーム

Afro Xlmr Mini

Davlanによって開発

AfroXLMR-miniは、XLM-R-miniLMモデルを17のアフリカ言語でマスク言語モデル（MLM）適応訓練することで作成され、アフリカの主要語族と3つの高リソース言語（アラビア語、フランス語、英語）をカバーしています。

大規模言語モデル

Transformers

オープンソースライセンス:MIT #アフリカ言語最適化 #多言語転移学習 #軽量モデル

ダウンロード数 66

リリース時間 : 4/13/2022

モデル概要

このモデルはアフリカ言語に最適化された多言語事前訓練言語モデルで、多言語適応微調整によりアフリカ言語での性能を向上させています。

モデル特徴

アフリカ言語最適化

17のアフリカ言語に特化して最適化訓練され、アフリカ言語タスクでの性能を向上

多言語適応

多言語適応微調整（MAFT）手法により、単言語適応とクロスランゲージ転移能力のバランスを実現

効率的なアーキテクチャ

XLM-R-miniLMベースの軽量アーキテクチャで、性能を維持しながらモデルサイズを削減

モデル能力

多言語テキスト理解

固有表現認識

ニューストピック分類

感情分析

使用事例

自然言語処理

アフリカ言語固有表現認識

MasakhaNERデータセットでのアフリカ言語エンティティ認識

ハウサ語で87.7 F1値、ナイジェリアピジン語で85.5 F1値を達成

クロスランゲージ転移学習

下流のアフリカ言語NLPタスクをサポートするベースモデルとして

ゼロショットクロスランゲージ転移をサポート

🚀 afro-xlmr-mini

AfroXLMR-miniは、主要なアフリカの言語ファミリーをカバーする17のアフリカ言語（アフリカーンス語、アムハラ語、ハウサ語、イボ語、マラガシ語、チチェワ語、オロモ語、ナイジャ語、キニヤルワンダ語、キルンディ語、ショナ語、ソマリ語、セソト語、スワヒリ語、イシコサ語、ヨルバ語、イシズール語）と3つの高リソース言語（アラビア語、フランス語、英語）で、XLM - R - miniLMモデルをMLM適応させることで作成されました。

✨ 主な機能

MasakhaNERでの評価結果 (F値)

言語	XLM - R - miniLM	XLM - R - base	XLM - R - large	afro - xlmr - base	afro - xlmr - small	afro - xlmr - mini
amh	69.5	70.6	76.2	76.1	70.1	69.7
hau	74.5	89.5	90.5	91.2	91.4	87.7
ibo	81.9	84.8	84.1	87.4	86.6	83.5
kin	68.6	73.3	73.8	78.0	77.5	74.1
lug	64.7	79.7	81.6	82.9	83.2	77.4
luo	11.7	74.9	73.6	75.1	75.4	17.5
pcm	83.2	87.3	89.0	89.6	89.0	85.5
swa	86.3	87.4	89.4	88.6	88.7	86.0
wol	51.7	63.9	67.9	67.4	65.9	59.0
yor	72.0	78.3	78.9	82.1	81.3	75.1

BibTeXエントリと引用情報

@inproceedings{alabi-etal-2022-adapting,
    title = "Adapting Pre-trained Language Models to {A}frican Languages via Multilingual Adaptive Fine-Tuning",
    author = "Alabi, Jesujoba O.  and
      Adelani, David Ifeoluwa  and
      Mosbach, Marius  and
      Klakow, Dietrich",
    booktitle = "Proceedings of the 29th International Conference on Computational Linguistics",
    month = oct,
    year = "2022",
    address = "Gyeongju, Republic of Korea",
    publisher = "International Committee on Computational Linguistics",
    url = "https://aclanthology.org/2022.coling-1.382",
    pages = "4336--4349",
    abstract = "Multilingual pre-trained language models (PLMs) have demonstrated impressive performance on several downstream tasks for both high-resourced and low-resourced languages. However, there is still a large performance drop for languages unseen during pre-training, especially African languages. One of the most effective approaches to adapt to a new language is language adaptive fine-tuning (LAFT) {---} fine-tuning a multilingual PLM on monolingual texts of a language using the pre-training objective. However, adapting to target language individually takes large disk space and limits the cross-lingual transfer abilities of the resulting models because they have been specialized for a single language. In this paper, we perform multilingual adaptive fine-tuning on 17 most-resourced African languages and three other high-resource languages widely spoken on the African continent to encourage cross-lingual transfer learning. To further specialize the multilingual PLM, we removed vocabulary tokens from the embedding layer that corresponds to non-African writing scripts before MAFT, thus reducing the model size by around 50{\%}. Our evaluation on two multilingual PLMs (AfriBERTa and XLM-R) and three NLP tasks (NER, news topic classification, and sentiment classification) shows that our approach is competitive to applying LAFT on individual languages while requiring significantly less disk space. Additionally, we show that our adapted PLM also improves the zero-shot cross-lingual transfer abilities of parameter efficient fine-tuning methods.",
}