🚀 GLiNER-small モデルカード
GLiNERは、双方向トランスフォーマーエンコーダ(BERTのようなもの)を使用して、任意のエンティティタイプを識別できる固有表現認識(NER)モデルです。これは、事前定義されたエンティティに限定される従来のNERモデルや、柔軟性はあるがリソース制約のあるシナリオではコストが高く大規模な大規模言語モデル(LLM)に代わる実用的な選択肢を提供します。
このバージョンは、Pile-NERデータセットで(研究目的で)学習されています。
📚 詳細ドキュメント
🔗 リンク
💾 利用可能なモデル
📦 インストール
このモデルを使用するには、GLiNERのPythonライブラリをインストールする必要があります。
!pip install gliner
💻 使用例
基本的な使用法
from gliner import GLiNER
model = GLiNER.from_pretrained("urchade/gliner_small-v1")
text = """
Cristiano Ronaldo dos Santos Aveiro (Portuguese pronunciation: [kɾiʃˈtjɐnu ʁɔˈnaldu]; born 5 February 1985) is a Portuguese professional footballer who plays as a forward for and captains both Saudi Pro League club Al Nassr and the Portugal national team. Widely regarded as one of the greatest players of all time, Ronaldo has won five Ballon d'Or awards,[note 3] a record three UEFA Men's Player of the Year Awards, and four European Golden Shoes, the most by a European player. He has won 33 trophies in his career, including seven league titles, five UEFA Champions Leagues, the UEFA European Championship and the UEFA Nations League. Ronaldo holds the records for most appearances (183), goals (140) and assists (42) in the Champions League, goals in the European Championship (14), international goals (128) and international appearances (205). He is one of the few players to have made over 1,200 professional career appearances, the most by an outfield player, and has scored over 850 official senior career goals for club and country, making him the top goalscorer of all time.
"""
labels = ["person", "award", "date", "competitions", "teams"]
entities = model.predict_entities(text, labels)
for entity in entities:
print(entity["text"], "=>", entity["label"])
Cristiano Ronaldo dos Santos Aveiro => person
5 February 1985 => date
Al Nassr => teams
Portugal national team => teams
Ballon d'Or => award
UEFA Men's Player of the Year Awards => award
European Golden Shoes => award
UEFA Champions Leagues => competitions
UEFA European Championship => competitions
UEFA Nations League => competitions
Champions League => competitions
European Championship => competitions
📊 固有表現認識のベンチマーク結果

👨💻 モデルの作成者
モデルの作成者は以下の通りです。
📄 引用
@misc{zaratiana2023gliner,
title={GLiNER: Generalist Model for Named Entity Recognition using Bidirectional Transformer},
author={Urchade Zaratiana and Nadi Tomeh and Pierre Holat and Thierry Charnois},
year={2023},
eprint={2311.08526},
archivePrefix={arXiv},
primaryClass={cs.CL}
}
📄 ライセンス
このモデルは cc-by-nc-4.0 ライセンスの下で提供されています。