CareerBERT - JGオープンソース求職モデル - 職業相談と的確な求職推薦をサポート

ホーム

Careerbert Jg

lwolfrum2によって開発

CareerBERT-JGは、ESCO分類法に基づいて微調整された文章変換器モデルで、職業相談や求職推薦のシーンに特化して設計されています。

テキスト埋め込みドイツ語#職業推薦 #ESCO分類 #履歴書マッチング

ダウンロード数 309

リリース時間 : 2/26/2025

モデル概要

このモデルはagne/jobGBERTをベースにしており、文章の類似度を計算でき、職業相談や求職推薦などのアプリケーションをサポートします。

モデル特徴

ESCO分類法による微調整

ヨーロッパの技能、能力、職業分類体系で専用に微調整されており、ヨーロッパの就職市場分析に適しています。

職業埋め込み空間

履歴書と職務内容を共有の埋め込み空間にマッピングし、正確なマッチングを実現します。

効率的なプーリング処理

単語の埋め込みを処理するために平均プーリング方法を採用し、注意力マスクを考慮して精度を確保します。

モデル能力

文章の埋め込み生成

テキストの類似度計算

職業関連性分析

履歴書と職務のマッチング

使用事例

職業相談

履歴書と職務のマッチング

求職者の履歴書内容に基づいて、最も関連性の高いESCO職業分類を推薦します。

専門家の評価では、従来の方法よりも優れたマッチング結果を示しました。

職業開発アドバイス

現有的なスキルと目標職務の要求とのマッチング度を分析します。

職業相談員がデータ駆動型のアドバイスを提供するのに役立ちます。

採用システム

自動化された履歴書選別

大量の履歴書と職務要求を迅速にマッチングします。

HR部門の作業効率を向上させます。

🚀 CareerBERT-JG

このモデルは、ESCO Taxonomy でファインチューニングされた文章埋め込みモデルです。ベースモデルは agne/jobGBERT です。

対応する論文はこちら：https://www.sciencedirect.com/science/article/pii/S0957417425006657

🚀 クイックスタート

✨ 主な機能

文章の類似度を計算することができます。
特徴抽出に使用できます。

📦 インストール

sentence-transformers をインストールすることで、このモデルを簡単に使用できます。

pip install -U sentence-transformers

💻 使用例

基本的な使用法

from sentence_transformers import SentenceTransformer
sentences = ["This is an example sentence", "Each sentence is converted"]

model = SentenceTransformer('{MODEL_NAME}')
embeddings = model.encode(sentences)
print(embeddings)

高度な使用法

sentence-transformers を使用せずに、このモデルを使用することもできます。まず、入力をトランスフォーマーモデルに通し、その後、文脈化された単語埋め込みに対して適切なプーリング操作を適用する必要があります。

from transformers import AutoTokenizer, AutoModel
import torch


#Mean Pooling - Take attention mask into account for correct averaging
def mean_pooling(model_output, attention_mask):
    token_embeddings = model_output[0] #First element of model_output contains all token embeddings
    input_mask_expanded = attention_mask.unsqueeze(-1).expand(token_embeddings.size()).float()
    return torch.sum(token_embeddings * input_mask_expanded, 1) / torch.clamp(input_mask_expanded.sum(1), min=1e-9)


# Sentences we want sentence embeddings for
sentences = ['This is an example sentence', 'Each sentence is converted']

# Load model from HuggingFace Hub
tokenizer = AutoTokenizer.from_pretrained('{MODEL_NAME}')
model = AutoModel.from_pretrained('{MODEL_NAME}')

# Tokenize sentences
encoded_input = tokenizer(sentences, padding=True, truncation=True, return_tensors='pt')

# Compute token embeddings
with torch.no_grad():
    model_output = model(**encoded_input)

# Perform pooling. In this case, mean pooling.
sentence_embeddings = mean_pooling(model_output, encoded_input['attention_mask'])

print("Sentence embeddings:")
print(sentence_embeddings)

📚 ドキュメント

評価結果

このモデルの自動評価については、Sentence Embeddings Benchmark を参照してください：https://seb.sbert.net

学習

このモデルは以下のパラメータで学習されました。

DataLoader: torch.utils.data.dataloader.DataLoader の長さは 3695 で、以下のパラメータが使用されました。

{'batch_size': 32, 'sampler': 'torch.utils.data.sampler.RandomSampler', 'batch_sampler': 'torch.utils.data.sampler.BatchSampler'}

Loss: sentence_transformers.losses.MultipleNegativesRankingLoss.MultipleNegativesRankingLoss が使用され、以下のパラメータが設定されました。

{'scale': 20.0, 'similarity_fct': 'cos_sim'}

fit() メソッドのパラメータ:

{
    "epochs": 1,
    "evaluation_steps": 0,
    "evaluator": "sentence_transformers.evaluation.RerankingEvaluator.RerankingEvaluator",
    "max_grad_norm": 1,
    "optimizer_class": "<class 'torch.optim.adamw.AdamW'>",
    "optimizer_params": {
        "lr": 2e-05
    },
    "scheduler": "WarmupLinear",
    "steps_per_epoch": null,
    "warmup_steps": 11821.1,
    "weight_decay": 0.01
}

モデルのアーキテクチャ

SentenceTransformer(
  (0): Transformer({'max_seq_length': 512, 'do_lower_case': False}) with Transformer model: BertModel 
  (1): Pooling({'word_embedding_dimension': 768, 'pooling_mode_cls_token': False, 'pooling_mode_mean_tokens': True, 'pooling_mode_max_tokens': False, 'pooling_mode_mean_sqrt_len_tokens': False})
)

📄 ライセンス

このモデルの引用方法は以下の通りです。

@article{ROSENBERGER2025127043,
title = {CareerBERT: Matching resumes to ESCO jobs in a shared embedding space for generic job recommendations},
journal = {Expert Systems with Applications},
volume = {275},
pages = {127043},
year = {2025},
issn = {0957-4174},
doi = {https://doi.org/10.1016/j.eswa.2025.127043},
url = {https://www.sciencedirect.com/science/article/pii/S0957417425006657},
author = {Julian Rosenberger and Lukas Wolfrum and Sven Weinzierl and Mathias Kraus and Patrick Zschech},
keywords = {Job consultation, Job markets, Job recommendation system, BERT, NLP},
abstract = {The rapidly evolving labor market, driven by technological advancements and economic shifts, presents significant challenges for traditional job matching and consultation services. In response, we introduce an advanced support tool for career counselors and job seekers based on CareerBERT, a novel approach that leverages the power of unstructured textual data sources, such as resumes, to provide more accurate and comprehensive job recommendations. In contrast to previous approaches that primarily focus on job recommendations based on a fixed set of concrete job advertisements, our approach involves the creation of a corpus that combines data from the European Skills, Competences, and Occupations (ESCO) taxonomy and EURopean Employment Services (EURES) job advertisements, ensuring an up-to-date and well-defined representation of general job titles in the labor market. Our two-step evaluation approach, consisting of an application-grounded evaluation using EURES job advertisements and a human-grounded evaluation using real-world resumes and Human Resources (HR) expert feedback, provides a comprehensive assessment of CareerBERTâ€™s performance. Our experimental results demonstrate that CareerBERT outperforms both traditional and state-of-the-art embedding approaches while showing robust effectiveness in human expert evaluations. These results confirm the effectiveness of CareerBERT in supporting career consultants by generating relevant job recommendations based on resumes, ultimately enhancing the efficiency of job consultations and expanding the perspectives of job seekers. This research contributes to the field of NLP and job recommendation systems, offering valuable insights for both researchers and practitioners in the domain of career consulting and job matching.}
}