reranker-bert-tiny-gooaq-bceオープンソースモデル - テキストの類似度を計算し、意味検索などのタスクに使用

ホーム

Reranker Bert Tiny Gooaq Bce

cross-encoder-testingによって開発

これはbert-tinyから微調整されたクロスエンコーダモデルで、テキストペアの類似度スコアを計算するために使用され、意味的テキスト類似度、意味的検索などのさまざまなタスクに適しています。

テキスト埋め込み

Safetensors

英語オープンソースライセンス:Apache-2.0 #質問応答の再ランキング #軽量BERT #意味的マッチング

ダウンロード数 37.19k

リリース時間 : 2/26/2025

モデル概要

このモデルはBERT-tinyアーキテクチャに基づいており、sentence-transformersライブラリを使用して開発されています。主にテキストペアの類似度スコアを計算するために使用され、意味的テキスト類似度、意味的検索、复述マイニング、テキスト分類、クラスタリングなどのタスクに適しています。

モデル特徴

効率的で軽量

BERT-tinyアーキテクチャに基づいており、モデルのサイズが小さく、計算効率が高いです。

多タスク対応

意味的テキスト類似度、意味的検索、复述マイニング、テキスト分類などのさまざまなタスクに使用できます。

高性能

複数の評価データセットで良好な性能を示し、特にGooAQ-devデータセットではmapが0.5677に達しています。

モデル能力

テキスト類似度の計算

意味的検索

テキスト分類

テキストクラスタリング

复述マイニング

使用事例

情報検索

質問応答システムの回答ランキング

候補回答を関連性でランキングし、質問応答システムの品質を向上させます。

GooAQ-devデータセットでmapが0.5677に達しています。

コンテンツ推薦

🚀 GooAQ上で訓練されたBERT-tinyモデル

このモデルは、Cross Encoderに基づいており、sentence-transformersライブラリを使用してprajjwal1/bert-tinyから微調整されました。このモデルは、テキストペアに対してスコアを計算することができ、意味的なテキスト類似度、意味検索、言い換えマイニング、テキスト分類、クラスタリングなどのタスクに使用できます。

このモデルは、train_script.pyを使用して訓練されました。

🚀 クイックスタート

このモデルは、Cross Encoderに基づく微調整モデルで、テキストペアのスコアを計算するために使用でき、意味的なテキスト類似度、意味検索などのさまざまなタスクに適しています。以下の手順に従って、このモデルを使用することができます。

✨ 主な機能

クロスエンコーダモデル：Cross Encoderアーキテクチャに基づいており、テキストペアのスコアを効果的に計算することができます。
事前学習モデルからの微調整：prajjwal1/bert-tinyから微調整されており、事前学習モデルの利点を生かしています。
多タスク適用性：意味的なテキスト類似度、意味検索、言い換えマイニング、テキスト分類、クラスタリングなどのさまざまなタスクに使用できます。

📦 インストール

まず、Sentence Transformersライブラリをインストールする必要があります。

pip install -U sentence-transformers

💻 使用例

基本的な使用法

以下は、このモデルを使用してテキストペアのスコアを計算する例です。

from sentence_transformers import CrossEncoder

# 🤗 Hubからモデルをダウンロード
model = CrossEncoder("cross-encoder-testing/reranker-bert-tiny-gooaq-bce")
# テキストペアを定義
pairs = [
    ['are javascript developers in demand?', "JavaScript is the skill that is most in-demand for IT in 2020, according to a report from developer skills tester DevSkiller. The report, “Top IT Skills report 2020: Demand and Hiring Trends,” has JavaScript switching places with Java when compared to last year's report, with Java in third place this year, behind SQL."],
    ['are javascript developers in demand?', 'In one line difference between the two is: JavaScript is the programming language where as AngularJS is a framework based on JavaScript. ... It is also the basic for all java script based technologies like jquery, angular JS, bootstrap JS and so on. Angular JS is a framework written in javascript and uses MVC architecture.'],
    ['are javascript developers in demand?', 'Java applications are run in a virtual machine or web browser while JavaScript is run on a web browser. Java code is compiled whereas while JavaScript code is in text and in a web page. JavaScript is an OOP scripting language, whereas Java is an OOP programming language.'],
    ['are javascript developers in demand?', 'Things in the body tag are the things that should be displayed: the actual content. Javascript in the body is executed as it is read and as the page is rendered. Javascript in the head is interpreted before anything is rendered.'],
    ['are javascript developers in demand?', 'Web apps tend to be built using JavaScript, CSS and HTML5. Unlike mobile apps, there is no standard software development kit for building web apps. However, developers do have access to templates. Compared to mobile apps, web apps are usually quicker and easier to build — but they are much simpler in terms of features.'],
]
# スコアを予測
scores = model.predict(pairs)
print(scores.shape)
# (5,)

# または、単一のテキストとの類似度に基づいて異なるテキストを並べ替える
ranks = model.rank(
    'are javascript developers in demand?',
    [
        "JavaScript is the skill that is most in-demand for IT in 2020, according to a report from developer skills tester DevSkiller. The report, “Top IT Skills report 2020: Demand and Hiring Trends,” has JavaScript switching places with Java when compared to last year's report, with Java in third place this year, behind SQL.",
        'In one line difference between the two is: JavaScript is the programming language where as AngularJS is a framework based on JavaScript. ... It is also the basic for all java script based technologies like jquery, angular JS, bootstrap JS and so on. Angular JS is a framework written in javascript and uses MVC architecture.',
        'Java applications are run in a virtual machine or web browser while JavaScript is run on a web browser. Java code is compiled whereas while JavaScript code is in text and in a web page. JavaScript is an OOP scripting language, whereas Java is an OOP programming language.',
        'Things in the body tag are the things that should be displayed: the actual content. Javascript in the body is executed as it is read and as the page is rendered. Javascript in the head is interpreted before anything is rendered.',
        'Web apps tend to be built using JavaScript, CSS and HTML5. Unlike mobile apps, there is no standard software development kit for building web apps. However, developers do have access to templates. Compared to mobile apps, web apps are usually quicker and easier to build — but they are much simpler in terms of features.',
    ]
)
# [{'corpus_id': ..., 'score': ...}, {'corpus_id': ..., 'score': ...}, ...]

📚 ドキュメント

モデル詳細

属性	詳細
モデルタイプ	Cross Encoder
ベースモデル	prajjwal1/bert-tiny
最大シーケンス長	512 tokens
出力ラベル数	1 個のラベル
言語	英語
ライセンス	apache-2.0

モデルリソース

ドキュメント：Sentence Transformersドキュメント
ドキュメント：Cross Encoderドキュメント
リポジトリ：GitHub上のSentence Transformers
Hugging Face：Hugging Face上のCross Encoders

評価指標

Cross Encoderの再ランキング

データセット：gooaq-dev、NanoMSMARCO、NanoNFCorpus および NanoNQ
CrossEncoderRerankingEvaluatorを使用して評価

指標	gooaq-dev	NanoMSMARCO	NanoNFCorpus	NanoNQ
map	0.5677 (+0.0366)	0.4280 (-0.0616)	0.3397 (+0.0787)	0.4149 (-0.0047)
mrr@10	0.5558 (+0.0318)	0.4129 (-0.0646)	0.5196 (+0.0198)	0.4132 (-0.0135)
ndcg@10	0.6157 (+0.0245)	0.4772 (-0.0632)	0.3308 (+0.0058)	0.4859 (-0.0147)

Cross Encoder Nano BEIR

データセット：NanoBEIR_R100_mean
CrossEncoderNanoBEIREvaluatorを使用して評価

指標	値
map	0.3942 (+0.0041)
mrr@10	0.4486 (-0.0194)
ndcg@10	0.4313 (-0.0241)

訓練詳細

訓練データセット

未命名データセット

サイズ：578,402個の訓練サンプル
列：question、answer および label

最初の1000個のサンプルに基づく近似統計情報：

	質問	回答	ラベル
タイプ	文字列	文字列	整数
詳細	最小：21文字平均：43.81文字最大：96文字	最小：51文字平均：252.46文字最大：405文字	0：~82.90% 1：~17.10%

サンプル：

質問	回答	ラベル
`are javascript developers in demand?`	`JavaScript is the skill that is most in-demand for IT in 2020, according to a report from developer skills tester DevSkiller. The report, “Top IT Skills report 2020: Demand and Hiring Trends,” has JavaScript switching places with Java when compared to last year's report, with Java in third place this year, behind SQL.`	`1`
`are javascript developers in demand?`	`In one line difference between the two is: JavaScript is the programming language where as AngularJS is a framework based on JavaScript. ... It is also the basic for all java script based technologies like jquery, angular JS, bootstrap JS and so on. Angular JS is a framework written in javascript and uses MVC architecture.`	`0`
`are javascript developers in demand?`	`Java applications are run in a virtual machine or web browser while JavaScript is run on a web browser. Java code is compiled whereas while JavaScript code is in text and in a web page. JavaScript is an OOP scripting language, whereas Java is an OOP programming language.`	`0`

損失関数：BinaryCrossEntropyLoss、パラメータは以下の通り：

{
    "activation_fct": "torch.nn.modules.linear.Identity",
    "pos_weight": 5
}

訓練ハイパーパラメータ

非デフォルトのハイパーパラメータ
- eval_strategy: steps
- per_device_train_batch_size: 2048
- per_device_eval_batch_size: 2048
- learning_rate: 0.0005
- num_train_epochs: 1
- warmup_ratio: 0.1
- seed: 12
- bf16: True
すべてのハイパーパラメータ：以下の展開ボタンをクリックして表示

クリックして展開

overwrite_output_dir: False
do_predict: False
eval_strategy: steps
prediction_loss_only: True
per_device_train_batch_size: 2048
per_device_eval_batch_size: 2048
per_gpu_train_batch_size: None
per_gpu_eval_batch_size: None
gradient_accumulation_steps: 1
eval_accumulation_steps: None
torch_empty_cache_steps: None
learning_rate: 0.0005
weight_decay: 0.0
adam_beta1: 0.9
adam_beta2: 0.999
adam_epsilon: 1e-08
max_grad_norm: 1.0
num_train_epochs: 1
max_steps: -1
lr_scheduler_type: linear
lr_scheduler_kwargs: {}
warmup_ratio: 0.1
warmup_steps: 0
log_level: passive
log_level_replica: warning
log_on_each_node: True
logging_nan_inf_filter: True
save_safetensors: True
save_on_each_node: False
save_only_model: False
restore_callback_states_from_checkpoint: False
no_cuda: False
use_cpu: False
use_mps_device: False
seed: 12
data_seed: None
jit_mode_eval: False
use_ipex: False
bf16: True
fp16: False
fp16_opt_level: O1
half_precision_backend: auto
bf16_full_eval: False
fp16_full_eval: False
tf32: None
local_rank: 0
ddp_backend: None
tpu_num_cores: None
tpu_metrics_debug: False
debug: []
dataloader_drop_last: False
dataloader_num_workers: 0
dataloader_prefetch_factor: None
past_index: -1
disable_tqdm: False
remove_unused_columns: True
label_names: None
load_best_model_at_end: False
ignore_data_skip: False
fsdp: []
fsdp_min_num_params: 0
fsdp_config: {'min_num_params': 0, 'xla': False, 'xla_fsdp_v2': False, 'xla_fsdp_grad_ckpt': False}
fsdp_transformer_layer_cls_to_wrap: None
accelerator_config: {'split_batches': False, 'dispatch_batches': None, 'even_batches': True, 'use_seedable_sampler': True, 'non_blocking': False, 'gradient_accumulation_kwargs': None}
deepspeed: None
label_smoothing_factor: 0.0
optim: adamw_torch
optim_args: None
adafactor: False
group_by_length: False
length_column_name: length
ddp_find_unused_parameters: None
ddp_bucket_cap_mb: None
ddp_broadcast_buffers: False
dataloader_pin_memory: True
dataloader_persistent_workers: False
skip_memory_metrics: True
use_legacy_prediction_loop: False
push_to_hub: False
resume_from_checkpoint: None
hub_model_id: None
hub_strategy: every_save
hub_private_repo: None
hub_always_push: False
gradient_checkpointing: False
gradient_checkpointing_kwargs: None
include_inputs_for_metrics: False
include_for_metrics: []
eval_do_concat_batches: True
fp16_backend: auto
push_to_hub_model_id: None
push_to_hub_organization: None
mp_parameters:
auto_find_batch_size: False
full_determinism: False
torchdynamo: None
ray_scope: last
ddp_timeout: 1800
torch_compile: False
torch_compile_backend: None
torch_compile_mode: None
dispatch_batches: None
split_batches: None
include_tokens_per_second: False
include_num_input_tokens_seen: False
neftune_noise_alpha: None
optim_target_modules: None
batch_eval_metrics: False
eval_on_start: False
use_liger_kernel: False
eval_use_gather_object: False
average_tokens_across_devices: False
prompts: None
batch_sampler: batch_sampler
multi_dataset_batch_sampler: proportional

訓練ログ

エポック	ステップ	訓練損失	gooaq-dev_ndcg@10	NanoMSMARCO_ndcg@10	NanoNFCorpus_ndcg@10	NanoNQ_ndcg@10	NanoBEIR_R100_mean_ndcg@10
-1	-1	-	0.0887 (-0.5025)	0.0063 (-0.5341)	0.3262 (+0.0012)	0.0000 (-0.5006)	0.1108 (-0.3445)
0.0035	1	1.1945	-	-	-	-	-
0.0707	20	1.1664	0.4082 (-0.1830)	0.1805 (-0.3600)	0.3168 (-0.0083)	0.2243 (-0.2763)	0.2405 (-0.2149)
0.1413	40	1.1107	0.5260 (-0.0652)	0.3453 (-0.1951)	0.3335 (+0.0085)	0.3430 (-0.1576)	0.3406 (-0.1147)
0.2120	60	1.022	0.5623 (-0.0289)	0.3929 (-0.1475)	0.3512 (+0.0262)	0.3472 (-0.1535)	0.3638 (-0.0916)
0.2827	80	0.973	0.5691 (-0.0221)	0.4048 (-0.1356)	0.3530 (+0.0280)	0.3833 (-0.1174)	0.3804 (-0.0750)
0.3534	100	0.963	0.5814 (-0.0098)	0.4385 (-0.1019)	0.3471 (+0.0221)	0.4227 (-0.0779)	0.4028 (-0.0526)
0.4240	120	0.9419	0.5963 (+0.0050)	0.4106 (-0.1298)	0.3540 (+0.0289)	0.4843 (-0.0163)	0.4163 (-0.0391)
0.4947	140	0.9331	0.5953 (+0.0041)	0.4310 (-0.1094)	0.3367 (+0.0117)	0.4163 (-0.0843)	0.3947 (-0.0607)
0.5654	160	0.9263	0.6070 (+0.0158)	0.4626 (-0.0778)	0.3443 (+0.0193)	0.4823 (-0.0184)	0.4297 (-0.0256)
0.6360	180	0.9212	0.6069 (+0.0156)	0.4602 (-0.0802)	0.3391 (+0.0141)	0.4782 (-0.0224)	0.4258 (-0.0295)
0.7067	200	0.901	0.6126 (+0.0214)	0.4602 (-0.0803)	0.3413 (+0.0162)	0.4780 (-0.0227)	0.4265 (-0.0289)
0.7774	220	0.8997	0.6136 (+0.0224)	0.4801 (-0.0604)	0.3349 (+0.0098)	0.4903 (-0.0103)	0.4351 (-0.0203)
0.8481	240	0.9021	0.6132 (+0.0220)	0.4850 (-0.0554)	0.3438 (+0.0188)	0.4855 (-0.0151)	0.4381 (-0.0173)
0.9187	260	0.9013	0.6188 (+0.0276)	0.4820 (-0.0584)	0.3387 (+0.0137)	0.4851 (-0.0156)	0.4353 (-0.0201)
0.9894	280	0.8996	0.6157 (+0.0245)	0.4772 (-0.0632)	0.3305 (+0.0054)	0.4859 (-0.0147)	0.4312 (-0.0242)
-1	-1	-	0.6157 (+0.0245)	0.4772 (-0.0632)	0.3308 (+0.0058)	0.4859 (-0.0147)	0.4313 (-0.0241)

環境影響

CodeCarbonを使用して炭素排出量を測定：

エネルギー消費量：0.019 kWh
炭素排出量：0.007 kg CO2
使用時間：0.099時間

訓練ハードウェア

クラウドサービスを使用しているか：いいえ
GPUモデル：1 x NVIDIA GeForce RTX 3090
CPUモデル：13th Gen Intel(R) Core(TM) i7-13700K
メモリサイズ：31.78 GB

フレームワークバージョン

Python: 3.11.6
Sentence Transformers: 3.5.0.dev0
Transformers: 4.48.3
PyTorch: 2.5.0+cu121
Accelerate: 1.3.0
Datasets: 2.20.0
Tokenizers: 0.21.0

📄 ライセンス

このプロジェクトは、apache-2.0ライセンスを使用しています。

📖 引用

BibTeX

Sentence Transformers

@inproceedings{reimers-2019-sentence-bert,
    title = "Sentence-BERT: Sentence Embeddings using Siamese BERT-Networks",
    author = "Reimers, Nils and Gurevych, Iryna",
    booktitle = "Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing",
    month = "11",
    year = "2019",
    publisher = "Association for Computational Linguistics",
    url = "https://arxiv.org/abs/1908.10084",
}