reranker-gte-multilingual-base-msmarco-bce-ep-2开源模型

Home

Reranker Gte Multilingual Base Msmarco Bce Ep 2

Developed by skfrost19

基于sentence-transformers库在msmarco数据集上训练的交叉编码器模型，用于文本重排序和语义搜索

文本嵌入

Safetensors

Supports Multiple Languages#文本重排序 #语义搜索 #高精度评分

Downloads 28

Release Time : 4/6/2025

Model Overview

该模型计算文本对的分数，可用于文本重排序和语义搜索任务。它是在199万条训练样本上使用二元交叉熵损失函数训练得到的。

Model Features

高性能重排序

在NanoMSMARCO_R100数据集上达到0.7008的NDCG@10分数，表现优异

大规模训练

使用199万条训练样本进行训练，具有强大的语义理解能力

长文本支持

最大支持8192个标记的序列长度，适合处理长文本

Model Capabilities

文本对评分

语义搜索

搜索结果重排序

Use Cases

信息检索

搜索引擎结果重排序

对搜索引擎返回的结果进行重新排序，提升相关性

在MSMARCO数据集上NDCG@10达到0.7008

问答系统

答案相关性排序

对候选答案进行相关性排序，选择最佳答案

在NanoNQ_R100数据集上NDCG@10达到0.6888

🚀 CrossEncoder

CrossEncoder 是一个基于 sentence-transformers 库，在 msmarco 数据集上训练的 Cross Encoder 模型。它可以计算文本对的得分，可用于文本重排序和语义搜索。

✨ 主要特性

能够计算文本对的得分，适用于文本重排序和语义搜索任务。
支持长序列输入，最大序列长度可达 8192 个标记。

📦 安装指南

首先，安装 Sentence Transformers 库：

pip install -U sentence-transformers

💻 使用示例

基础用法

from sentence_transformers import CrossEncoder

# 从 🤗 Hub 下载模型
model = CrossEncoder("skfrost19/reranker-gte-multilingual-base-msmarco-bce-ep-2")
# 获取文本对的得分
pairs = [
    ['what symptoms might a patient with a tmd have', 'TMD sufferers have a long list of symptoms, including chronic pain (https://youtu.be/SvMaJb8o2RI), many of which are in common with Parkinsonâ\x80\x99s disease (PD) symptoms.'],
    ['what is a thermal protector', 'The word hero comes from the Greek á¼¥Ï\x81Ï\x89Ï\x82 (hÄ\x93rÅ\x8ds), hero, warrior, particularly one such as Heracles with divine ancestry or later given divine honors. literally protector or defender.'],
    ['how many copies of call of duty wwii sold', 'Call of Duty 3. Call of Duty 3 is a World War II first-person shooter and the third installment in the Call of Duty video game series. Released on November 7, 2006, the game was developed by Treyarch, and was the first major installment in the Call of Duty series not to be developed by Infinity Ward. It was also the first not to be released on the PC platform. It was released on the PlayStation 2, PlayStation 3, Wii, Xbox, and Xbox 360.'],
    ['what is the desired temperature for the fresh food compartment in a refrigerator', 'A refrigerator maintains a temperature a few degrees above the freezing point of water. Optimum temperature range for perishable food storage is 3 to 5 Â°C (37 to 41 Â°F).emperature settings for refrigerator and freezer compartments are often given arbitrary numbers by manufacturers (for example, 1 through 9, warmest to coldest), but generally 3 to 5 Â°C (37 to 41 Â°F) is ideal for the refrigerator compartment and â\x88\x9218 Â°C (0 Â°F) for the freezer.'],
    ['what is gsm alarm system', 'Iâ\x80\x99m sure you would have these questions in your mind when you heard GSM alarm system at the first time. GSM alarm system is an alarm system that operating through GSM (global system for mobile communications) network; not requiring a telephone line.urthermore, in the case of burglar entering the premises and cutting the telephone line, the GSM alarm would not be affected and still work as it does not require the use of a fixed phone line. So this security alarm is ideal for the place where no fixed phone line or hard to get one.'],
]
scores = model.predict(pairs)
print(scores.shape)
# (5,)

# 或者根据与单个文本的相似度对不同文本进行排序
ranks = model.rank(
    'what symptoms might a patient with a tmd have',
    [
        'TMD sufferers have a long list of symptoms, including chronic pain (https://youtu.be/SvMaJb8o2RI), many of which are in common with Parkinsonâ\x80\x99s disease (PD) symptoms.',
        'The word hero comes from the Greek á¼¥Ï\x81Ï\x89Ï\x82 (hÄ\x93rÅ\x8ds), hero, warrior, particularly one such as Heracles with divine ancestry or later given divine honors. literally protector or defender.',
        'Call of Duty 3. Call of Duty 3 is a World War II first-person shooter and the third installment in the Call of Duty video game series. Released on November 7, 2006, the game was developed by Treyarch, and was the first major installment in the Call of Duty series not to be developed by Infinity Ward. It was also the first not to be released on the PC platform. It was released on the PlayStation 2, PlayStation 3, Wii, Xbox, and Xbox 360.',
        'A refrigerator maintains a temperature a few degrees above the freezing point of water. Optimum temperature range for perishable food storage is 3 to 5 Â°C (37 to 41 Â°F).emperature settings for refrigerator and freezer compartments are often given arbitrary numbers by manufacturers (for example, 1 through 9, warmest to coldest), but generally 3 to 5 Â°C (37 to 41 Â°F) is ideal for the refrigerator compartment and â\x88\x9218 Â°C (0 Â°F) for the freezer.',
        'Iâ\x80\x99m sure you would have these questions in your mind when you heard GSM alarm system at the first time. GSM alarm system is an alarm system that operating through GSM (global system for mobile communications) network; not requiring a telephone line.urthermore, in the case of burglar entering the premises and cutting the telephone line, the GSM alarm would not be affected and still work as it does not require the use of a fixed phone line. So this security alarm is ideal for the place where no fixed phone line or hard to get one.',
    ]
)
# [{'corpus_id': ..., 'score': ...}, {'corpus_id': ..., 'score': ...}, ...]

📚 详细文档

模型详情

模型描述

属性	详情
模型类型	Cross Encoder
最大序列长度	8192 个标记
输出标签数量	1 个标签
训练数据集	msmarco
语言	英语

模型来源

文档：Sentence Transformers 文档
文档：Cross Encoder 文档
仓库：GitHub 上的 Sentence Transformers
Hugging Face：Hugging Face 上的 Cross Encoders

评估

评估指标

Cross Encoder 重排序

数据集：NanoMSMARCO_R100、NanoNFCorpus_R100 和 NanoNQ_R100
评估方法：使用 CrossEncoderRerankingEvaluator 进行评估，参数如下：

{
    "at_k": 10,
    "always_rerank_positives": true
}

指标	NanoMSMARCO_R100	NanoNFCorpus_R100	NanoNQ_R100
map	0.6439 (+0.1543)	0.3324 (+0.0714)	0.6174 (+0.1978)
mrr@10	0.6382 (+0.1607)	0.5372 (+0.0374)	0.6412 (+0.2145)
ndcg@10	0.7008 (+0.1604)	0.3732 (+0.0482)	0.6888 (+0.1881)

Cross Encoder Nano BEIR

数据集：NanoBEIR_R100_mean
评估方法：使用 CrossEncoderNanoBEIREvaluator 进行评估，参数如下：

{
    "dataset_names": [
        "msmarco",
        "nfcorpus",
        "nq"
    ],
    "rerank_k": 100,
    "at_k": 10,
    "always_rerank_positives": true
}

指标	值
map	0.5312 (+0.1412)
mrr@10	0.6055 (+0.1375)
ndcg@10	0.5876 (+0.1322)

训练详情

训练数据集

数据集：msmarco，版本为 9e329ed
大小：1,990,000 个训练样本
列：query、passage 和 score
近似统计信息（基于前 1000 个样本）： | | 查询 | 段落 | 得分 | | ---- | ---- | ---- | ---- | | 类型 | 字符串 | 字符串 | 浮点数 | | 详情 |
- 最小长度：11 个字符
- 平均长度：34.61 个字符
- 最大长度：124 个字符
|
- 最小长度：82 个字符
- 平均长度：357.43 个字符
- 最大长度：1034 个字符
|
- 最小值：0.0
- 平均值：0.49
- 最大值：1.0
|
样本： | 查询 | 段落 | 得分 | | ---- | ---- | ---- | | what causes your tailbone to hurt | A coccyx injury results in pain and discomfort in the tailbone area (the condition is called coccydynia). These injuries may result in a bruise, dislocation, or fracture (break) of the coccyx. Although they may be slow to heal, the majority of coccyx injuries can be managed with cautious treatment.ost tailbone injuries are caused by trauma to the coccyx area. 1 A fall onto the tailbone in the seated position, usually against a hard surface, is the most common cause of coccyx injuries. 2 A direct blow to the tailbone, such as those that occur during contact sports, can injure the coccyx. | 1.0 | | what muscles do trunk lateral flexion | Itâs the same with the External Obliques, but unlike the External Obliques, they are not visible when fully developed. Action: 1 Supports abdominal wall, assists forced respiration, aids raising intra-abdominal pressure and, with muscles of other side, abducts and rotates trunk. 2 Contraction of one side alone laterally bends the trunk to that side and rotates the trunk to the other side. | 0.0 | | brake horsepower definition | When the brake lights will not come on, the first thing to check is the third-brake light. If it too is not working, the brake-light switch, a bad fuse or an unplugged harness is likely.ull up on the brake pedal and if the lights go out, switch mis-alignment or pedal position error is the likely cause. The final possibility is a wire shorted to power. Unplug the brake-light switch and if the lights stay on, a short circuit is the case. | 0.0 |
损失函数：BinaryCrossEntropyLoss，参数如下：

{
    "activation_fn": "torch.nn.modules.linear.Identity",
    "pos_weight": null
}

评估数据集

数据集：msmarco，版本为 9e329ed
大小：10,000 个评估样本
列：query、passage 和 score
近似统计信息（基于前 1000 个样本）： | | 查询 | 段落 | 得分 | | ---- | ---- | ---- | ---- | | 类型 | 字符串 | 字符串 | 浮点数 | | 详情 |
- 最小长度：9 个字符
- 平均长度：33.72 个字符
- 最大长度：193 个字符
|
- 最小长度：55 个字符
- 平均长度：353.35 个字符
- 最大长度：895 个字符
|
- 最小值：0.0
- 平均值：0.5
- 最大值：1.0
|
样本： | 查询 | 段落 | 得分 | | ---- | ---- | ---- | | what symptoms might a patient with a tmd have | TMD sufferers have a long list of symptoms, including chronic pain (https://youtu.be/SvMaJb8o2RI), many of which are in common with Parkinsonâs disease (PD) symptoms. | 1.0 | | what is a thermal protector | The word hero comes from the Greek á¼¥ÏÏÏ (hÄrÅs), hero, warrior, particularly one such as Heracles with divine ancestry or later given divine honors. literally protector or defender. | 0.0 | | how many copies of call of duty wwii sold | Call of Duty 3. Call of Duty 3 is a World War II first-person shooter and the third installment in the Call of Duty video game series. Released on November 7, 2006, the game was developed by Treyarch, and was the first major installment in the Call of Duty series not to be developed by Infinity Ward. It was also the first not to be released on the PC platform. It was released on the PlayStation 2, PlayStation 3, Wii, Xbox, and Xbox 360. | 0.0 |
损失函数：BinaryCrossEntropyLoss，参数如下：

{
    "activation_fn": "torch.nn.modules.linear.Identity",
    "pos_weight": null
}

训练超参数

非默认超参数

eval_strategy: steps
per_device_train_batch_size: 156
per_device_eval_batch_size: 156
learning_rate: 2e-05
num_train_epochs: 2
warmup_ratio: 0.1
seed: 12
bf16: True
dataloader_num_workers: 4
load_best_model_at_end: True

所有超参数

点击展开

overwrite_output_dir: False
do_predict: False
eval_strategy: steps
prediction_loss_only: True
per_device_train_batch_size: 156
per_device_eval_batch_size: 156
per_gpu_train_batch_size: None
per_gpu_eval_batch_size: None
gradient_accumulation_steps: 1
eval_accumulation_steps: None
torch_empty_cache_steps: None
learning_rate: 2e-05
weight_decay: 0.0
adam_beta1: 0.9
adam_beta2: 0.999
adam_epsilon: 1e-08
max_grad_norm: 1.0
num_train_epochs: 2
max_steps: -1
lr_scheduler_type: linear
lr_scheduler_kwargs: {}
warmup_ratio: 0.1
warmup_steps: 0
log_level: passive
log_level_replica: warning
log_on_each_node: True
logging_nan_inf_filter: True
save_safetensors: True
save_on_each_node: False
save_only_model: False
restore_callback_states_from_checkpoint: False
no_cuda: False
use_cpu: False
use_mps_device: False
seed: 12
data_seed: None
jit_mode_eval: False
use_ipex: False
bf16: True
fp16: False
fp16_opt_level: O1
half_precision_backend: auto
bf16_full_eval: False
fp16_full_eval: False
tf32: None
local_rank: 0
ddp_backend: None
tpu_num_cores: None
tpu_metrics_debug: False
debug: []
dataloader_drop_last: False
dataloader_num_workers: 4
dataloader_prefetch_factor: None
past_index: -1
disable_tqdm: False
remove_unused_columns: True
label_names: None
load_best_model_at_end: True
ignore_data_skip: False
fsdp: []
fsdp_min_num_params: 0
fsdp_config: {'min_num_params': 0, 'xla': False, 'xla_fsdp_v2': False, 'xla_fsdp_grad_ckpt': False}
tp_size: 0
fsdp_transformer_layer_cls_to_wrap: None
accelerator_config: {'split_batches': False, 'dispatch_batches': None, 'even_batches': True, 'use_seedable_sampler': True, 'non_blocking': False, 'gradient_accumulation_kwargs': None}
deepspeed: None
label_smoothing_factor: 0.0
optim: adamw_torch
optim_args: None
adafactor: False
group_by_length: False
length_column_name: length
ddp_find_unused_parameters: None
ddp_bucket_cap_mb: None
ddp_broadcast_buffers: False
dataloader_pin_memory: True
dataloader_persistent_workers: False
skip_memory_metrics: True
use_legacy_prediction_loop: False
push_to_hub: False
resume_from_checkpoint: None
hub_model_id: None
hub_strategy: every_save
hub_private_repo: None
hub_always_push: False
gradient_checkpointing: False
gradient_checkpointing_kwargs: None
include_inputs_for_metrics: False
include_for_metrics: []
eval_do_concat_batches: True
fp16_backend: auto
push_to_hub_model_id: None
push_to_hub_organization: None
mp_parameters:
auto_find_batch_size: False
full_determinism: False
torchdynamo: None
ray_scope: last
ddp_timeout: 1800
torch_compile: False
torch_compile_backend: None
torch_compile_mode: None
dispatch_batches: None
split_batches: None
include_tokens_per_second: False
include_num_input_tokens_seen: False
neftune_noise_alpha: None
optim_target_modules: None
batch_eval_metrics: False
eval_on_start: False
use_liger_kernel: False
eval_use_gather_object: False
average_tokens_across_devices: False
prompts: None
batch_sampler: batch_sampler
multi_dataset_batch_sampler: proportional

训练日志

轮次	步数	训练损失	验证损失	NanoMSMARCO_R100_ndcg@10	NanoNFCorpus_R100_ndcg@10	NanoNQ_R100_ndcg@10	NanoBEIR_R100_mean_ndcg@10
-1	-1	-	-	0.6561 (+0.1157)	0.3777 (+0.0527)	0.6548 (+0.1541)	0.5629 (+0.1075)
0.0001	1	0.1089	-	-	-	-	-
0.3136	4000	0.1258	-	-	-	-	-
0.6271	8000	0.1185	-	-	-	-	-
0.7839	10000	-	0.1177	0.7008 (+0.1604)	0.3732 (+0.0482)	0.6888 (+0.1881)	0.5876 (+0.1322)
0.9407	12000	0.1231	-	-	-	-	-
-1	-1	-	-	0.7008 (+0.1604)	0.3732 (+0.0482)	0.6888 (+0.1881)	0.5876 (+0.1322)

注：加粗行表示保存的检查点。

框架版本

Python: 3.11.5
Sentence Transformers: 4.0.1
Transformers: 4.50.3
PyTorch: 2.6.0+cu124
Accelerate: 1.6.0
Datasets: 3.5.0
Tokenizers: 0.21.1

📄 许可证

文档中未提及该模型的许可证信息。

📚 引用

BibTeX

Sentence Transformers

@inproceedings{reimers-2019-sentence-bert,
    title = "Sentence-BERT: Sentence Embeddings using Siamese BERT-Networks",
    author = "Reimers, Nils and Gurevych, Iryna",
    booktitle = "Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing",
    month = "11",
    year = "2019",
    publisher = "Association for Computational Linguistics",
    url = "https://arxiv.org/abs/1908.10084",
}