Amber Large
Model Overview
Model Features
Model Capabilities
Use Cases
tags:
- sentence-transformers
- sentence-similarity
- feature-extraction
- mteb
base_model: sbintuitions/modernbert-ja-310m
language:
- ja
- en
model-index:
- name: retrieva-jp/amber-large
results:
- dataset:
config: en
name: MTEB AmazonCounterfactualClassification (en)
revision: e8379541af4e31359cca9fbcf4b00f2671dba205
split: test
type: mteb/amazon_counterfactual
metrics:
- type: accuracy value: 73.3433
- type: f1 value: 67.2899
- type: f1_weighted value: 75.7948
- type: ap value: 36.123
- type: ap_weighted value: 36.123
- type: main_score value: 73.3433
task:
  type: Classification
- dataset:
config: default
name: MTEB ArXivHierarchicalClusteringP2P (default)
revision: 0bbdb47bcbe3a90093699aefeed338a0f28a7ee8
split: test
type: mteb/arxiv-clustering-p2p
metrics:
- type: v_measure value: 53.3936
- type: v_measure_std value: 3.9726999999999997
- type: main_score value: 53.3936
task:
  type: Clustering
- dataset:
config: default
name: MTEB ArXivHierarchicalClusteringS2S (default)
revision: b73bd54100e5abfa6e3a23dcafb46fe4d2438dc3
split: test
type: mteb/arxiv-clustering-s2s
metrics:
- type: v_measure value: 51.35999999999999
- type: v_measure_std value: 4.9623
- type: main_score value: 51.35999999999999
task:
  type: Clustering
- dataset:
config: default
name: MTEB ArguAna (default)
revision: c22ab2a51041ffd869aaddef7af8d8215647e41a
split: test
type: mteb/arguana
metrics:
- type: ndcg_at_1 value: 26.743
- type: ndcg_at_3 value: 40.550999999999995
- type: ndcg_at_5 value: 45.550000000000004
- type: ndcg_at_10 value: 51.317
- type: ndcg_at_20 value: 53.96300000000001
- type: ndcg_at_100 value: 55.358
- type: ndcg_at_1000 value: 55.596000000000004
- type: map_at_1 value: 26.743
- type: map_at_3 value: 37.162
- type: map_at_5 value: 39.964
- type: map_at_10 value: 42.355
- type: map_at_20 value: 43.1
- type: map_at_100 value: 43.313
- type: map_at_1000 value: 43.323
- type: recall_at_1 value: 26.743
- type: recall_at_3 value: 50.356
- type: recall_at_5 value: 62.376
- type: recall_at_10 value: 80.156
- type: recall_at_20 value: 90.469
- type: recall_at_100 value: 97.724
- type: recall_at_1000 value: 99.502
- type: precision_at_1 value: 26.743
- type: precision_at_3 value: 16.785
- type: precision_at_5 value: 12.475
- type: precision_at_10 value: 8.016
- type: precision_at_20 value: 4.523
- type: precision_at_100 value: 0.9769999999999999
- type: precision_at_1000 value: 0.1
- type: mrr_at_1 value: 27.169300000000003
- type: mrr_at_3 value: 37.411100000000005
- type: mrr_at_5 value: 40.1102
- type: mrr_at_10 value: 42.493900000000004
- type: mrr_at_20 value: 43.2491
- type: mrr_at_100 value: 43.4578
- type: mrr_at_1000 value: 43.4685
- type: nauc_ndcg_at_1_max value: -6.2333
- type: nauc_ndcg_at_1_std value: -7.9555
- type: nauc_ndcg_at_1_diff1 value: 14.512
- type: nauc_ndcg_at_3_max value: -2.1475999999999997
- type: nauc_ndcg_at_3_std value: -5.8094
- type: nauc_ndcg_at_3_diff1 value: 9.136
- type: nauc_ndcg_at_5_max value: -1.7067999999999999
- type: nauc_ndcg_at_5_std value: -5.018800000000001
- type: nauc_ndcg_at_5_diff1 value: 9.4328
- type: nauc_ndcg_at_10_max value: 0.7445
- type: nauc_ndcg_at_10_std value: -3.5482
- type: nauc_ndcg_at_10_diff1 value: 11.1
- type: nauc_ndcg_at_20_max value: 0.47200000000000003
- type: nauc_ndcg_at_20_std value: -3.3912999999999998
- type: nauc_ndcg_at_20_diff1 value: 11.2196
- type: nauc_ndcg_at_100_max value: -1.1079
- type: nauc_ndcg_at_100_std value: -3.8186999999999998
- type: nauc_ndcg_at_100_diff1 value: 10.9808
- type: nauc_ndcg_at_1000_max value: -1.3786
- type: nauc_ndcg_at_1000_std value: -4.3135
- type: nauc_ndcg_at_1000_diff1 value: 10.9463
- type: nauc_map_at_1_max value: -6.2333
- type: nauc_map_at_1_std value: -7.9555
- type: nauc_map_at_1_diff1 value: 14.512
- type: nauc_map_at_3_max value: -3.3211999999999997
- type: nauc_map_at_3_std value: -6.2437
- type: nauc_map_at_3_diff1 value: 10.1283
- type: nauc_map_at_5_max value: -3.0931
- type: nauc_map_at_5_std value: -5.7626
- type: nauc_map_at_5_diff1 value: 10.3327
- type: nauc_map_at_10_max value: -2.2469
- type: nauc_map_at_10_std value: -5.2611
- type: nauc_map_at_10_diff1 value: 11.017100000000001
- type: nauc_map_at_20_max value: -2.358
- type: nauc_map_at_20_std value: -5.255
- type: nauc_map_at_20_diff1 value: 11.0437
- type: nauc_map_at_100_max value: -2.5533
- type: nauc_map_at_100_std value: -5.2893
- type: nauc_map_at_100_diff1 value: 11.018600000000001
- type: nauc_map_at_1000_max value: -2.5621
- type: nauc_map_at_1000_std value: -5.3072
- type: nauc_map_at_1000_diff1 value: 11.0196
- type: nauc_recall_at_1_max value: -6.2333
- type: nauc_recall_at_1_std value: -7.9555
- type: nauc_recall_at_1_diff1 value: 14.512
- type: nauc_recall_at_3_max value: 1.2414
- type: nauc_recall_at_3_std value: -4.6148
- type: nauc_recall_at_3_diff1 value: 6.45
- type: nauc_recall_at_5_max value: 2.7998
- type: nauc_recall_at_5_std value: -2.6652
- type: nauc_recall_at_5_diff1 value: 6.7526
- type: nauc_recall_at_10_max value: 17.322100000000002
- type: nauc_recall_at_10_std value: 5.9032
- type: nauc_recall_at_10_diff1 value: 12.881899999999998
- type: nauc_recall_at_20_max value: 29.6782
- type: nauc_recall_at_20_std value: 16.4192
- type: nauc_recall_at_20_diff1 value: 15.8604
- type: nauc_recall_at_100_max value: 28.772599999999997
- type: nauc_recall_at_100_std value: 48.7738
- type: nauc_recall_at_100_diff1 value: 15.8629
- type: nauc_recall_at_1000_max value: 31.0293
- type: nauc_recall_at_1000_std value: 52.7185
- type: nauc_recall_at_1000_diff1 value: 14.3646
- type: nauc_precision_at_1_max value: -6.2333
- type: nauc_precision_at_1_std value: -7.9555
- type: nauc_precision_at_1_diff1 value: 14.512
- type: nauc_precision_at_3_max value: 1.2414
- type: nauc_precision_at_3_std value: -4.6148
- type: nauc_precision_at_3_diff1 value: 6.45
- type: nauc_precision_at_5_max value: 2.7998
- type: nauc_precision_at_5_std value: -2.6652
- type: nauc_precision_at_5_diff1 value: 6.7526
- type: nauc_precision_at_10_max value: 17.322100000000002
- type: nauc_precision_at_10_std value: 5.9032
- type: nauc_precision_at_10_diff1 value: 12.881899999999998
- type: nauc_precision_at_20_max value: 29.6782
- type: nauc_precision_at_20_std value: 16.4192
- type: nauc_precision_at_20_diff1 value: 15.8604
- type: nauc_precision_at_100_max value: 28.772599999999997
- type: nauc_precision_at_100_std value: 48.7738
- type: nauc_precision_at_100_diff1 value: 15.8629
- type: nauc_precision_at_1000_max value: 31.0293
- type: nauc_precision_at_1000_std value: 52.7185
- type: nauc_precision_at_1000_diff1 value: 14.3646
- type: nauc_mrr_at_1_max value: -6.0675
- type: nauc_mrr_at_1_std value: -7.0283999999999995
- type: nauc_mrr_at_1_diff1 value: 13.1112
- type: nauc_mrr_at_3_max value: -3.8593
- type: nauc_mrr_at_3_std value: -5.9281
- type: nauc_mrr_at_3_diff1 value: 8.807
- type: nauc_mrr_at_5_max value: -3.6332999999999998
- type: nauc_mrr_at_5_std value: -5.3816999999999995
- type: nauc_mrr_at_5_diff1 value: 9.0466
- type: nauc_mrr_at_10_max value: -2.8869
- type: nauc_mrr_at_10_std value: -4.9811000000000005
- type: nauc_mrr_at_10_diff1 value: 9.589699999999999
- type: nauc_mrr_at_20_max value: -2.9609
- type: nauc_mrr_at_20_std value: -4.9429
- type: nauc_mrr_at_20_diff1 value: 9.6326
- type: nauc_mrr_at_100_max value: -3.15
- type: nauc_mrr_at_100_std value: -4.9643
- type: nauc_mrr_at_100_diff1 value: 9.6056
- type: nauc_mrr_at_1000_max value: -3.159
- type: nauc_mrr_at_1000_std value: -4.982
- type: nauc_mrr_at_1000_diff1 value: 9.6061
- type: main_score value: 51.317
task:
  type: Retrieval
- dataset:
config: default
name: MTEB AskUbuntuDupQuestions (default)
revision: 2000358ca161889fa9c082cb41daa8dcfb161a54
split: test
type: mteb/askubuntudupquestions-reranking
metrics:
- type: map value: 58.0233
- type: mrr value: 70.5882
- type: nAUC_map_max value: 20.8533
- type: nAUC_map_std value: 12.612300000000001
- type: nAUC_map_diff1 value: 1.3859
- type: nAUC_mrr_max value: 33.692
- type: nAUC_mrr_std value: 14.176400000000001
- type: nAUC_mrr_diff1 value: 14.2379
- type: main_score value: 58.0233
task:
  type: Reranking
- dataset:
config: default
name: MTEB BIOSSES (default)
revision: d3fb88f8f02e40887cd149695127462bbcf29b4a
split: test
type: mteb/biosses-sts
metrics:
- type: pearson value: 83.4314
- type: spearman value: 78.7367
- type: cosine_pearson value: 83.4314
- type: cosine_spearman value: 78.7367
- type: manhattan_pearson value: 82.1388
- type: manhattan_spearman value: 78.747
- type: euclidean_pearson value: 82.1716
- type: euclidean_spearman value: 78.7367
- type: main_score value: 78.7367
task:
  type: STS
- dataset:
config: default
name: MTEB Banking77Classification (default)
revision: 0fd18e25b25c072e09e0d92ab615fda904d66300
split: test
type: mteb/banking77
metrics:
- type: accuracy value: 76.8961
- type: f1 value: 75.8746
- type: f1_weighted value: 75.8746
- type: main_score value: 76.8961
task:
  type: Classification
- dataset:
config: default
name: MTEB BiorxivClusteringP2P.v2 (default)
revision: f5dbc242e11dd8e24def4c4268607a49e02946dc
split: test
type: mteb/biorxiv-clustering-p2p
metrics:
- type: v_measure value: 36.2676
- type: v_measure_std value: 0.8959
- type: main_score value: 36.2676
task:
  type: Clustering
- dataset:
config: default
name: MTEB CQADupstackGamingRetrieval (default)
revision: 4885aa143210c98657558c04aaf3dc47cfb54340
split: test
type: mteb/cqadupstack-gaming
metrics:
- type: ndcg_at_1 value: 36.489
- type: ndcg_at_3 value: 42.821999999999996
- type: ndcg_at_5 value: 44.915
- type: ndcg_at_10 value: 47.74
- type: ndcg_at_20 value: 49.613
- type: ndcg_at_100 value: 52.406
- type: ndcg_at_1000 value: 53.984
- type: map_at_1 value: 31.812
- type: map_at_3 value: 39.568
- type: map_at_5 value: 40.976
- type: map_at_10 value: 42.36
- type: map_at_20 value: 42.978
- type: map_at_100 value: 43.418
- type: map_at_1000 value: 43.488
- type: recall_at_1 value: 31.812
- type: recall_at_3 value: 47.199999999999996
- type: recall_at_5 value: 52.361999999999995
- type: recall_at_10 value: 60.535000000000004
- type: recall_at_20 value: 67.51899999999999
- type: recall_at_100 value: 81.432
- type: recall_at_1000 value: 92.935
- type: precision_at_1 value: 36.489
- type: precision_at_3 value: 19.269
- type: precision_at_5 value: 13.116
- type: precision_at_10 value: 7.818
- type: precision_at_20 value: 4.4670000000000005
- type: precision_at_100 value: 1.107
- type: precision_at_1000 value: 0.13
- type: mrr_at_1 value: 36.489
- type: mrr_at_3 value: 43.2602
- type: mrr_at_5 value: 44.4514
- type: mrr_at_10 value: 45.510600000000004
- type: mrr_at_20 value: 45.9739
- type: mrr_at_100 value: 46.3047
- type: mrr_at_1000 value: 46.3441
- type: nauc_ndcg_at_1_max value: 32.7997
- type: nauc_ndcg_at_1_std value: -6.2432
- type: nauc_ndcg_at_1_diff1 value: 51.348499999999994
- type: nauc_ndcg_at_3_max value: 30.573299999999996
- type: nauc_ndcg_at_3_std value: -5.183999999999999
- type: nauc_ndcg_at_3_diff1 value: 45.3705
- type: nauc_ndcg_at_5_max value: 30.7409
- type: nauc_ndcg_at_5_std value: -4.0355
- type: nauc_ndcg_at_5_diff1 value: 44.6049
- type: nauc_ndcg_at_10_max value: 31.533699999999996
- type: nauc_ndcg_at_10_std value: -2.8769
- type: nauc_ndcg_at_10_diff1 value: 44.3542
- type: nauc_ndcg_at_20_max value: 32.0732
- type: nauc_ndcg_at_20_std value: -1.872
- type: nauc_ndcg_at_20_diff1 value: 44.2475
- type: nauc_ndcg_at_100_max value: 32.671
- type: nauc_ndcg_at_100_std value: -1.1646999999999998
- type: nauc_ndcg_at_100_diff1 value: 44.2262
- type: nauc_ndcg_at_1000_max value: 32.9504
- type: nauc_ndcg_at_1000_std value: -1.0373999999999999
- type: nauc_ndcg_at_1000_diff1 value: 44.507999999999996
- type: nauc_map_at_1_max value: 29.0809
- type: nauc_map_at_1_std value: -6.367000000000001
- type: nauc_map_at_1_diff1 value: 51.906200000000005
- type: nauc_map_at_3_max value: 30.127
- type: nauc_map_at_3_std value: -6.1406
- type: nauc_map_at_3_diff1 value: 47.131099999999996
- type: nauc_map_at_5_max value: 30.2421
- type: nauc_map_at_5_std value: -5.4726
- type: nauc_map_at_5_diff1 value: 46.6666
- type: nauc_map_at_10_max value: 30.826500000000003
- type: nauc_map_at_10_std value: -4.8187
- type: nauc_map_at_10_diff1 value: 46.5314
- type: nauc_map_at_20_max value: 31.1207
- type: nauc_map_at_20_std value: -4.3886
- type: nauc_map_at_20_diff1 value: 46.4738
- type: nauc_map_at_100_max value: 31.2728
- type: nauc_map_at_100_std value: -4.2386
- type: nauc_map_at_100_diff1 value: 46.4656
- type: nauc_map_at_1000_max value: 31.307499999999997
- type: nauc_map_at_1000_std value: -4.213900000000001
- type: nauc_map_at_1000_diff1 value: 46.4827
- type: nauc_recall_at_1_max value: 29.0809
- type: nauc_recall_at_1_std value: -6.367000000000001
- type: nauc_recall_at_1_diff1 value: 51.906200000000005
- type: nauc_recall_at_3_max value: 28.213
- type: nauc_recall_at_3_std value: -4.8443
- type: nauc_recall_at_3_diff1 value: 40.3982
- type: nauc_recall_at_5_max value: 28.038200000000003
- type: nauc_recall_at_5_std value: -1.8623
- type: nauc_recall_at_5_diff1 value: 38.1102
- type: nauc_recall_at_10_max value: 29.4193
- type: nauc_recall_at_10_std value: 1.821
- type: nauc_recall_at_10_diff1 value: 36.262899999999995
- type: nauc_recall_at_20_max value: 31.0056
- type: nauc_recall_at_20_std value: 6.6465
- type: nauc_recall_at_20_diff1 value: 34.9446
- type: nauc_recall_at_100_max value: 33.3618
- type: nauc_recall_at_100_std value: 16.1202
- type: nauc_recall_at_100_diff1 value: 29.264699999999998
- type: nauc_recall_at_1000_max value: 40.03
- type: nauc_recall_at_1000_std value: 40.261
- type: nauc_recall_at_1000_diff1 value: 19.1627
- type: nauc_precision_at_1_max value: 32.7997
- type: nauc_precision_at_1_std value: -6.2432
- type: nauc_precision_at_1_diff1 value: 51.348499999999994
- type: nauc_precision_at_3_max value: 30.527900000000002
- type: nauc_precision_at_3_std value: -2.2055000000000002
- type: nauc_precision_at_3_diff1 value: 31.7838
- type: nauc_precision_at_5_max value: 29.078
- type: nauc_precision_at_5_std value: 1.7718
- type: nauc_precision_at_5_diff1 value: 26.0635
- type: nauc_precision_at_10_max value: 28.903499999999998
- type: nauc_precision_at_10_std value: 7.321
- type: nauc_precision_at_10_diff1 value: 19.4822
- type: nauc_precision_at_20_max value: 29.5105
- type: nauc_precision_at_20_std value: 12.931999999999999
- type: nauc_precision_at_20_diff1 value: 14.0846
- type: nauc_precision_at_100_max value: 27.9082
- type: nauc_precision_at_100_std value: 19.1086
- type: nauc_precision_at_100_diff1 value: 4.7168
- type: nauc_precision_at_1000_max value: 24.2535
- type: nauc_precision_at_1000_std value: 19.430500000000002
- type: nauc_precision_at_1000_diff1 value: -1.262
- type: nauc_mrr_at_1_max value: 32.7997
- type: nauc_mrr_at_1_std value: -6.2432
- type: nauc_mrr_at_1_diff1 value: 51.348499999999994
- type: nauc_mrr_at_3_max value: 32.4347
- type: nauc_mrr_at_3_std value: -5.0054
- type: nauc_mrr_at_3_diff1 value: 46.2024
- type: nauc_mrr_at_5_max value: 32.7235
- type: nauc_mrr_at_5_std value: -4.239
- type: nauc_mrr_at_5_diff1 value: 46.0496
- type: nauc_mrr_at_10_max value: 32.7692
- type: nauc_mrr_at_10_std value: -3.9257
- type: nauc_mrr_at_10_diff1 value: 46.009699999999995
- type: nauc_mrr_at_20_max value: 32.8372
- type: nauc_mrr_at_20_std value: -3.7516000000000003
- type: nauc_mrr_at_20_diff1 value: 45.9608
- type: nauc_mrr_at_100_max value: 32.845200000000006
- type: nauc_mrr_at_100_std value: -3.7661
- type: nauc_mrr_at_100_diff1 value: 45.988600000000005
- type: nauc_mrr_at_1000_max value: 32.8484
- type: nauc_mrr_at_1000_std value: -3.7553
- type: nauc_mrr_at_1000_diff1 value: 45.9936
- type: main_score value: 47.74
task:
  type: Retrieval
- dataset:
config: default
name: MTEB CQADupstackUnixRetrieval (default)
revision: 6c6430d3a6d36f8d2a829195bc5dc94d7e063e53
split: test
type: mteb/cqadupstack-unix
metrics:
- type: ndcg_at_1 value: 24.813
- type: ndcg_at_3 value: 28.232000000000003
- type: ndcg_at_5 value: 30.384
- type: ndcg_at_10 value: 32.482
- type: ndcg_at_20 value: 34.627
- type: ndcg_at_100 value: 38.275
- type: ndcg_at_1000 value: 41.07
- type: map_at_1 value: 21.176000000000002
- type: map_at_3 value: 25.75
- type: map_at_5 value: 27.169999999999998
- type: map_at_10 value: 28.081
- type: map_at_20 value: 28.698
- type: map_at_100 value: 29.264000000000003
- type: map_at_1000 value: 29.38
- type: recall_at_1 value: 21.176000000000002
- type: recall_at_3 value: 30.842000000000002
- type: recall_at_5 value: 36.265
- type: recall_at_10 value: 42.531
- type: recall_at_20 value: 50.314
- type: recall_at_100 value: 68.13900000000001
- type: recall_at_1000 value: 88.252
- type: precision_at_1 value: 24.813
- type: precision_at_3 value: 12.687000000000001
- type: precision_at_5 value: 9.049
- type: precision_at_10 value: 5.401
- type: precision_at_20 value: 3.274
- type: precision_at_100 value: 0.9329999999999999
- type: precision_at_1000 value: 0.129
- type: mrr_at_1 value: 24.813399999999998
- type: mrr_at_3 value: 29.446499999999997
- type: mrr_at_5 value: 30.747799999999998
- type: mrr_at_10 value: 31.6057
- type: mrr_at_20 value: 32.2122
- type: mrr_at_100 value: 32.6663
- type: mrr_at_1000 value: 32.734
- type: nauc_ndcg_at_1_max value: 34.191
- type: nauc_ndcg_at_1_std value: 0.2555
- type: nauc_ndcg_at_1_diff1 value: 55.12590000000001
- type: nauc_ndcg_at_3_max value: 31.232599999999998
- type: nauc_ndcg_at_3_std value: 2.2289
- type: nauc_ndcg_at_3_diff1 value: 48.0837
- type: nauc_ndcg_at_5_max value: 30.962400000000002
- type: nauc_ndcg_at_5_std value: 3.4008999999999996
- type: nauc_ndcg_at_5_diff1 value: 46.4811
- type: nauc_ndcg_at_10_max value: 31.446600000000004
- type: nauc_ndcg_at_10_std value: 4.1986
- type: nauc_ndcg_at_10_diff1 value: 45.393499999999996
- type: nauc_ndcg_at_20_max value: 32.1259
- type: nauc_ndcg_at_20_std value: 4.8191999999999995
- type: nauc_ndcg_at_20_diff1 value: 45.5339
- type: nauc_ndcg_at_100_max value: 31.741799999999998
- type: nauc_ndcg_at_100_std value: 6.5873
- type: nauc_ndcg_at_100_diff1 value: 45.1915
- type: nauc_ndcg_at_1000_max value: 32.1615
- type: nauc_ndcg_at_1000_std value: 6.5815
- type: nauc_ndcg_at_1000_diff1 value: 45.4801
- type: nauc_map_at_1_max value: 33.592499999999994
- type: nauc_map_at_1_std value: -0.8531000000000001
- type: nauc_map_at_1_diff1 value: 56.7096
- type: nauc_map_at_3_max value: 31.6479
- type: nauc_map_at_3_std value: 1.2515999999999998
- type: nauc_map_at_3_diff1 value: 50.4096
- type: nauc_map_at_5_max value: 31.3468
- type: nauc_map_at_5_std value: 1.9414
- type: nauc_map_at_5_diff1 value: 49.3593
- type: nauc_map_at_10_max value: 31.494
- type: nauc_map_at_10_std value: 2.298
- type: nauc_map_at_10_diff1 value: 48.809799999999996
- type: nauc_map_at_20_max value: 31.724000000000004
- type: nauc_map_at_20_std value: 2.5317
- type: nauc_map_at_20_diff1 value: 48.825
- type: nauc_map_at_100_max value: 31.671100000000003
- type: nauc_map_at_100_std value: 2.8145
- type: nauc_map_at_100_diff1 value: 48.7271
- type: nauc_map_at_1000_max value: 31.689
- type: nauc_map_at_1000_std value: 2.8294
- type: nauc_map_at_1000_diff1 value: 48.7329
- type: nauc_recall_at_1_max value: 33.592499999999994
- type: nauc_recall_at_1_std value: -0.8531000000000001
- type: nauc_recall_at_1_diff1 value: 56.7096
- type: nauc_recall_at_3_max value: 29.4439
- type: nauc_recall_at_3_std value: 3.5302
- type: nauc_recall_at_3_diff1 value: 43.5153
- type: nauc_recall_at_5_max value: 28.3517
- type: nauc_recall_at_5_std value: 6.458500000000001
- type: nauc_recall_at_5_diff1 value: 39.5587
- type: nauc_recall_at_10_max value: 29.2991
- type: nauc_recall_at_10_std value: 8.5119
- type: nauc_recall_at_10_diff1 value: 36.1111
- type: nauc_recall_at_20_max value: 30.984099999999998
- type: nauc_recall_at_20_std value: 10.668
- type: nauc_recall_at_20_diff1 value: 36.5424
- type: nauc_recall_at_100_max value: 28.0852
- type: nauc_recall_at_100_std value: 21.938
- type: nauc_recall_at_100_diff1 value: 32.5436
- type: nauc_recall_at_1000_max value: 33.8843
- type: nauc_recall_at_1000_std value: 40.677099999999996
- type: nauc_recall_at_1000_diff1 value: 28.95
- type: nauc_precision_at_1_max value: 34.191
- type: nauc_precision_at_1_std value: 0.2555
- type: nauc_precision_at_1_diff1 value: 55.12590000000001
- type: nauc_precision_at_3_max value: 28.9812
- type: nauc_precision_at_3_std value: 5.745299999999999
- type: nauc_precision_at_3_diff1 value: 38.4525
- type: nauc_precision_at_5_max value: 27.060200000000002
- type: nauc_precision_at_5_std value: 8.4729
- type: nauc_precision_at_5_diff1 value: 32.9266
- type: nauc_precision_at_10_max value: 25.7858
- type: nauc_precision_at_10_std value: 9.8897
- type: nauc_precision_at_10_diff1 value: 26.1021
- type: nauc_precision_at_20_max value: 26.243499999999997
- type: nauc_precision_at_20_std value: 12.251
- type: nauc_precision_at_20_diff1 value: 21.073800000000002
- type: nauc_precision_at_100_max value: 14.847199999999999
- type: nauc_precision_at_100_std value: 18.3256
- type: nauc_precision_at_100_diff1 value: 6.4467
- type: nauc_precision_at_1000_max value: 3.5059
- type: nauc_precision_at_1000_std value: 12.027000000000001
- type: nauc_precision_at_1000_diff1 value: -10.6274
- type: nauc_mrr_at_1_max value: 34.191
- type: nauc_mrr_at_1_std value: 0.2555
- type: nauc_mrr_at_1_diff1 value: 55.12590000000001
- type: nauc_mrr_at_3_max value: 32.2999
- type: nauc_mrr_at_3_std value: 1.8591
- type: nauc_mrr_at_3_diff1 value: 48.5279
- type: nauc_mrr_at_5_max value: 32.257799999999996
- type: nauc_mrr_at_5_std value: 2.8365
- type: nauc_mrr_at_5_diff1 value: 47.6701
- type: nauc_mrr_at_10_max value: 32.419399999999996
- type: nauc_mrr_at_10_std value: 3.0626
- type: nauc_mrr_at_10_diff1 value: 47.1638
- type: nauc_mrr_at_20_max value: 32.5848
- type: nauc_mrr_at_20_std value: 3.0636
- type: nauc_mrr_at_20_diff1 value: 47.218199999999996
- type: nauc_mrr_at_100_max value: 32.587500000000006
- type: nauc_mrr_at_100_std value: 3.2354000000000003
- type: nauc_mrr_at_100_diff1 value: 47.295
- type: nauc_mrr_at_1000_max value: 32.5994
- type: nauc_mrr_at_1000_std value: 3.2392999999999996
- type: nauc_mrr_at_1000_diff1 value: 47.3153
- type: main_score value: 32.482
task:
  type: Retrieval
- dataset:
config: default
name: MTEB ClimateFEVERHardNegatives (default)
revision: 3a309e201f3c2c4b13bd4a367a8f37eee2ec1d21
split: test
type: mteb/ClimateFEVER_test_top_250_only_w_correct-v2
metrics:
- type: ndcg_at_1 value: 14.099999999999998
- type: ndcg_at_3 value: 14.298
- type: ndcg_at_5 value: 16.078
- type: ndcg_at_10 value: 19.043
- type: ndcg_at_20 value: 21.663
- type: ndcg_at_100 value: 26.514
- type: ndcg_at_1000 value: 31.15
- type: map_at_1 value: 6.518
- type: map_at_3 value: 10.218
- type: map_at_5 value: 11.450000000000001
- type: map_at_10 value: 12.701
- type: map_at_20 value: 13.502
- type: map_at_100 value: 14.329
- type: map_at_1000 value: 14.560999999999998
- type: recall_at_1 value: 6.518
- type: recall_at_3 value: 14.197000000000001
- type: recall_at_5 value: 18.443
- type: recall_at_10 value: 25.233
- type: recall_at_20 value: 32.83
- type: recall_at_100 value: 51.82
- type: recall_at_1000 value: 78.238
- type: precision_at_1 value: 14.099999999999998
- type: precision_at_3 value: 10.767
- type: precision_at_5 value: 8.780000000000001
- type: precision_at_10 value: 6.2700000000000005
- type: precision_at_20 value: 4.22
- type: precision_at_100 value: 1.422
- type: precision_at_1000 value: 0.22899999999999998
- type: mrr_at_1 value: 14.099999999999998
- type: mrr_at_3 value: 21.099999999999998
- type: mrr_at_5 value: 22.855
- type: mrr_at_10 value: 24.427799999999998
- type: mrr_at_20 value: 25.1863
- type: mrr_at_100 value: 25.682899999999997
- type: mrr_at_1000 value: 25.749499999999998
- type: nauc_ndcg_at_1_max value: 17.3767
- type: nauc_ndcg_at_1_std value: 9.2458
- type: nauc_ndcg_at_1_diff1 value: 16.304199999999998
- type: nauc_ndcg_at_3_max value: 25.369999999999997
- type: nauc_ndcg_at_3_std value: 14.0289
- type: nauc_ndcg_at_3_diff1 value: 13.3376
- type: nauc_ndcg_at_5_max value: 25.8672
- type: nauc_ndcg_at_5_std value: 16.2133
- type: nauc_ndcg_at_5_diff1 value: 12.6441
- type: nauc_ndcg_at_10_max value: 27.3825
- type: nauc_ndcg_at_10_std value: 19.1307
- type: nauc_ndcg_at_10_diff1 value: 12.8491
- type: nauc_ndcg_at_20_max value: 28.402300000000004
- type: nauc_ndcg_at_20_std value: 19.024
- type: nauc_ndcg_at_20_diff1 value: 12.4925
- type: nauc_ndcg_at_100_max value: 31.1216
- type: nauc_ndcg_at_100_std value: 21.588099999999997
- type: nauc_ndcg_at_100_diff1 value: 11.2177
- type: nauc_ndcg_at_1000_max value: 31.4444
- type: nauc_ndcg_at_1000_std value: 21.7737
- type: nauc_ndcg_at_1000_diff1 value: 11.9895
- type: nauc_map_at_1_max value: 18.0146
- type: nauc_map_at_1_std value: 10.992799999999999
- type: nauc_map_at_1_diff1 value: 18.0204
- type: nauc_map_at_3_max value: 23.6696
- type: nauc_map_at_3_std value: 12.947600000000001
- type: nauc_map_at_3_diff1 value: 14.0274
- type: nauc_map_at_5_max value: 24.5524
- type: nauc_map_at_5_std value: 15.2125
- type: nauc_map_at_5_diff1 value: 13.4579
- type: nauc_map_at_10_max value: 25.3924
- type: nauc_map_at_10_std value: 16.769000000000002
- type: nauc_map_at_10_diff1 value: 13.725999999999999
- type: nauc_map_at_20_max value: 25.9845
- type: nauc_map_at_20_std value: 16.9583
- type: nauc_map_at_20_diff1 value: 13.5333
- type: nauc_map_at_100_max value: 26.674300000000002
- type: nauc_map_at_100_std value: 17.769099999999998
- type: nauc_map_at_100_diff1 value: 13.095399999999998
- type: nauc_map_at_1000_max value: 26.7523
- type: nauc_map_at_1000_std value: 17.8361
- type: nauc_map_at_1000_diff1 value: 13.153799999999999
- type: nauc_recall_at_1_max value: 18.0146
- type: nauc_recall_at_1_std value: 10.992799999999999
- type: nauc_recall_at_1_diff1 value: 18.0204
- type: nauc_recall_at_3_max value: 26.7331
- type: nauc_recall_at_3_std value: 13.608799999999999
- type: nauc_recall_at_3_diff1 value: 10.7863
- type: nauc_recall_at_5_max value: 26.235000000000003
- type: nauc_recall_at_5_std value: 16.8335
- type: nauc_recall_at_5_diff1 value: 9.4389
- type: nauc_recall_at_10_max value: 27.0233
- type: nauc_recall_at_10_std value: 20.7401
- type: nauc_recall_at_10_diff1 value: 9.589
- type: nauc_recall_at_20_max value: 27.3646
- type: nauc_recall_at_20_std value: 18.7408
- type: nauc_recall_at_20_diff1 value: 8.3524
- type: nauc_recall_at_100_max value: 31.565900000000003
- type: nauc_recall_at_100_std value: 22.7502
- type: nauc_recall_at_100_diff1 value: 3.5892
- type: nauc_recall_at_1000_max value: 35.854
- type: nauc_recall_at_1000_std value: 25.2455
- type: nauc_recall_at_1000_diff1 value: 5.25
- type: nauc_precision_at_1_max value: 17.3767
- type: nauc_precision_at_1_std value: 9.2458
- type: nauc_precision_at_1_diff1 value: 16.304199999999998
- type: nauc_precision_at_3_max value: 29.8514
- type: nauc_precision_at_3_std value: 17.3344
- type: nauc_precision_at_3_diff1 value: 12.7965
- type: nauc_precision_at_5_max value: 29.9122
- type: nauc_precision_at_5_std value: 22.0638
- type: nauc_precision_at_5_diff1 value: 10.9401
- type: nauc_precision_at_10_max value: 31.2731
- type: nauc_precision_at_10_std value: 26.3173
- type: nauc_precision_at_10_diff1 value: 10.0175
- type: nauc_precision_at_20_max value: 30.667
- type: nauc_precision_at_20_std value: 23.4944
- type: nauc_precision_at_20_diff1 value: 8.1778
- type: nauc_precision_at_100_max value: 30.5903
- type: nauc_precision_at_100_std value: 25.1048
- type: nauc_precision_at_100_diff1 value: 3.2702
- type: nauc_precision_at_1000_max value: 19.7081
- type: nauc_precision_at_1000_std value: 17.7857
- type: nauc_precision_at_1000_diff1 value: 2.1989
- type: nauc_mrr_at_1_max value: 17.3767
- type: nauc_mrr_at_1_std value: 9.2458
- type: nauc_mrr_at_1_diff1 value: 16.304199999999998
- type: nauc_mrr_at_3_max value: 24.1474
- type: nauc_mrr_at_3_std value: 13.4213
- type: nauc_mrr_at_3_diff1 value: 14.266300000000001
- type: nauc_mrr_at_5_max value: 23.8946
- type: nauc_mrr_at_5_std value: 13.9119
- type: nauc_mrr_at_5_diff1 value: 13.9569
- type: nauc_mrr_at_10_max value: 24.5762
- type: nauc_mrr_at_10_std value: 15.343699999999998
- type: nauc_mrr_at_10_diff1 value: 13.8355
- type: nauc_mrr_at_20_max value: 24.7856
- type: nauc_mrr_at_20_std value: 15.1997
- type: nauc_mrr_at_20_diff1 value: 13.9615
- type: nauc_mrr_at_100_max value: 24.913899999999998
- type: nauc_mrr_at_100_std value: 15.2973
- type: nauc_mrr_at_100_diff1 value: 13.9054
- type: nauc_mrr_at_1000_max value: 24.8602
- type: nauc_mrr_at_1000_std value: 15.264800000000001
- type: nauc_mrr_at_1000_diff1 value: 13.888200000000001
- type: main_score value: 19.043
task:
  type: Retrieval
- dataset:
config: default
name: MTEB FEVERHardNegatives (default)
revision: 080c9ed6267b65029207906e815d44a9240bafca
split: test
type: mteb/FEVER_test_top_250_only_w_correct-v2
metrics:
- type: ndcg_at_1 value: 47.099999999999994
- type: ndcg_at_3 value: 57.99100000000001
- type: ndcg_at_5 value: 60.948
- type: ndcg_at_10 value: 63.754999999999995
- type: ndcg_at_20 value: 65.649
- type: ndcg_at_100 value: 67.041
- type: ndcg_at_1000 value: 67.422
- type: map_at_1 value: 44.85
- type: map_at_3 value: 54.299
- type: map_at_5 value: 55.986000000000004
- type: map_at_10 value: 57.166
- type: map_at_20 value: 57.709999999999994
- type: map_at_100 value: 57.94200000000001
- type: map_at_1000 value: 57.964000000000006
- type: recall_at_1 value: 44.85
- type: recall_at_3 value: 65.917
- type: recall_at_5 value: 73.098
- type: recall_at_10 value: 81.54
- type: recall_at_20 value: 88.725
- type: recall_at_100 value: 95.53
- type: recall_at_1000 value: 97.989
- type: precision_at_1 value: 47.099999999999994
- type: precision_at_3 value: 23.333000000000002
- type: precision_at_5 value: 15.58
- type: precision_at_10 value: 8.73
- type: precision_at_20 value: 4.784999999999999
- type: precision_at_100 value: 1.048
- type: precision_at_1000 value: 0.11
- type: mrr_at_1 value: 47.099999999999994
- type: mrr_at_3 value: 56.9833
- type: mrr_at_5 value: 58.6933
- type: mrr_at_10 value: 59.913700000000006
- type: mrr_at_20 value: 60.4366
- type: mrr_at_100 value: 60.6124
- type: mrr_at_1000 value: 60.616800000000005
- type: nauc_ndcg_at_1_max value: 14.541100000000002
- type: nauc_ndcg_at_1_std value: -20.9154
- type: nauc_ndcg_at_1_diff1 value: 51.640699999999995
- type: nauc_ndcg_at_3_max value: 16.5821
- type: nauc_ndcg_at_3_std value: -21.64
- type: nauc_ndcg_at_3_diff1 value: 43.948
- type: nauc_ndcg_at_5_max value: 16.4971
- type: nauc_ndcg_at_5_std value: -20.849500000000003
- type: nauc_ndcg_at_5_diff1 value: 43.0631
- type: nauc_ndcg_at_10_max value: 15.839400000000001
- type: nauc_ndcg_at_10_std value: -21.0278
- type: nauc_ndcg_at_10_diff1 value: 43.7884
- type: nauc_ndcg_at_20_max value: 16.1081
- type: nauc_ndcg_at_20_std value: -19.7606
- type: nauc_ndcg_at_20_diff1 value: 44.4262
- type: nauc_ndcg_at_100_max value: 15.998899999999999
- type: nauc_ndcg_at_100_std value: -19.619500000000002
- type: nauc_ndcg_at_100_diff1 value: 44.5225
- type: nauc_ndcg_at_1000_max value: 16.069
- type: nauc_ndcg_at_1000_std value: -19.4906
- type: nauc_ndcg_at_1000_diff1 value: 44.4003
- type: nauc_map_at_1_max value: 12.4983
- type: nauc_map_at_1_std value: -19.7
- type: nauc_map_at_1_diff1 value: 48.598400000000005
- type: nauc_map_at_3_max value: 15.2542
- type: nauc_map_at_3_std value: -20.7008
- type: nauc_map_at_3_diff1 value: 44.5092
- type: nauc_map_at_5_max value: 15.273700000000002
- type: nauc_map_at_5_std value: -20.3894
- type: nauc_map_at_5_diff1 value: 44.1826
- type: nauc_map_at_10_max value: 15.004700000000001
- type: nauc_map_at_10_std value: -20.4971
- type: nauc_map_at_10_diff1 value: 44.428200000000004
- type: nauc_map_at_20_max value: 15.065000000000001
- type: nauc_map_at_20_std value: -20.189799999999998
- type: nauc_map_at_20_diff1 value: 44.5691
- type: nauc_map_at_100_max value: 15.0534
- type: nauc_map_at_100_std value: -20.1541
- type: nauc_map_at_100_diff1 value: 44.6102
- type: nauc_map_at_1000_max value: 15.058399999999999
- type: nauc_map_at_1000_std value: -20.1422
- type: nauc_map_at_1000_diff1 value: 44.6041
- type: nauc_recall_at_1_max value: 12.4983
- type: nauc_recall_at_1_std value: -19.7
- type: nauc_recall_at_1_diff1 value: 48.598400000000005
- type: nauc_recall_at_3_max value: 18.0779
- type: nauc_recall_at_3_std value: -21.8811
- type: nauc_recall_at_3_diff1 value: 37.594300000000004
- type: nauc_recall_at_5_max value: 18.074299999999997
- type: nauc_recall_at_5_std value: -19.465
- type: nauc_recall_at_5_diff1 value: 33.3804
- type: nauc_recall_at_10_max value: 15.118200000000002
- type: nauc_recall_at_10_std value: -19.464000000000002
- type: nauc_recall_at_10_diff1 value: 33.4801
- type: nauc_recall_at_20_max value: 17.180500000000002
- type: nauc_recall_at_20_std value: -7.6669
- type: nauc_recall_at_20_diff1 value: 33.8144
- type: nauc_recall_at_100_max value: 14.7357
- type: nauc_recall_at_100_std value: 10.3128
- type: nauc_recall_at_100_diff1 value: 22.4137
- type: nauc_recall_at_1000_max value: 22.8095
- type: nauc_recall_at_1000_std value: 48.4682
- type: nauc_recall_at_1000_diff1 value: -2.0866
- type: nauc_precision_at_1_max value: 14.541100000000002
- type: nauc_precision_at_1_std value: -20.9154
- type: nauc_precision_at_1_diff1 value: 51.640699999999995
- type: nauc_precision_at_3_max value: 20.513
- type: nauc_precision_at_3_std value: -25.9636
- type: nauc_precision_at_3_diff1 value: 40.8703
- type: nauc_precision_at_5_max value: 20.955
- type: nauc_precision_at_5_std value: -24.482400000000002
- type: nauc_precision_at_5_diff1 value: 36.600500000000004
- type: nauc_precision_at_10_max value: 18.8806
- type: nauc_precision_at_10_std value: -24.901200000000003
- type: nauc_precision_at_10_diff1 value: 35.8153
- type: nauc_precision_at_20_max value: 18.9481
- type: nauc_precision_at_20_std value: -10.5055
- type: nauc_precision_at_20_diff1 value: 29.369
- type: nauc_precision_at_100_max value: 14.1911
- type: nauc_precision_at_100_std value: 7.6478
- type: nauc_precision_at_100_diff1 value: 0.9292999999999999
- type: nauc_precision_at_1000_max value: 5.2714
- type: nauc_precision_at_1000_std value: 9.8453
- type: nauc_precision_at_1000_diff1 value: -11.8428
- type: nauc_mrr_at_1_max value: 14.541100000000002
- type: nauc_mrr_at_1_std value: -20.9154
- type: nauc_mrr_at_1_diff1 value: 51.640699999999995
- type: nauc_mrr_at_3_max value: 17.4433
- type: nauc_mrr_at_3_std value: -22.367600000000003
- type: nauc_mrr_at_3_diff1 value: 47.6952
- type: nauc_mrr_at_5_max value: 17.3538
- type: nauc_mrr_at_5_std value: -22.003
- type: nauc_mrr_at_5_diff1 value: 47.3432
- type: nauc_mrr_at_10_max value: 17.1856
- type: nauc_mrr_at_10_std value: -22.0944
- type: nauc_mrr_at_10_diff1 value: 47.6806
- type: nauc_mrr_at_20_max value: 17.2046
- type: nauc_mrr_at_20_std value: -21.7914
- type: nauc_mrr_at_20_diff1 value: 47.7943
- type: nauc_mrr_at_100_max value: 17.1348
- type: nauc_mrr_at_100_std value: -21.8049
- type: nauc_mrr_at_100_diff1 value: 47.7973
- type: nauc_mrr_at_1000_max value: 17.1388
- type: nauc_mrr_at_1000_std value: -21.8013
- type: nauc_mrr_at_1000_diff1 value: 47.7986
- type: main_score value: 63.754999999999995 task: type: Retrieval
- dataset:
config: default
name: MTEB FiQA2018 (default)
revision: 27a168819829fe9bcd655c2df245fb19452e8e06
split: test
type: mteb/fiqa
metrics:
- type: ndcg_at_1 value: 28.549000000000003
- type: ndcg_at_3 value: 26.496
- type: ndcg_at_5 value: 27.229999999999997
- type: ndcg_at_10 value: 29.284
- type: ndcg_at_20 value: 31.747999999999998
- type: ndcg_at_100 value: 35.562
- type: ndcg_at_1000 value: 39.553
- type: map_at_1 value: 13.969999999999999
- type: map_at_3 value: 19.826
- type: map_at_5 value: 21.349999999999998
- type: map_at_10 value: 22.842000000000002
- type: map_at_20 value: 23.71
- type: map_at_100 value: 24.383
- type: map_at_1000 value: 24.587999999999997
- type: recall_at_1 value: 13.969999999999999
- type: recall_at_3 value: 23.923
- type: recall_at_5 value: 28.166000000000004
- type: recall_at_10 value: 34.657
- type: recall_at_20 value: 42.445
- type: recall_at_100 value: 58.626999999999995
- type: recall_at_1000 value: 83.154
- type: precision_at_1 value: 28.549000000000003
- type: precision_at_3 value: 17.747
- type: precision_at_5 value: 13.056000000000001
- type: precision_at_10 value: 8.333
- type: precision_at_20 value: 5.154
- type: precision_at_100 value: 1.4569999999999999
- type: precision_at_1000 value: 0.216
- type: mrr_at_1 value: 28.549400000000002
- type: mrr_at_3 value: 34.5679
- type: mrr_at_5 value: 35.7407
- type: mrr_at_10 value: 36.619
- type: mrr_at_20 value: 37.141000000000005
- type: mrr_at_100 value: 37.5101
- type: mrr_at_1000 value: 37.5778
- type: nauc_ndcg_at_1_max value: 26.9011
- type: nauc_ndcg_at_1_std value: -4.1662
- type: nauc_ndcg_at_1_diff1 value: 36.0761
- type: nauc_ndcg_at_3_max value: 27.5647
- type: nauc_ndcg_at_3_std value: 1.3891
- type: nauc_ndcg_at_3_diff1 value: 32.8922
- type: nauc_ndcg_at_5_max value: 24.807299999999998
- type: nauc_ndcg_at_5_std value: 2.2724
- type: nauc_ndcg_at_5_diff1 value: 31.646
- type: nauc_ndcg_at_10_max value: 24.806800000000003
- type: nauc_ndcg_at_10_std value: 3.9619
- type: nauc_ndcg_at_10_diff1 value: 31.943899999999996
- type: nauc_ndcg_at_20_max value: 25.282
- type: nauc_ndcg_at_20_std value: 4.6921
- type: nauc_ndcg_at_20_diff1 value: 31.3257
- type: nauc_ndcg_at_100_max value: 27.206799999999998
- type: nauc_ndcg_at_100_std value: 7.2548
- type: nauc_ndcg_at_100_diff1 value: 30.402800000000003
- type: nauc_ndcg_at_1000_max value: 28.302699999999998
- type: nauc_ndcg_at_1000_std value: 7.4432
- type: nauc_ndcg_at_1000_diff1 value: 30.4145
- type: nauc_map_at_1_max value: 17.934900000000003
- type: nauc_map_at_1_std value: -4.075
- type: nauc_map_at_1_diff1 value: 41.3467
- type: nauc_map_at_3_max value: 22.6649
- type: nauc_map_at_3_std value: -0.0022
- type: nauc_map_at_3_diff1 value: 35.949799999999996
- type: nauc_map_at_5_max value: 22.2973
- type: nauc_map_at_5_std value: 1.1874
- type: nauc_map_at_5_diff1 value: 34.765
- type: nauc_map_at_10_max value: 23.472199999999997
- type: nauc_map_at_10_std value: 2.6841
- type: nauc_map_at_10_diff1 value: 34.2725
- type: nauc_map_at_20_max value: 24.009900000000002
- type: nauc_map_at_20_std value: 2.9796
- type: nauc_map_at_20_diff1 value: 34.0755
- type: nauc_map_at_100_max value: 24.5888
- type: nauc_map_at_100_std value: 3.5168999999999997
- type: nauc_map_at_100_diff1 value: 33.795700000000004
- type: nauc_map_at_1000_max value: 24.7001
- type: nauc_map_at_1000_std value: 3.6033999999999997
- type: nauc_map_at_1000_diff1 value: 33.7896
- type: nauc_recall_at_1_max value: 17.934900000000003
- type: nauc_recall_at_1_std value: -4.075
- type: nauc_recall_at_1_diff1 value: 41.3467
- type: nauc_recall_at_3_max value: 21.0507
- type: nauc_recall_at_3_std value: 1.6584999999999999
- type: nauc_recall_at_3_diff1 value: 30.5016
- type: nauc_recall_at_5_max value: 18.229100000000003
- type: nauc_recall_at_5_std value: 4.2212
- type: nauc_recall_at_5_diff1 value: 26.2222
- type: nauc_recall_at_10_max value: 18.9163
- type: nauc_recall_at_10_std value: 7.421600000000001
- type: nauc_recall_at_10_diff1 value: 25.0319
- type: nauc_recall_at_20_max value: 19.1985
- type: nauc_recall_at_20_std value: 9.6619
- type: nauc_recall_at_20_diff1 value: 22.0881
- type: nauc_recall_at_100_max value: 23.177400000000002
- type: nauc_recall_at_100_std value: 20.3361
- type: nauc_recall_at_100_diff1 value: 17.4315
- type: nauc_recall_at_1000_max value: 29.7752
- type: nauc_recall_at_1000_std value: 30.336600000000004
- type: nauc_recall_at_1000_diff1 value: 13.9819
- type: nauc_precision_at_1_max value: 26.9011
- type: nauc_precision_at_1_std value: -4.1662
- type: nauc_precision_at_1_diff1 value: 36.0761
- type: nauc_precision_at_3_max value: 31.3449
- type: nauc_precision_at_3_std value: 5.3401
- type: nauc_precision_at_3_diff1 value: 23.5782
- type: nauc_precision_at_5_max value: 29.545700000000004
- type: nauc_precision_at_5_std value: 7.859299999999999
- type: nauc_precision_at_5_diff1 value: 17.5104
- type: nauc_precision_at_10_max value: 31.787599999999998
- type: nauc_precision_at_10_std value: 12.7279
- type: nauc_precision_at_10_diff1 value: 15.021899999999999
- type: nauc_precision_at_20_max value: 31.782899999999998
- type: nauc_precision_at_20_std value: 13.050600000000001
- type: nauc_precision_at_20_diff1 value: 12.4427
- type: nauc_precision_at_100_max value: 33.4844
- type: nauc_precision_at_100_std value: 17.4908
- type: nauc_precision_at_100_diff1 value: 4.0221
- type: nauc_precision_at_1000_max value: 27.701199999999996
- type: nauc_precision_at_1000_std value: 13.0084
- type: nauc_precision_at_1000_diff1 value: -5.0355
- type: nauc_mrr_at_1_max value: 26.9011
- type: nauc_mrr_at_1_std value: -4.1662
- type: nauc_mrr_at_1_diff1 value: 36.0761
- type: nauc_mrr_at_3_max value: 26.51
- type: nauc_mrr_at_3_std value: -1.6091000000000002
- type: nauc_mrr_at_3_diff1 value: 32.0993
- type: nauc_mrr_at_5_max value: 26.502599999999997
- type: nauc_mrr_at_5_std value: -0.9911
- type: nauc_mrr_at_5_diff1 value: 31.578200000000002
- type: nauc_mrr_at_10_max value: 26.643099999999997
- type: nauc_mrr_at_10_std value: -0.46950000000000003
- type: nauc_mrr_at_10_diff1 value: 31.572899999999997
- type: nauc_mrr_at_20_max value: 26.511699999999998
- type: nauc_mrr_at_20_std value: -0.4706
- type: nauc_mrr_at_20_diff1 value: 31.4157
- type: nauc_mrr_at_100_max value: 26.5992
- type: nauc_mrr_at_100_std value: -0.3074
- type: nauc_mrr_at_100_diff1 value: 31.397000000000002
- type: nauc_mrr_at_1000_max value: 26.5961
- type: nauc_mrr_at_1000_std value: -0.3261
- type: nauc_mrr_at_1000_diff1 value: 31.418200000000002
- type: main_score value: 29.284 task: type: Retrieval
- dataset:
config: default
name: MTEB HotpotQAHardNegatives (default)
revision: 617612fa63afcb60e3b134bed8b7216a99707c37
split: test
type: mteb/HotpotQA_test_top_250_only_w_correct-v2
metrics:
- type: ndcg_at_1 value: 51.4
- type: ndcg_at_3 value: 39.722
- type: ndcg_at_5 value: 42.335
- type: ndcg_at_10 value: 45.302
- type: ndcg_at_20 value: 47.589999999999996
- type: ndcg_at_100 value: 51.339
- type: ndcg_at_1000 value: 54.042
- type: map_at_1 value: 25.7
- type: map_at_3 value: 32.975
- type: map_at_5 value: 34.707
- type: map_at_10 value: 36.212
- type: map_at_20 value: 37.03
- type: map_at_100 value: 37.718
- type: map_at_1000 value: 37.858999999999995
- type: recall_at_1 value: 25.7
- type: recall_at_3 value: 36.95
- type: recall_at_5 value: 42.1
- type: recall_at_10 value: 49.5
- type: recall_at_20 value: 56.85
- type: recall_at_100 value: 73.5
- type: recall_at_1000 value: 91.14999999999999
- type: precision_at_1 value: 51.4
- type: precision_at_3 value: 24.633
- type: precision_at_5 value: 16.84
- type: precision_at_10 value: 9.9
- type: precision_at_20 value: 5.685
- type: precision_at_100 value: 1.47
- type: precision_at_1000 value: 0.182
- type: mrr_at_1 value: 51.4
- type: mrr_at_3 value: 57.283300000000004
- type: mrr_at_5 value: 58.568299999999994
- type: mrr_at_10 value: 59.618700000000004
- type: mrr_at_20 value: 60.046200000000006
- type: mrr_at_100 value: 60.3154
- type: mrr_at_1000 value: 60.3441
- type: nauc_ndcg_at_1_max value: 45.0721
- type: nauc_ndcg_at_1_std value: -4.7617
- type: nauc_ndcg_at_1_diff1 value: 60.8946
- type: nauc_ndcg_at_3_max value: 41.3688
- type: nauc_ndcg_at_3_std value: -0.7188
- type: nauc_ndcg_at_3_diff1 value: 46.8131
- type: nauc_ndcg_at_5_max value: 40.6604
- type: nauc_ndcg_at_5_std value: 0.0927
- type: nauc_ndcg_at_5_diff1 value: 45.0972
- type: nauc_ndcg_at_10_max value: 40.6415
- type: nauc_ndcg_at_10_std value: 1.2045
- type: nauc_ndcg_at_10_diff1 value: 43.893100000000004
- type: nauc_ndcg_at_20_max value: 40.6535
- type: nauc_ndcg_at_20_std value: 2.9401
- type: nauc_ndcg_at_20_diff1 value: 43.762
- type: nauc_ndcg_at_100_max value: 42.9132
- type: nauc_ndcg_at_100_std value: 5.8547
- type: nauc_ndcg_at_100_diff1 value: 45.0353
- type: nauc_ndcg_at_1000_max value: 42.8897
- type: nauc_ndcg_at_1000_std value: 5.562
- type: nauc_ndcg_at_1000_diff1 value: 45.051
- type: nauc_map_at_1_max value: 45.0721
- type: nauc_map_at_1_std value: -4.7617
- type: nauc_map_at_1_diff1 value: 60.8946
- type: nauc_map_at_3_max value: 40.3619
- type: nauc_map_at_3_std value: 0.7892
- type: nauc_map_at_3_diff1 value: 43.7742
- type: nauc_map_at_5_max value: 39.857
- type: nauc_map_at_5_std value: 1.3318999999999999
- type: nauc_map_at_5_diff1 value: 42.768
- type: nauc_map_at_10_max value: 39.8836
- type: nauc_map_at_10_std value: 1.9564000000000001
- type: nauc_map_at_10_diff1 value: 42.2925
- type: nauc_map_at_20_max value: 39.8653
- type: nauc_map_at_20_std value: 2.4855
- type: nauc_map_at_20_diff1 value: 42.3024
- type: nauc_map_at_100_max value: 40.2949
- type: nauc_map_at_100_std value: 3.0113000000000003
- type: nauc_map_at_100_diff1 value: 42.6062
- type: nauc_map_at_1000_max value: 40.2828
- type: nauc_map_at_1000_std value: 3.0048
- type: nauc_map_at_1000_diff1 value: 42.6009
- type: nauc_recall_at_1_max value: 45.0721
- type: nauc_recall_at_1_std value: -4.7617
- type: nauc_recall_at_1_diff1 value: 60.8946
- type: nauc_recall_at_3_max value: 38.8376
- type: nauc_recall_at_3_std value: 1.5544
- type: nauc_recall_at_3_diff1 value: 39.1529
- type: nauc_recall_at_5_max value: 36.391400000000004
- type: nauc_recall_at_5_std value: 3.1532999999999998
- type: nauc_recall_at_5_diff1 value: 34.660000000000004
- type: nauc_recall_at_10_max value: 33.7108
- type: nauc_recall_at_10_std value: 5.743
- type: nauc_recall_at_10_diff1 value: 28.9605
- type: nauc_recall_at_20_max value: 32.0646
- type: nauc_recall_at_20_std value: 11.411999999999999
- type: nauc_recall_at_20_diff1 value: 26.562200000000004
- type: nauc_recall_at_100_max value: 39.3941
- type: nauc_recall_at_100_std value: 28.2403
- type: nauc_recall_at_100_diff1 value: 26.353700000000003
- type: nauc_recall_at_1000_max value: 43.751400000000004
- type: nauc_recall_at_1000_std value: 55.13249999999999
- type: nauc_recall_at_1000_diff1 value: 10.1938
- type: nauc_precision_at_1_max value: 45.0721
- type: nauc_precision_at_1_std value: -4.7617
- type: nauc_precision_at_1_diff1 value: 60.8946
- type: nauc_precision_at_3_max value: 38.8376
- type: nauc_precision_at_3_std value: 1.5544
- type: nauc_precision_at_3_diff1 value: 39.1529
- type: nauc_precision_at_5_max value: 36.391400000000004
- type: nauc_precision_at_5_std value: 3.1532999999999998
- type: nauc_precision_at_5_diff1 value: 34.660000000000004
- type: nauc_precision_at_10_max value: 33.7108
- type: nauc_precision_at_10_std value: 5.743
- type: nauc_precision_at_10_diff1 value: 28.9605
- type: nauc_precision_at_20_max value: 32.0646
- type: nauc_precision_at_20_std value: 11.411999999999999
- type: nauc_precision_at_20_diff1 value: 26.562200000000004
- type: nauc_precision_at_100_max value: 39.3941
- type: nauc_precision_at_100_std value: 28.2403
- type: nauc_precision_at_100_diff1 value: 26.353700000000003
- type: nauc_precision_at_1000_max value: 43.751400000000004
- type: nauc_precision_at_1000_std value: 55.13249999999999
- type: nauc_precision_at_1000_diff1 value: 10.1938
- type: nauc_mrr_at_1_max value: 45.0721
- type: nauc_mrr_at_1_std value: -4.7617
- type: nauc_mrr_at_1_diff1 value: 60.8946
- type: nauc_mrr_at_3_max value: 44.7879
- type: nauc_mrr_at_3_std value: -5.1337
- type: nauc_mrr_at_3_diff1 value: 58.2349
- type: nauc_mrr_at_5_max value: 44.6627
- type: nauc_mrr_at_5_std value: -4.9526
- type: nauc_mrr_at_5_diff1 value: 57.7376
- type: nauc_mrr_at_10_max value: 44.7676
- type: nauc_mrr_at_10_std value: -4.7908
- type: nauc_mrr_at_10_diff1 value: 57.537400000000005
- type: nauc_mrr_at_20_max value: 44.7882
- type: nauc_mrr_at_20_std value: -4.5173
- type: nauc_mrr_at_20_diff1 value: 57.575900000000004
- type: nauc_mrr_at_100_max value: 44.9292
- type: nauc_mrr_at_100_std value: -4.4029
- type: nauc_mrr_at_100_diff1 value: 57.6909
- type: nauc_mrr_at_1000_max value: 44.912800000000004
- type: nauc_mrr_at_1000_std value: -4.429
- type: nauc_mrr_at_1000_diff1 value: 57.6896
- type: main_score value: 45.302 task: type: Retrieval
- dataset:
config: default
name: MTEB ImdbClassification (default)
revision: 3d86128a09e091d6018b6d26cad27f2739fc2db7
split: test
type: mteb/imdb
metrics:
- type: accuracy value: 71.792
- type: f1 value: 71.6599
- type: f1_weighted value: 71.6599
- type: ap value: 65.6717
- type: ap_weighted value: 65.6717
- type: main_score value: 71.792 task: type: Classification
- dataset:
config: en
name: MTEB MTOPDomainClassification (en)
revision: d80d48c1eb48d3562165c59d59d0034df9fff0bf
split: test
type: mteb/mtop_domain
metrics:
- type: accuracy value: 90.798
- type: f1 value: 90.14569999999999
- type: f1_weighted value: 90.8211
- type: main_score value: 90.798 task: type: Classification
- dataset:
config: en
name: MTEB MassiveIntentClassification (en)
revision: 4672e20407010da34463acc759c162ca9734bca6
split: test
type: mteb/amazon_massive_intent
metrics:
- type: accuracy value: 66.4829
- type: f1 value: 64.3878
- type: f1_weighted value: 65.2855
- type: main_score value: 66.4829 task: type: Classification
- dataset:
config: en
name: MTEB MassiveScenarioClassification (en)
revision: fad2c6e8459f9e1c45d9315f4953d921437d70f8
split: test
type: mteb/amazon_massive_scenario
metrics:
- type: accuracy value: 71.1903
- type: f1 value: 71.0214
- type: f1_weighted value: 70.7184
- type: main_score value: 71.1903 task: type: Classification
- dataset:
config: default
name: MTEB MedrxivClusteringP2P.v2 (default)
revision: e7a26af6f3ae46b30dde8737f02c07b1505bcc73
split: test
type: mteb/medrxiv-clustering-p2p
metrics:
- type: v_measure value: 35.781
- type: v_measure_std value: 0.7404
- type: main_score value: 35.781 task: type: Clustering
- dataset:
config: default
name: MTEB MedrxivClusteringS2S.v2 (default)
revision: 35191c8c0dca72d8ff3efcd72aa802307d469663
split: test
type: mteb/medrxiv-clustering-s2s
metrics:
- type: v_measure value: 33.900200000000005
- type: v_measure_std value: 0.8489
- type: main_score value: 33.900200000000005 task: type: Clustering
- dataset:
config: default
name: MTEB MindSmallReranking (default)
revision: 59042f120c80e8afa9cdbb224f67076cec0fc9a7
split: test
type: mteb/mind_small
metrics:
- type: map value: 29.646499999999996
- type: mrr value: 30.604799999999997
- type: nAUC_map_max value: -23.3675
- type: nAUC_map_std value: -5.0637
- type: nAUC_map_diff1 value: 13.4632
- type: nAUC_mrr_max value: -17.5124
- type: nAUC_mrr_std value: -2.8459000000000003
- type: nAUC_mrr_diff1 value: 12.4125
- type: main_score value: 29.646499999999996 task: type: Reranking
- dataset:
config: default
name: MTEB SCIDOCS (default)
revision: f8c2fcf00f625baaa80f62ec5bd9e1fff3b8ae88
split: test
type: mteb/scidocs
metrics:
- type: ndcg_at_1 value: 20
- type: ndcg_at_3 value: 15.842
- type: ndcg_at_5 value: 13.894
- type: ndcg_at_10 value: 16.926
- type: ndcg_at_20 value: 19.803
- type: ndcg_at_100 value: 25.081999999999997
- type: ndcg_at_1000 value: 30.864000000000004
- type: map_at_1 value: 4.093
- type: map_at_3 value: 7.091
- type: map_at_5 value: 8.389000000000001
- type: map_at_10 value: 9.831
- type: map_at_20 value: 10.801
- type: map_at_100 value: 11.815000000000001
- type: map_at_1000 value: 12.139999999999999
- type: recall_at_1 value: 4.093
- type: recall_at_3 value: 8.938
- type: recall_at_5 value: 12.323
- type: recall_at_10 value: 17.907
- type: recall_at_20 value: 24.708
- type: recall_at_100 value: 41.897
- type: recall_at_1000 value: 70.048
- type: precision_at_1 value: 20
- type: precision_at_3 value: 14.667
- type: precision_at_5 value: 12.120000000000001
- type: precision_at_10 value: 8.81
- type: precision_at_20 value: 6.08
- type: precision_at_100 value: 2.061
- type: precision_at_1000 value: 0.345
- type: mrr_at_1 value: 20
- type: mrr_at_3 value: 26.016699999999997
- type: mrr_at_5 value: 27.896700000000003
- type: mrr_at_10 value: 29.309800000000003
- type: mrr_at_20 value: 30.1817
- type: mrr_at_100 value: 30.642999999999997
- type: mrr_at_1000 value: 30.7072
- type: nauc_ndcg_at_1_max value: 25.9162
- type: nauc_ndcg_at_1_std value: 7.375800000000001
- type: nauc_ndcg_at_1_diff1 value: 21.4553
- type: nauc_ndcg_at_3_max value: 29.9782
- type: nauc_ndcg_at_3_std value: 11.0489
- type: nauc_ndcg_at_3_diff1 value: 17.3996
- type: nauc_ndcg_at_5_max value: 31.5098
- type: nauc_ndcg_at_5_std value: 13.3131
- type: nauc_ndcg_at_5_diff1 value: 18.3321
- type: nauc_ndcg_at_10_max value: 33.3401
- type: nauc_ndcg_at_10_std value: 16.1576
- type: nauc_ndcg_at_10_diff1 value: 16.9853
- type: nauc_ndcg_at_20_max value: 34.343
- type: nauc_ndcg_at_20_std value: 20.0335
- type: nauc_ndcg_at_20_diff1 value: 15.6531
- type: nauc_ndcg_at_100_max value: 37.066500000000005
- type: nauc_ndcg_at_100_std value: 26.8663
- type: nauc_ndcg_at_100_diff1 value: 16.4485
- type: nauc_ndcg_at_1000_max value: 37.6377
- type: nauc_ndcg_at_1000_std value: 28.4086
- type: nauc_ndcg_at_1000_diff1 value: 16.598
- type: nauc_map_at_1_max value: 25.571899999999996
- type: nauc_map_at_1_std value: 7.2567
- type: nauc_map_at_1_diff1 value: 21.1815
- type: nauc_map_at_3_max value: 29.7213
- type: nauc_map_at_3_std value: 9.027000000000001
- type: nauc_map_at_3_diff1 value: 17.6405
- type: nauc_map_at_5_max value: 30.912499999999998
- type: nauc_map_at_5_std value: 10.8177
- type: nauc_map_at_5_diff1 value: 18.2512
- type: nauc_map_at_10_max value: 32.1247
- type: nauc_map_at_10_std value: 13.3522
- type: nauc_map_at_10_diff1 value: 17.0684
- type: nauc_map_at_20_max value: 32.8604
- type: nauc_map_at_20_std value: 15.534899999999999
- type: nauc_map_at_20_diff1 value: 16.3024
- type: nauc_map_at_100_max value: 33.9481
- type: nauc_map_at_100_std value: 17.9563
- type: nauc_map_at_100_diff1 value: 16.5858
- type: nauc_map_at_1000_max value: 34.104099999999995
- type: nauc_map_at_1000_std value: 18.3399
- type: nauc_map_at_1000_diff1 value: 16.5982
- type: nauc_recall_at_1_max value: 25.571899999999996
- type: nauc_recall_at_1_std value: 7.2567
- type: nauc_recall_at_1_diff1 value: 21.1815
- type: nauc_recall_at_3_max value: 31.102
- type: nauc_recall_at_3_std value: 12.208
- type: nauc_recall_at_3_diff1 value: 15.7802
- type: nauc_recall_at_5_max value: 33.0649
- type: nauc_recall_at_5_std value: 15.7429
- type: nauc_recall_at_5_diff1 value: 17.3206
- type: nauc_recall_at_10_max value: 34.0055
- type: nauc_recall_at_10_std value: 19.4785
- type: nauc_recall_at_10_diff1 value: 13.9128
- type: nauc_recall_at_20_max value: 34.4532
- type: nauc_recall_at_20_std value: 26.6761
- type: nauc_recall_at_20_diff1 value: 10.6585
- type: nauc_recall_at_100_max value: 36.5745
- type: nauc_recall_at_100_std value: 39.6888
- type: nauc_recall_at_100_diff1 value: 11.683
- type: nauc_recall_at_1000_max value: 33.799
- type: nauc_recall_at_1000_std value: 44.5965
- type: nauc_recall_at_1000_diff1 value: 9.332699999999999
- type: nauc_precision_at_1_max value: 25.9162
- type: nauc_precision_at_1_std value: 7.375800000000001
- type: nauc_precision_at_1_diff1 value: 21.4553
- type: nauc_precision_at_3_max value: 31.4508
- type: nauc_precision_at_3_std value: 12.4827
- type: nauc_precision_at_3_diff1 value: 15.9863
- type: nauc_precision_at_5_max value: 33.2365
- type: nauc_precision_at_5_std value: 15.9467
- type: nauc_precision_at_5_diff1 value: 17.3246
- type: nauc_precision_at_10_max value: 34.1244
- type: nauc_precision_at_10_std value: 19.545
- type: nauc_precision_at_10_diff1 value: 14.082600000000001
- type: nauc_precision_at_20_max value: 34.367399999999996
- type: nauc_precision_at_20_std value: 26.530199999999997
- type: nauc_precision_at_20_diff1 value: 10.7493
- type: nauc_precision_at_100_max value: 36.3502
- type: nauc_precision_at_100_std value: 39.5794
- type: nauc_precision_at_100_diff1 value: 11.6971
- type: nauc_precision_at_1000_max value: 32.6092
- type: nauc_precision_at_1000_std value: 43.249500000000005
- type: nauc_precision_at_1000_diff1 value: 9.149899999999999
- type: nauc_mrr_at_1_max value: 25.9162
- type: nauc_mrr_at_1_std value: 7.375800000000001
- type: nauc_mrr_at_1_diff1 value: 21.4553
- type: nauc_mrr_at_3_max value: 28.1601
- type: nauc_mrr_at_3_std value: 11.7872
- type: nauc_mrr_at_3_diff1 value: 18.1467
- type: nauc_mrr_at_5_max value: 29.1462
- type: nauc_mrr_at_5_std value: 12.9036
- type: nauc_mrr_at_5_diff1 value: 18.834899999999998
- type: nauc_mrr_at_10_max value: 29.837799999999998
- type: nauc_mrr_at_10_std value: 13.2935
- type: nauc_mrr_at_10_diff1 value: 18.7271
- type: nauc_mrr_at_20_max value: 29.808600000000002
- type: nauc_mrr_at_20_std value: 13.7856
- type: nauc_mrr_at_20_diff1 value: 18.6675
- type: nauc_mrr_at_100_max value: 29.7584
- type: nauc_mrr_at_100_std value: 13.8851
- type: nauc_mrr_at_100_diff1 value: 18.601
- type: nauc_mrr_at_1000_max value: 29.7331
- type: nauc_mrr_at_1000_std value: 13.8237
- type: nauc_mrr_at_1000_diff1 value: 18.6124
- type: main_score value: 16.926 task: type: Retrieval
- dataset:
config: default
name: MTEB SICK-R (default)
revision: 20a6d6f312dd54037fe07a32d58e5e168867909d
split: test
type: mteb/sickr-sts
metrics:
- type: pearson value: 84.7166
- type: spearman value: 80.3972
- type: cosine_pearson value: 84.7166
- type: cosine_spearman value: 80.3972
- type: manhattan_pearson value: 81.3592
- type: manhattan_spearman value: 80.4202
- type: euclidean_pearson value: 81.3441
- type: euclidean_spearman value: 80.3972
- type: main_score value: 80.3972 task: type: STS
- dataset:
config: default
name: MTEB STS12 (default)
revision: a0d554a64d88156834ff5ae9920b964011b16384
split: test
type: mteb/sts12-sts
metrics:
- type: pearson value: 86.7684
- type: spearman value: 78.7071
- type: cosine_pearson value: 86.7684
- type: cosine_spearman value: 78.70899999999999
- type: manhattan_pearson value: 83.7029
- type: manhattan_spearman value: 78.7584
- type: euclidean_pearson value: 83.604
- type: euclidean_spearman value: 78.70899999999999
- type: main_score value: 78.70899999999999 task: type: STS
- dataset:
config: default
name: MTEB STS13 (default)
revision: 7e90230a92c190f1bf69ae9002b8cea547a64cca
split: test
type: mteb/sts13-sts
metrics:
- type: pearson value: 85.1773
- type: spearman value: 86.1602
- type: cosine_pearson value: 85.1773
- type: cosine_spearman value: 86.1602
- type: manhattan_pearson value: 84.7533
- type: manhattan_spearman value: 86.0645
- type: euclidean_pearson value: 84.8639
- type: euclidean_spearman value: 86.1602
- type: main_score value: 86.1602 task: type: STS
- dataset:
config: default
name: MTEB STS14 (default)
revision: 6031580fec1f6af667f0bd2da0a551cf4f0b2375
split: test
type: mteb/sts14-sts
metrics:
- type: pearson value: 82.87780000000001
- type: spearman value: 81.2081
- type: cosine_pearson value: 82.87780000000001
- type: cosine_spearman value: 81.2081
- type: manhattan_pearson value: 81.89750000000001
- type: manhattan_spearman value: 81.2182
- type: euclidean_pearson value: 81.917
- type: euclidean_spearman value: 81.2081
- type: main_score value: 81.2081 task: type: STS
- dataset:
config: default
name: MTEB STS15 (default)
revision: ae752c7c21bf194d8b67fd573edf7ae58183cbe3
split: test
type: mteb/sts15-sts
metrics:
- type: pearson value: 86.9104
- type: spearman value: 87.5072
- type: cosine_pearson value: 86.9104
- type: cosine_spearman value: 87.5073
- type: manhattan_pearson value: 86.74849999999999
- type: manhattan_spearman value: 87.4643
- type: euclidean_pearson value: 86.7938
- type: euclidean_spearman value: 87.5072
- type: main_score value: 87.5073 task: type: STS
- dataset:
config: en-en
name: MTEB STS17 (en-en)
revision: faeb762787bd10488a50c8b5be4a3b82e411949c
split: test
type: mteb/sts17-crosslingual-sts
metrics:
- type: pearson value: 89.4941
- type: spearman value: 88.9712
- type: cosine_pearson value: 89.4941
- type: cosine_spearman value: 88.9712
- type: manhattan_pearson value: 89.04039999999999
- type: manhattan_spearman value: 89.05720000000001
- type: euclidean_pearson value: 89.0296
- type: euclidean_spearman value: 88.9712
- type: main_score value: 88.9712 task: type: STS
- dataset:
config: en
name: MTEB STS22.v2 (en)
revision: d31f33a128469b20e357535c39b82fb3c3f6f2bd
split: test
type: mteb/sts22-crosslingual-sts
metrics:
- type: pearson value: 66.6691
- type: spearman value: 65.5503
- type: cosine_pearson value: 66.6691
- type: cosine_spearman value: 65.5503
- type: manhattan_pearson value: 67.6732
- type: manhattan_spearman value: 65.2781
- type: euclidean_pearson value: 67.6466
- type: euclidean_spearman value: 65.5503
- type: main_score value: 65.5503 task: type: STS
- dataset:
config: default
name: MTEB STSBenchmark (default)
revision: b0fddb56ed78048fa8b90373c8a3cfc37b684831
split: test
type: mteb/stsbenchmark-sts
metrics:
- type: pearson value: 85.8143
- type: spearman value: 86.40339999999999
- type: cosine_pearson value: 85.8143
- type: cosine_spearman value: 86.40339999999999
- type: manhattan_pearson value: 86.0569
- type: manhattan_spearman value: 86.3744
- type: euclidean_pearson value: 86.0947
- type: euclidean_spearman value: 86.40339999999999
- type: main_score value: 86.40339999999999 task: type: STS
- dataset:
config: default
name: MTEB SprintDuplicateQuestions (default)
revision: d66bd1f72af766a5cc4b0ca5e00c162f89e8cc46
split: test
type: mteb/sprintduplicatequestions-pairclassification
metrics:
- type: similarity_accuracy value: 99.8
- type: similarity_accuracy_threshold value: 71.084
- type: similarity_f1 value: 89.7462
- type: similarity_f1_threshold value: 71.084
- type: similarity_precision value: 91.134
- type: similarity_recall value: 88.4
- type: similarity_ap value: 94.32199999999999
- type: cosine_accuracy value: 99.8
- type: cosine_accuracy_threshold value: 71.084
- type: cosine_f1 value: 89.7462
- type: cosine_f1_threshold value: 71.084
- type: cosine_precision value: 91.134
- type: cosine_recall value: 88.4
- type: cosine_ap value: 94.32199999999999
- type: manhattan_accuracy value: 99.7941
- type: manhattan_accuracy_threshold value: 1641.3430999999998
- type: manhattan_f1 value: 89.6245
- type: manhattan_f1_threshold value: 1705.1424000000002
- type: manhattan_precision value: 88.5742
- type: manhattan_recall value: 90.7
- type: manhattan_ap value: 94.22840000000001
- type: euclidean_accuracy value: 99.8
- type: euclidean_accuracy_threshold value: 76.0474
- type: euclidean_f1 value: 89.7462
- type: euclidean_f1_threshold value: 76.0474
- type: euclidean_precision value: 91.134
- type: euclidean_recall value: 88.4
- type: euclidean_ap value: 94.32199999999999
- type: dot_accuracy value: 99.8
- type: dot_accuracy_threshold value: 71.084
- type: dot_f1 value: 89.7462
- type: dot_f1_threshold value: 71.084
- type: dot_precision value: 91.134
- type: dot_recall value: 88.4
- type: dot_ap value: 94.32199999999999
- type: max_accuracy value: 99.8
- type: max_f1 value: 89.7462
- type: max_precision value: 91.134
- type: max_recall value: 90.7
- type: max_ap value: 94.32199999999999
- type: main_score value: 94.32199999999999 task: type: PairClassification
- dataset:
config: default
name: MTEB StackExchangeClustering.v2 (default)
revision: 6cbc1f7b2bc0622f2e39d2c77fa502909748c259
split: test
type: mteb/stackexchange-clustering
metrics:
- type: v_measure value: 53.5198
- type: v_measure_std value: 0.6015
- type: main_score value: 53.5198 task: type: Clustering
- dataset:
config: default
name: MTEB StackExchangeClusteringP2P.v2 (default)
revision: 815ca46b2622cec33ccafc3735d572c266efdb44
split: test
type: mteb/stackexchange-clustering-p2p
metrics:
- type: v_measure value: 40.029399999999995
- type: v_measure_std value: 0.4919
- type: main_score value: 40.029399999999995 task: type: Clustering
- dataset:
config: default
name: MTEB SummEvalSummarization.v2 (default)
revision: cda12ad7615edc362dbf25a00fdd61d3b1eaf93c
split: test
type: mteb/summeval
metrics:
- type: pearson value: 33.6198
- type: spearman value: 30.206699999999998
- type: cosine_spearman value: 30.206699999999998
- type: cosine_pearson value: 33.6198
- type: dot_spearman value: 30.206699999999998
- type: dot_pearson value: 33.6198
- type: main_score value: 30.206699999999998 task: type: Summarization
- dataset:
config: default
name: MTEB TRECCOVID (default)
revision: bb9466bac8153a0349341eb1b22e06409e78ef4e
split: test
type: mteb/trec-covid
metrics:
- type: ndcg_at_1 value: 63
- type: ndcg_at_3 value: 66.47999999999999
- type: ndcg_at_5 value: 61.090999999999994
- type: ndcg_at_10 value: 56.823
- type: ndcg_at_20 value: 53.21
- type: ndcg_at_100 value: 42.365
- type: ndcg_at_1000 value: 40.819
- type: map_at_1 value: 0.186
- type: map_at_3 value: 0.527
- type: map_at_5 value: 0.762
- type: map_at_10 value: 1.275
- type: map_at_20 value: 2.177
- type: map_at_100 value: 6.935
- type: map_at_1000 value: 16.973
- type: recall_at_1 value: 0.186
- type: recall_at_3 value: 0.581
- type: recall_at_5 value: 0.8710000000000001
- type: recall_at_10 value: 1.582
- type: recall_at_20 value: 2.897
- type: recall_at_100 value: 10.546
- type: recall_at_1000 value: 38.541
- type: precision_at_1 value: 68
- type: precision_at_3 value: 70.667
- type: precision_at_5 value: 63.2
- type: precision_at_10 value: 58.4
- type: precision_at_20 value: 54.400000000000006
- type: precision_at_100 value: 42.46
- type: precision_at_1000 value: 17.657999999999998
- type: mrr_at_1 value: 68
- type: mrr_at_3 value: 79
- type: mrr_at_5 value: 79.5
- type: mrr_at_10 value: 79.8333
- type: mrr_at_20 value: 80.0152
- type: mrr_at_100 value: 80.0152
- type: mrr_at_1000 value: 80.0152
- type: nauc_ndcg_at_1_max value: -5.9922
- type: nauc_ndcg_at_1_std value: 0.42110000000000003
- type: nauc_ndcg_at_1_diff1 value: 23.3553
- type: nauc_ndcg_at_3_max value: 10.2171
- type: nauc_ndcg_at_3_std value: 17.6509
- type: nauc_ndcg_at_3_diff1 value: 14.5183
- type: nauc_ndcg_at_5_max value: 23.7407
- type: nauc_ndcg_at_5_std value: 37.241
- type: nauc_ndcg_at_5_diff1 value: 18.1059
- type: nauc_ndcg_at_10_max value: 29.640300000000003
- type: nauc_ndcg_at_10_std value: 41.2782
- type: nauc_ndcg_at_10_diff1 value: 8.6037
- type: nauc_ndcg_at_20_max value: 40.3419
- type: nauc_ndcg_at_20_std value: 52.5532
- type: nauc_ndcg_at_20_diff1 value: 8.1576
- type: nauc_ndcg_at_100_max value: 51.4533
- type: nauc_ndcg_at_100_std value: 69.6289
- type: nauc_ndcg_at_100_diff1 value: -3.2301
- type: nauc_ndcg_at_1000_max value: 56.962900000000005
- type: nauc_ndcg_at_1000_std value: 74.6131
- type: nauc_ndcg_at_1000_diff1 value: -8.241999999999999
- type: nauc_map_at_1_max value: -4.668
- type: nauc_map_at_1_std value: -10.0497
- type: nauc_map_at_1_diff1 value: 23.029700000000002
- type: nauc_map_at_3_max value: 0.6419
- type: nauc_map_at_3_std value: 1.0362
- type: nauc_map_at_3_diff1 value: 14.8847
- type: nauc_map_at_5_max value: 10.632
- type: nauc_map_at_5_std value: 14.382200000000001
- type: nauc_map_at_5_diff1 value: 17.8863
- type: nauc_map_at_10_max value: 16.8052
- type: nauc_map_at_10_std value: 21.084500000000002
- type: nauc_map_at_10_diff1 value: 15.3248
- type: nauc_map_at_20_max value: 27.3457
- type: nauc_map_at_20_std value: 34.2901
- type: nauc_map_at_20_diff1 value: 11.4443
- type: nauc_map_at_100_max value: 49.5995
- type: nauc_map_at_100_std value: 65.1028
- type: nauc_map_at_100_diff1 value: -1.8796
- type: nauc_map_at_1000_max value: 60.618399999999994
- type: nauc_map_at_1000_std value: 76.28399999999999
- type: nauc_map_at_1000_diff1 value: -13.772100000000002
- type: nauc_recall_at_1_max value: -4.668
- type: nauc_recall_at_1_std value: -10.0497
- type: nauc_recall_at_1_diff1 value: 23.029700000000002
- type: nauc_recall_at_3_max value: 0.0493
- type: nauc_recall_at_3_std value: 2.2468
- type: nauc_recall_at_3_diff1 value: 16.5914
- type: nauc_recall_at_5_max value: 9.1725
- type: nauc_recall_at_5_std value: 14.597999999999999
- type: nauc_recall_at_5_diff1 value: 18.6063
- type: nauc_recall_at_10_max value: 13.672400000000001
- type: nauc_recall_at_10_std value: 15.9268
- type: nauc_recall_at_10_diff1 value: 16.3772
- type: nauc_recall_at_20_max value: 21.4077
- type: nauc_recall_at_20_std value: 27.209
- type: nauc_recall_at_20_diff1 value: 14.8917
- type: nauc_recall_at_100_max value: 42.282799999999995
- type: nauc_recall_at_100_std value: 57.6084
- type: nauc_recall_at_100_diff1 value: 2.6269
- type: nauc_recall_at_1000_max value: 54.055
- type: nauc_recall_at_1000_std value: 68.8306
- type: nauc_recall_at_1000_diff1 value: -9.5473
- type: nauc_precision_at_1_max value: -1.8693000000000002
- type: nauc_precision_at_1_std value: -5.061800000000001
- type: nauc_precision_at_1_diff1 value: 39.6344
- type: nauc_precision_at_3_max value: 20.2643
- type: nauc_precision_at_3_std value: 23.1419
- type: nauc_precision_at_3_diff1 value: 20.305999999999997
- type: nauc_precision_at_5_max value: 35.8846
- type: nauc_precision_at_5_std value: 48.295
- type: nauc_precision_at_5_diff1 value: 22.5559
- type: nauc_precision_at_10_max value: 39.8361
- type: nauc_precision_at_10_std value: 46.245000000000005
- type: nauc_precision_at_10_diff1 value: 6.433800000000001
- type: nauc_precision_at_20_max value: 47.9467
- type: nauc_precision_at_20_std value: 57.981
- type: nauc_precision_at_20_diff1 value: 7.721699999999999
- type: nauc_precision_at_100_max value: 55.6948
- type: nauc_precision_at_100_std value: 71.6681
- type: nauc_precision_at_100_diff1 value: -5.4666
- type: nauc_precision_at_1000_max value: 49.0064
- type: nauc_precision_at_1000_std value: 56.2352
- type: nauc_precision_at_1000_diff1 value: -17.4375
- type: nauc_mrr_at_1_max value: -1.8693000000000002
- type: nauc_mrr_at_1_std value: -5.061800000000001
- type: nauc_mrr_at_1_diff1 value: 39.6344
- type: nauc_mrr_at_3_max value: 7.8541
- type: nauc_mrr_at_3_std value: 7.0844000000000005
- type: nauc_mrr_at_3_diff1 value: 44.6714
- type: nauc_mrr_at_5_max value: 7.070600000000001
- type: nauc_mrr_at_5_std value: 6.2793
- type: nauc_mrr_at_5_diff1 value: 43.1205
- type: nauc_mrr_at_10_max value: 5.829899999999999
- type: nauc_mrr_at_10_std value: 4.7435
- type: nauc_mrr_at_10_diff1 value: 42.8864
- type: nauc_mrr_at_20_max value: 4.8414
- type: nauc_mrr_at_20_std value: 3.7436
- type: nauc_mrr_at_20_diff1 value: 42.9607
- type: nauc_mrr_at_100_max value: 4.8414
- type: nauc_mrr_at_100_std value: 3.7436
- type: nauc_mrr_at_100_diff1 value: 42.9607
- type: nauc_mrr_at_1000_max value: 4.8414
- type: nauc_mrr_at_1000_std value: 3.7436
- type: nauc_mrr_at_1000_diff1 value: 42.9607
- type: main_score value: 56.823 task: type: Retrieval
- dataset:
config: default
name: MTEB Touche2020Retrieval.v3 (default)
revision: 431886eaecc48f067a3975b70d0949ea2862463c
split: test
type: mteb/webis-touche2020-v3
metrics:
- type: ndcg_at_1 value: 52.041000000000004
- type: ndcg_at_3 value: 52.178000000000004
- type: ndcg_at_5 value: 52.23100000000001
- type: ndcg_at_10 value: 47.693999999999996
- type: ndcg_at_20 value: 43.242999999999995
- type: ndcg_at_100 value: 51.503
- type: ndcg_at_1000 value: 63.939
- type: map_at_1 value: 2.407
- type: map_at_3 value: 6.193
- type: map_at_5 value: 9.617
- type: map_at_10 value: 15.279000000000002
- type: map_at_20 value: 21.498
- type: map_at_100 value: 30.198999999999998
- type: map_at_1000 value: 33.217
- type: recall_at_1 value: 2.407
- type: recall_at_3 value: 6.762
- type: recall_at_5 value: 11.392
- type: recall_at_10 value: 19.333
- type: recall_at_20 value: 30.013
- type: recall_at_100 value: 56.041
- type: recall_at_1000 value: 86.126
- type: precision_at_1 value: 61.224000000000004
- type: precision_at_3 value: 63.26500000000001
- type: precision_at_5 value: 62.449
- type: precision_at_10 value: 52.245
- type: precision_at_20 value: 42.041000000000004
- type: precision_at_100 value: 17.653
- type: precision_at_1000 value: 2.9819999999999998
- type: mrr_at_1 value: 61.224500000000006
- type: mrr_at_3 value: 74.1497
- type: mrr_at_5 value: 76.4966
- type: mrr_at_10 value: 76.7881
- type: mrr_at_20 value: 76.7881
- type: mrr_at_100 value: 76.7881
- type: mrr_at_1000 value: 76.7881
- type: nauc_ndcg_at_1_max value: 11.4245
- type: nauc_ndcg_at_1_std value: -14.1654
- type: nauc_ndcg_at_1_diff1 value: 8.206299999999999
- type: nauc_ndcg_at_3_max value: 9.2585
- type: nauc_ndcg_at_3_std value: -11.469999999999999
- type: nauc_ndcg_at_3_diff1 value: 16.437099999999997
- type: nauc_ndcg_at_5_max value: 4.9696
- type: nauc_ndcg_at_5_std value: -0.6109
- type: nauc_ndcg_at_5_diff1 value: 27.5214
- type: nauc_ndcg_at_10_max value: -1.3538
- type: nauc_ndcg_at_10_std value: -6.0539000000000005
- type: nauc_ndcg_at_10_diff1 value: 37.565799999999996
- type: nauc_ndcg_at_20_max value: -3.3665000000000003
- type: nauc_ndcg_at_20_std value: 0.364
- type: nauc_ndcg_at_20_diff1 value: 37.418800000000005
- type: nauc_ndcg_at_100_max value: -7.1732000000000005
- type: nauc_ndcg_at_100_std value: 6.9091
- type: nauc_ndcg_at_100_diff1 value: 31.342799999999997
- type: nauc_ndcg_at_1000_max value: 4.9213
- type: nauc_ndcg_at_1000_std value: 27.2304
- type: nauc_ndcg_at_1000_diff1 value: 26.5774
- type: nauc_map_at_1_max value: -10.1278
- type: nauc_map_at_1_std value: -30.9116
- type: nauc_map_at_1_diff1 value: 47.6006
- type: nauc_map_at_3_max value: -9.9654
- type: nauc_map_at_3_std value: -26.4025
- type: nauc_map_at_3_diff1 value: 40.3311
- type: nauc_map_at_5_max value: -10.3545
- type: nauc_map_at_5_std value: -21.662699999999997
- type: nauc_map_at_5_diff1 value: 46.1136
- type: nauc_map_at_10_max value: -9.528
- type: nauc_map_at_10_std value: -21.3903
- type: nauc_map_at_10_diff1 value: 41.5027
- type: nauc_map_at_20_max value: -7.0028999999999995
- type: nauc_map_at_20_std value: -15.9361
- type: nauc_map_at_20_diff1 value: 42.6171
- type: nauc_map_at_100_max value: -2.8579
- type: nauc_map_at_100_std value: -4.1692
- type: nauc_map_at_100_diff1 value: 35.200900000000004
- type: nauc_map_at_1000_max value: -0.1717
- type: nauc_map_at_1000_std value: 1.4015
- type: nauc_map_at_1000_diff1 value: 34.1462
- type: nauc_recall_at_1_max value: -10.1278
- type: nauc_recall_at_1_std value: -30.9116
- type: nauc_recall_at_1_diff1 value: 47.6006
- type: nauc_recall_at_3_max value: -9.7092
- type: nauc_recall_at_3_std value: -26.067800000000002
- type: nauc_recall_at_3_diff1 value: 44.094100000000005
- type: nauc_recall_at_5_max value: -16.8476
- type: nauc_recall_at_5_std value: -21.546799999999998
- type: nauc_recall_at_5_diff1 value: 51.0826
- type: nauc_recall_at_10_max value: -19.3996
- type: nauc_recall_at_10_std value: -23.857400000000002
- type: nauc_recall_at_10_diff1 value: 43.743900000000004
- type: nauc_recall_at_20_max value: -17.413500000000003
- type: nauc_recall_at_20_std value: -13.7552
- type: nauc_recall_at_20_diff1 value: 41.761900000000004
- type: nauc_recall_at_100_max value: -13.270399999999999
- type: nauc_recall_at_100_std value: 12.9632
- type: nauc_recall_at_100_diff1 value: 25.7781
- type: nauc_recall_at_1000_max value: 4.5253000000000005
- type: nauc_recall_at_1000_std value: 71.75280000000001
- type: nauc_recall_at_1000_diff1 value: 9.0837
- type: nauc_precision_at_1_max value: 26.4969
- type: nauc_precision_at_1_std value: -21.090600000000002
- type: nauc_precision_at_1_diff1 value: 25.671899999999997
- type: nauc_precision_at_3_max value: 17.132
- type: nauc_precision_at_3_std value: -14.341999999999999
- type: nauc_precision_at_3_diff1 value: 27.7326
- type: nauc_precision_at_5_max value: 10.6548
- type: nauc_precision_at_5_std value: 2.9193000000000002
- type: nauc_precision_at_5_diff1 value: 38.373400000000004
- type: nauc_precision_at_10_max value: 1.3576
- type: nauc_precision_at_10_std value: -3.8871
- type: nauc_precision_at_10_diff1 value: 33.6879
- type: nauc_precision_at_20_max value: 4.9846
- type: nauc_precision_at_20_std value: 16.8654
- type: nauc_precision_at_20_diff1 value: 25.1747
- type: nauc_precision_at_100_max value: 32.9312
- type: nauc_precision_at_100_std value: 50.7741
- type: nauc_precision_at_100_diff1 value: -19.561700000000002
- type: nauc_precision_at_1000_max value: 44.7539
- type: nauc_precision_at_1000_std value: 50.897800000000004
- type: nauc_precision_at_1000_diff1 value: -34.477999999999994
- type: nauc_mrr_at_1_max value: 26.4969
- type: nauc_mrr_at_1_std value: -21.090600000000002
- type: nauc_mrr_at_1_diff1 value: 25.671899999999997
- type: nauc_mrr_at_3_max value: 36.031600000000005
- type: nauc_mrr_at_3_std value: -9.915799999999999
- type: nauc_mrr_at_3_diff1 value: 32.4812
- type: nauc_mrr_at_5_max value: 32.5212
- type: nauc_mrr_at_5_std value: -10.443
- type: nauc_mrr_at_5_diff1 value: 31.8118
- type: nauc_mrr_at_10_max value: 31.4955
- type: nauc_mrr_at_10_std value: -11.698
- type: nauc_mrr_at_10_diff1 value: 30.974400000000003
- type: nauc_mrr_at_20_max value: 31.4955
- type: nauc_mrr_at_20_std value: -11.698
- type: nauc_mrr_at_20_diff1 value: 30.974400000000003
- type: nauc_mrr_at_100_max value: 31.4955
- type: nauc_mrr_at_100_std value: -11.698
- type: nauc_mrr_at_100_diff1 value: 30.974400000000003
- type: nauc_mrr_at_1000_max value: 31.4955
- type: nauc_mrr_at_1000_std value: -11.698
- type: nauc_mrr_at_1000_diff1 value: 30.974400000000003
- type: main_score value: 47.693999999999996 task: type: Retrieval
- dataset:
config: default
name: MTEB ToxicConversationsClassification (default)
revision: edfaf9da55d3dd50d43143d90c1ac476895ae6de
split: test
type: mteb/toxic_conversations_50k
metrics:
- type: accuracy value: 65.65429999999999
- type: f1 value: 50.530699999999996
- type: f1_weighted value: 73.3205
- type: ap value: 12.0938
- type: ap_weighted value: 12.0938
- type: main_score value: 65.65429999999999 task: type: Classification
- dataset:
config: default
name: MTEB TweetSentimentExtractionClassification (default)
revision: d604517c81ca91fe16a244d1248fc021f9ecee7a
split: test
type: mteb/tweet_sentiment_extraction
metrics:
- type: accuracy value: 61.7119
- type: f1 value: 61.8672
- type: f1_weighted value: 60.762499999999996
- type: main_score value: 61.7119 task: type: Classification
- dataset:
config: default
name: MTEB TwentyNewsgroupsClustering.v2 (default)
revision: 6125ec4e24fa026cec8a478383ee943acfbd5449
split: test
type: mteb/twentynewsgroups-clustering
metrics:
- type: v_measure value: 37.4338
- type: v_measure_std value: 1.5165
- type: main_score value: 37.4338 task: type: Clustering
- dataset:
config: default
name: MTEB TwitterSemEval2015 (default)
revision: 70970daeab8776df92f5ea462b6173c0b46fd2d1
split: test
type: mteb/twittersemeval2015-pairclassification
metrics:
- type: similarity_accuracy value: 82.8873
- type: similarity_accuracy_threshold value: 67.9403
- type: similarity_f1 value: 60.3641
- type: similarity_f1_threshold value: 60.5738
- type: similarity_precision value: 55.887600000000006
- type: similarity_recall value: 65.62010000000001
- type: similarity_ap value: 63.522
- type: cosine_accuracy value: 82.8873
- type: cosine_accuracy_threshold value: 67.9403
- type: cosine_f1 value: 60.3641
- type: cosine_f1_threshold value: 60.5738
- type: cosine_precision value: 55.887600000000006
- type: cosine_recall value: 65.62010000000001
- type: cosine_ap value: 63.522
- type: manhattan_accuracy value: 82.8098
- type: manhattan_accuracy_threshold value: 1739.439
- type: manhattan_f1 value: 60.1751
- type: manhattan_f1_threshold value: 1961.5566000000001
- type: manhattan_precision value: 54.5474
- type: manhattan_recall value: 67.0976
- type: manhattan_ap value: 63.42100000000001
- type: euclidean_accuracy value: 82.8873
- type: euclidean_accuracy_threshold value: 80.07459999999999
- type: euclidean_f1 value: 60.3641
- type: euclidean_f1_threshold value: 88.7989
- type: euclidean_precision value: 55.887600000000006
- type: euclidean_recall value: 65.62010000000001
- type: euclidean_ap value: 63.522
- type: dot_accuracy value: 82.8873
- type: dot_accuracy_threshold value: 67.9403
- type: dot_f1 value: 60.3641
- type: dot_f1_threshold value: 60.5738
- type: dot_precision value: 55.887600000000006
- type: dot_recall value: 65.62010000000001
- type: dot_ap value: 63.522
- type: max_accuracy value: 82.8873
- type: max_f1 value: 60.3641
- type: max_precision value: 55.887600000000006
- type: max_recall value: 67.0976
- type: max_ap value: 63.522
- type: main_score value: 63.522 task: type: PairClassification
- dataset:
config: default
name: MTEB TwitterURLCorpus (default)
revision: 8b6510b0b1fa4e4c4f879467980e9be563ec1cdf
split: test
type: mteb/twitterurlcorpus-pairclassification
metrics:
- type: similarity_accuracy value: 88.7337
- type: similarity_accuracy_threshold value: 62.43729999999999
- type: similarity_f1 value: 77.8938
- type: similarity_f1_threshold value: 59.013400000000004
- type: similarity_precision value: 74.31309999999999
- type: similarity_recall value: 81.83709999999999
- type: similarity_ap value: 85.1691
- type: cosine_accuracy value: 88.7337
- type: cosine_accuracy_threshold value: 62.43729999999999
- type: cosine_f1 value: 77.8938
- type: cosine_f1_threshold value: 59.013400000000004
- type: cosine_precision value: 74.31309999999999
- type: cosine_recall value: 81.83709999999999
- type: cosine_ap value: 85.1691
- type: manhattan_accuracy value: 88.689
- type: manhattan_accuracy_threshold value: 1888.1997999999999
- type: manhattan_f1 value: 77.8453
- type: manhattan_f1_threshold value: 1974.1371000000001
- type: manhattan_precision value: 74.6414
- type: manhattan_recall value: 81.3366
- type: manhattan_ap value: 85.0954
- type: euclidean_accuracy value: 88.7337
- type: euclidean_accuracy_threshold value: 86.6749
- type: euclidean_f1 value: 77.8938
- type: euclidean_f1_threshold value: 90.53909999999999
- type: euclidean_precision value: 74.31309999999999
- type: euclidean_recall value: 81.83709999999999
- type: euclidean_ap value: 85.1691
- type: dot_accuracy value: 88.7337
- type: dot_accuracy_threshold value: 62.43729999999999
- type: dot_f1 value: 77.8938
- type: dot_f1_threshold value: 59.013400000000004
- type: dot_precision value: 74.31309999999999
- type: dot_recall value: 81.83709999999999
- type: dot_ap value: 85.1691
- type: max_accuracy value: 88.7337
- type: max_f1 value: 77.8938
- type: max_precision value: 74.6414
- type: max_recall value: 81.83709999999999
- type: max_ap value: 85.1691
- type: main_score value: 85.1691 task: type: PairClassification license: apache-2.0
RetrievaEmbedding-01: AMBER
AMBER (Adaptive Multitask Bilingual Embedding Representations) is a text embedding model trained by Retrieva, Inc. The model is designed primarily for Japanese, but it also supports English. We trained it on a variety of Japanese and English datasets.
This model has 315M parameters (large size).
Model Details
Model Description
The AMBER model is a text embedding model based on the sbintuitions/modernbert-ja-310m architecture and designed for Japanese text. The training data consisted mainly of Japanese datasets but also included English datasets, so the model can be used for English text as well. During training, natural-language prompts (instructions) were included, which allows the model to generate embeddings tailored to specific tasks.
- Developed by: Retrieva, Inc.
- Model type: Based on the ModernBERT Architecture.
- Language(s) (NLP): Primarily Japanese (optional support for English).
- License: Apache 2.0
- Finetuned from model: sbintuitions/modernbert-ja-310m
- Model Type: Sentence Transformer
- Maximum Sequence Length: 512 tokens
- Output Dimensionality: 768 dimensions
- Similarity Function: Cosine Similarity
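Since cosine similarity is listed as the similarity function, a minimal pure-Python sketch of that computation may help when interpreting scores. The helper name `cosine_similarity` and the toy 3-dimensional vectors are illustrative assumptions, not part of the model or its library.

```python
import math


def cosine_similarity(u, v):
    """Cosine of the angle between two equal-length vectors."""
    if len(u) != len(v):
        raise ValueError("vectors must have the same length")
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0.0 or norm_v == 0.0:
        raise ValueError("cosine similarity is undefined for a zero vector")
    return dot / (norm_u * norm_v)


# Toy vectors for illustration only; real embeddings come from model.encode.
query = [1.0, 0.0, 1.0]
doc_same_direction = [2.0, 0.0, 2.0]
doc_orthogonal = [0.0, 3.0, 0.0]

print(cosine_similarity(query, doc_same_direction))
print(cosine_similarity(query, doc_orthogonal))
```

Because the score depends only on direction, scaling a vector does not change it; vectors pointing the same way score close to 1 and orthogonal vectors score 0.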
Uses
How to Get Started with the Model
Install Library
First, install the required Python libraries using pip:
pip install sentence-transformers sentencepiece
Run Inference
Then you can load this model and run inference.
You can specify the prompt at inference time by passing the prompt argument to model.encode (or prompt_name, as in the example below, to select a prompt predefined in the model configuration).
The prompts used in the Japanese benchmark are described in jmteb/tasks, and the prompts used in the English benchmark are described in mteb/models/retrieva_en.py.
from sentence_transformers import SentenceTransformer
# Download from the 🤗 Hub
model = SentenceTransformer("retrieva-jp/amber-large")
# Run inference
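# Queries: "What is natural language processing?" and "Tell me about Retrieva, Inc."
queries = [
    "自然言語処理とはなんですか?",
    "株式会社レトリバについて教えて",
]
# Documents: a definition of NLP and a description of Retrieva, Inc.
documents = [
    "自然言語処理(しぜんげんごしょり、英語: Natural language processing、略称:NLP)は、人間が日常的に使っている自然言語をコンピュータに処理させる一連の技術であり、人工知能と言語学の一分野である。",
    "株式会社レトリバは、自然言語処理と機械学習を核としたAI技術で組織の課題解決を支援するテクノロジー企業である。",
]
# Encode queries and documents with their task-specific prompts
queries_embeddings = model.encode(queries, prompt_name="Retrieval-query")
documents_embeddings = model.encode(documents, prompt_name="Retrieval-passage")
# Pairwise similarity matrix of shape (number of queries, number of documents)
similarities = model.similarity(queries_embeddings, documents_embeddings)
print(similarities.shape)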
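The similarity matrix from the example has one row per query and one column per document. A minimal sketch of turning such a matrix into a per-query ranking is shown here. It assumes the scores have been converted to nested Python lists (for example with `similarities.tolist()`); the function name `rank_documents` and the numbers are made up for illustration and are not actual model output.

```python
def rank_documents(similarity_rows, top_k=1):
    """For each query row, return the indices of the top_k highest-scoring documents."""
    rankings = []
    for row in similarity_rows:
        # Sort document indices by descending similarity score.
        order = sorted(range(len(row)), key=lambda j: row[j], reverse=True)
        rankings.append(order[:top_k])
    return rankings


# Made-up scores standing in for similarities.tolist(); not real model output.
example_scores = [
    [0.82, 0.31],  # query 0 against documents 0 and 1
    [0.27, 0.76],  # query 1 against documents 0 and 1
]

best = rank_documents(example_scores, top_k=1)
print(best)  # [[0], [1]]
```

For large document collections a vector index would normally replace the full sort, but the ranking logic is the same.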
Training Details
Training Data
We used multiple datasets to train this model. For Japanese, we selected datasets from llm-jp-eval, llm-japanese-dataset, and hpprc/emb. For English, we mainly used a subset of the datasets used in Asai et al. (2023), along with parts of the English datasets in the sentence-transformers repository and kilt-tasks. To support cross-lingual use between Japanese and English, we also used Japanese-English translation datasets.
For Japanese, we additionally used synthetic data generated by an LLM to obtain a sufficient amount of training data.
Evaluation
We evaluated the model on the following benchmarks:
- Japanese Benchmark: JMTEB
- Japanese Retrieval Tasks: JQaRA, JaCWIR, MLDR Japanese Subset
- English Benchmark: MTEB(eng, v2).
All scores in the tables below were computed by us unless otherwise noted.
Japanese Benchmark: JMTEB
Note that the Mean (TaskType) in the following leaderboard is the same as the Avg. in the original JMTEB leaderboard.
The files used for evaluation are stored in the jmteb directory.
Model | # Parameters | Mean (TaskType) | Mean (Task) | Retrieval | STS | Classification | Reranking | Clustering | PairClassification |
---|---|---|---|---|---|---|---|---|---|
base models | < 300M | | | | | | | | |
cl-nagoya/ruri-base | 111M | 72.60 | 71.56 | 69.53 | 82.87 | 75.49 | 92.91 | 52.40 | 62.38 |
AMBER-base | 130M | 72.12 | 72.12 | 73.40 | 77.81 | 76.14 | 93.27 | 48.05 | 64.03 |
pkshatech/GLuCoSE-base-ja-v2 | 133M | 72.89 | 72.47 | 73.03 | 82.96 | 74.02 | 93.01 | 51.96 | 62.37 |
pkshatech/RoSEtta-base-ja | 190M | 72.49 | 72.05 | 73.14 | 81.39 | 72.37 | 92.69 | 53.60 | 61.74 |
intfloat/multilingual-e5-base | 278M | 71.11 | 69.72 | 69.45 | 80.45 | 69.86 | 92.90 | 51.62 | 62.35 |
large models | > 300M | | | | | | | | |
AMBER-large (this model) | 315M | 72.52 | 73.22 | 75.40 | 79.32 | 77.14 | 93.54 | 48.73 | 60.97 |
cl-nagoya/ruri-large | 337M | 73.20 | 73.06 | 72.86 | 83.14 | 77.15 | 93.00 | 50.78 | 62.29 |
intfloat/multilingual-e5-large | 560M | 72.06 | 71.29 | 71.71 | 80.87 | 72.45 | 93.29 | 51.59 | 62.42 |
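As a quick sanity check on how the summary column relates to the others, the sketch below assumes that Mean (TaskType) is the unweighted mean of the six task-type scores. This is our reading of the column name, and the AMBER-large row is consistent with it. Mean (Task) presumably averages over individual tasks instead; the per-task scores are not listed in this table, so it is not recomputed here. The variable and function names are illustrative.

```python
# Task-type scores copied from the AMBER-large row of the JMTEB table.
amber_large_jmteb = {
    "Retrieval": 75.40,
    "STS": 79.32,
    "Classification": 77.14,
    "Reranking": 93.54,
    "Clustering": 48.73,
    "PairClassification": 60.97,
}


def mean_over_task_types(scores):
    """Unweighted mean over task types (assumed definition of Mean (TaskType))."""
    return sum(scores.values()) / len(scores)


mean_task_type = mean_over_task_types(amber_large_jmteb)
print(round(mean_task_type, 2))  # 72.52
```

Under this definition every task type counts equally, regardless of how many individual tasks it contains.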
Japanese Retrieval Tasks: JQaRA, JaCWIR, MLDR Japanese Subset
The files used for MLDR are stored in the mldr directory.
The prompts used in JQaRA and JaCWIR are Retrieval-query and Retrieval-passage, described in config_sentence_transformers.json.
Model | # Parameters | JQaRA (nDCG@10) | JaCWIR (MAP@10) | MLDR Japanese Subset (nDCG@10) |
---|---|---|---|---|
base models | < 300M | | | |
cl-nagoya/ruri-base | 111M | 58.4 | 83.3 | 32.77 |
AMBER-base | 130M | 57.1 | 81.6 | 35.69 |
pkshatech/GLuCoSE-base-ja-v2 | 133M | 60.6 | 85.3 | 33.99 |
intfloat/multilingual-e5-base | 278M | 47.1 | 85.3 | 25.46 |
large models | > 300M | | | |
AMBER-large (this model) | 315M | 62.5 | 82.4 | 34.57 |
cl-nagoya/ruri-large | 337M | 62.8 | 82.5 | 34.78 |
intfloat/multilingual-e5-large | 560M | 55.4 | 87.3 | 29.95 |
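For readers unfamiliar with the nDCG@10 metric reported for JQaRA and MLDR, here is a minimal sketch of the textbook definition with linear gain. It is a simplification under stated assumptions: the ideal ranking is built only from the labels of the retrieved list, whereas benchmark implementations typically use all judged documents for the query, and the scores in the table appear to be scaled by 100. The function names and relevance labels are hypothetical.

```python
import math


def dcg_at_k(relevances, k):
    """Discounted cumulative gain over the first k relevance labels (linear gain)."""
    return sum(rel / math.log2(rank + 2) for rank, rel in enumerate(relevances[:k]))


def ndcg_at_k(ranked_relevances, k=10):
    """nDCG@k: DCG of the given ranking divided by the DCG of the ideal ordering."""
    # Simplification: the ideal ordering uses only the labels in this list.
    ideal = sorted(ranked_relevances, reverse=True)
    ideal_dcg = dcg_at_k(ideal, k)
    if ideal_dcg == 0.0:
        return 0.0
    return dcg_at_k(ranked_relevances, k) / ideal_dcg


# Hypothetical relevance labels in ranked order (1 = relevant, 0 = not relevant).
perfect = [1, 1, 0, 0]
swapped = [0, 1, 1, 0]

print(ndcg_at_k(perfect))
print(ndcg_at_k(swapped))
```

A ranking that places all relevant documents first scores 1.0, and pushing relevant documents further down lowers the score. MAP@10, used for JaCWIR, is a different metric based on precision at the positions of relevant documents.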
English Benchmark: MTEB(eng, v2)
The files used for evaluation are stored in the mteb directory.
Model | # Parameters | Mean (TaskType) | Mean (Task) | Retrieval | STS | Classification | Reranking | Clustering | PairClassification | Summarization |
---|---|---|---|---|---|---|---|---|---|---|
base models | < 300M | | | | | | | | | |
AMBER-base | 130M | 54.75 | 58.20 | 40.11 | 81.29 | 70.39 | 42.98 | 42.27 | 80.12 | 26.08 |
intfloat/multilingual-e5-base | 278M | 56.21 | 59.75 | 43.22 | 80.50 | 73.84 | 43.87 | 42.19 | 83.74 | 26.10 |
large models | > 300M | | | | | | | | | |
AMBER-large (this model) | 315M | 56.08 | 59.13 | 41.04 | 81.52 | 72.23 | 43.83 | 42.71 | 81.00 | 30.21 |
intfloat/multilingual-e5-large | 560M | 57.06 | 60.84 | 46.17 | 81.11 | 74.88 | 44.31 | 41.91 | 84.33 | 26.67 |
Citation
BibTeX:
@inproceedings{amber2025,
  title = {インストラクションと複数タスクを利用した日本語向け分散表現モデルの構築},
  author = {勝又智 and 木村大翼 and 西鳥羽二郎},
  booktitle = {言語処理学会第31回年次大会発表論文集},
  year = {2025},
}
More Information
https://note.com/retrieva/n/n4ee9d304f44d (in Japanese)
Model Card Authors
Satoru Katsumata, Daisuke Kimura, Jiro Nishitoba
Model Card Contact
pr[at]retrieva.jp







