LdIR - Qwen2 - reranker - 1.5Bオープンソースモデル - 効率的に中国語の医療質問応答と一般的なテキストの再ランキングをサポート

ホーム

Ldir Qwen2 Reranker 1.5B

neofungによって開発

Qwen2-1.5Bに基づく下流タスクモデルで、再ランキングタスクに特化し、中国語医療質問応答と一般テキストの再ランキングタスクで優れた性能を発揮します。

テキスト埋め込み

Transformers

複数言語対応オープンソースライセンス:Apache-2.0 #中国語質問応答の再ランキング #医療情報検索 #15億パラメータ規模

ダウンロード数 51

リリース時間 : 8/13/2024

モデル概要

このモデルはQwen2-1.5Bをベースに開発された再ランキングモデルで、主に検索システムの関連性ランキングの効果を向上させるために使用され、特に中国語医療質問応答シナリオでの性能が最適化されています。

モデル特徴

中国語医療質問応答の最適化

CMedQA医療質問応答データセットで優れた性能を発揮し、MAP指標が86.5以上に達します。

マルチタスク適合

一般テキストや医療分野を含むさまざまな再ランキングタスクをサポートします。

効率的な推論

FP16加速とマルチGPU並列計算をサポートします。

モデル能力

テキスト関連性の再ランキング

医療質問応答の最適化

異言語間の再ランキング

使用事例

情報検索

医療質問応答システム

医療質問応答システムにおける回答のランキング品質を向上させます。

CMedQAv1データセットでMRRが88.91に達します。

検索エンジンの最適化

検索エンジンの結果の関連性ランキングを改善します。

MMarcoデータセットでMAPが39.35に達します。

🚀 LdIR-Qwen2-reranker-1.5B

このモデルは、質問応答の再ランキングタスクに特化したモデルです。Qwen2-1.5Bをベースに構築され、C-MTEBベンチマークで高い性能を発揮します。

🚀 クイックスタート

このモデルを使用する前に、必要な依存関係をインストールする必要があります。以下のコマンドを使用して、依存関係をインストールします。

transformers==4.41.2
flash-attn==2.5.7

✨ 主な機能

Qwen2-1.5Bベース: Qwen/Qwen2-1.5B を事前学習モデルとして使用しています。
再ランキングタスク: FlagEmbedding reranker のアプローチを活用し、質問応答の再ランキングタスクに特化しています。

📦 インストール

必要な依存関係をインストールするには、以下のコマンドを実行します。

transformers==4.41.2
flash-attn==2.5.7

💻 使用例

基本的な使用法

from typing import cast, List, Union, Tuple, Dict, Optional
import numpy as np
import torch
from tqdm import tqdm
import transformers
from transformers import AutoTokenizer, PreTrainedModel, PreTrainedTokenizer, DataCollatorWithPadding
from transformers.models.qwen2 import Qwen2Config, Qwen2ForSequenceClassification
from transformers.trainer_pt_utils import LabelSmoother
IGNORE_TOKEN_ID = LabelSmoother.ignore_index

def preprocess(
    sources,
    tokenizer: transformers.PreTrainedTokenizer,
    max_len: int = 1024,
) -> Dict:

    # Apply prompt templates
    input_ids, attention_masks = [], []
    for i, source in enumerate(sources):
        messages = [
            {"role": "user",
            "content": "\n\n".join(source)}
        ]
        text = tokenizer.apply_chat_template(messages, tokenize=False, add_generation_prompt=True)
        model_inputs = tokenizer([text])
        input_id = model_inputs['input_ids'][0]
        attention_mask = model_inputs['attention_mask'][0]
        if len(input_id) > max_len:
            ## last five tokens: <|im_end|>(151645), \n(198), <|im_start|>(151644), assistant(77091), \n(198)
            diff = len(input_id) - max_len
            input_id = input_id[:-5-diff] + input_id[-5:]
            attention_mask = attention_mask[:-5-diff] + attention_mask[-5:]
            assert len(input_id) == max_len
        input_ids.append(input_id)
        attention_masks.append(attention_mask)

    return dict(
        input_ids=input_ids,
        attention_mask=attention_masks
    )

class FlagRerankerCustom:
    def __init__(
            self,
            model: PreTrainedModel,
            tokenizer: PreTrainedTokenizer,
            use_fp16: bool = False
    ) -> None:
        self.tokenizer = tokenizer
        self.model = model
        self.data_collator = DataCollatorWithPadding(tokenizer=tokenizer)

        if torch.cuda.is_available():
            self.device = torch.device('cuda')
        elif torch.backends.mps.is_available():
            self.device = torch.device('mps')
        else:
            self.device = torch.device('cpu')
            use_fp16 = False
        if use_fp16:
            self.model.half()

        self.model = self.model.to(self.device)

        self.model.eval()

        self.num_gpus = torch.cuda.device_count()
        if self.num_gpus > 1:
            print(f"----------using {self.num_gpus}*GPUs----------")
            self.model = torch.nn.DataParallel(self.model)

    @torch.no_grad()
    def compute_score(self, sentence_pairs: Union[List[Tuple[str, str]], Tuple[str, str]], batch_size: int = 64,
                      max_length: int = 1024) -> List[float]:
        
        if self.num_gpus > 0:
            batch_size = batch_size * self.num_gpus

        assert isinstance(sentence_pairs, list)
        if isinstance(sentence_pairs[0], str):
            sentence_pairs = [sentence_pairs]

        all_scores = []
        for start_index in tqdm(range(0, len(sentence_pairs), batch_size), desc="Compute Scores",
                                disable=True):
            sentences_batch = sentence_pairs[start_index:start_index + batch_size]
            inputs = preprocess(sources=sentences_batch, tokenizer=self.tokenizer, max_len=max_length)
            inputs = [dict(zip(inputs, t)) for t in zip(*inputs.values())]
            inputs = self.data_collator(inputs).to(self.device)
            scores = self.model(**inputs, return_dict=True).logits
            scores = scores.squeeze()
            all_scores.extend(scores.detach().to(torch.float).cpu().numpy().tolist())

        if len(all_scores) == 1:
            return all_scores[0]
        return all_scores

tokenizer = transformers.AutoTokenizer.from_pretrained(
    "neofung/LdIR-Qwen2-reranker-1.5B",
    padding_side="right",
)

config = Qwen2Config.from_pretrained(
    "neofung/LdIR-Qwen2-reranker-1.5B",
    trust_remote_code=True,
    bf16=True,
)

model = Qwen2ForSequenceClassification.from_pretrained(
    "neofung/LdIR-Qwen2-reranker-1.5B",
    config = config,
    trust_remote_code = True,
)

model = FlagRerankerCustom(model=model, tokenizer=tokenizer, use_fp16=False)

pairs = [['what is panda?', 'hi'], ['what is panda?', 'The giant panda (Ailuropoda melanoleuca), sometimes called a panda bear or simply panda, is a bear species endemic to China.']]

model.compute_score(pairs)

# [-2.655318021774292, 11.7670316696167]

高度な使用法

from C_MTEB.tasks import *
from mteb import MTEB

save_name = "LdIR-Qwen2-reranker-1.5B"

evaluation = MTEB(
    task_types=["Reranking"], task_langs=['zh', 'zh2en', 'en2zh']
    )

evaluation.run(model, output_folder=f"reranker_results/{save_name}")

📚 ドキュメント

モデルの評価

以下は、C-MTEBベンチマークでの評価結果です。

タスク	データセット	MAP	MRR
Reranking	C-MTEB/CMedQAv1-reranking	86.50438688414654	88.91170634920635
Reranking	C-MTEB/CMedQAv2-reranking	87.10592353383732	89.10178571428571
Reranking	C-MTEB/Mmarco-reranking	39.354813242907133	39.075793650793655
Reranking	C-MTEB/T2Reranking	68.83696915006163	79.77644651857584