e5-base-v2-mnli-anli開源模型 - 免費部署助力零樣本分類與自然語言推理

首頁

E5 Base V2 Mnli Anli

由mjwong開發

該模型是基於intfloat/e5-base-v2在GLUE（MNLI）和ANLI數據集上微調的版本，適用於零樣本分類和自然語言推理任務。

文本分類

Transformers

英語開源協議:MIT #零樣本分類 #自然語言推理 #多輪對話理解

下載量 6,598

發布時間 : 7/23/2023

模型概述

通過弱監督對比預訓練生成的文本嵌入模型，主要用於自然語言推理和零樣本分類任務。

模型特點

零樣本分類能力

支持無需特定任務訓練即可對文本進行分類

自然語言推理

能夠判斷兩個句子之間的邏輯關係（蘊含/中立/矛盾）

多數據集微調

在GLUE(MNLI)和ANLI數據集上進行微調，提升推理能力

模型能力

文本分類

自然語言推理

零樣本學習

使用案例

文本分析

情感分類

無需訓練即可對文本情感進行分類

主題分類

識別文本所屬的主題類別

邏輯推理

文本一致性判斷

判斷兩個句子之間的邏輯關係

在MNLI和ANLI數據集上表現良好

🚀 e5-base-v2-mnli-anli

本模型是 intfloat/e5-base-v2 在 glue (mnli) 和 anli 數據集上的微調版本。它可用於零樣本分類任務，為文本分類提供了高效且準確的解決方案。

✨ 主要特性

基於 Text Embeddings by Weakly-Supervised Contrastive Pre-training 論文的研究成果。
作者包括 Liang Wang、Nan Yang、Xiaolong Huang、Binxing Jiao、Linjun Yang、Daxin Jiang、Rangan Majumder、Furu Wei ，於 arXiv 2022 發佈。

📦 安裝指南

文檔未提及具體安裝步驟，可參考 transformers 庫的官方安裝說明進行安裝。

💻 使用示例

基礎用法

使用 zero-shot-classification 管道加載模型：

from transformers import pipeline
classifier = pipeline("zero-shot-classification",
                      model="mjwong/e5-base-v2-mnli-anli")

使用該管道將序列分類到指定的類別名稱中：

sequence_to_classify = "one day I will see the world"
candidate_labels = ['travel', 'cooking', 'dancing']
classifier(sequence_to_classify, candidate_labels)

如果有多個候選標籤可能正確，可傳遞 multi_class=True 獨立計算每個類別：

candidate_labels = ['travel', 'cooking', 'dancing', 'exploration']
classifier(sequence_to_classify, candidate_labels, multi_class=True)

高級用法

將模型應用於 NLI 任務：

import torch
from transformers import AutoTokenizer, AutoModelForSequenceClassification

# device = "cuda:0" or "cpu"
device = torch.device("cuda") if torch.cuda.is_available() else torch.device("cpu")

model_name = "mjwong/e5-base-v2-mnli-anli"
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForSequenceClassification.from_pretrained(model_name)

premise = "But I thought you'd sworn off coffee."
hypothesis = "I thought that you vowed to drink more coffee."

input = tokenizer(premise, hypothesis, truncation=True, return_tensors="pt")
output = model(input["input_ids"].to(device))
prediction = torch.softmax(output["logits"][0], -1).tolist()
label_names = ["entailment", "neutral", "contradiction"]
prediction = {name: round(float(pred) * 100, 2) for pred, name in zip(prediction, label_names)}
print(prediction)

📚 詳細文檔

評估結果

模型使用 MultiNLI 的開發集和 ANLI 的測試集進行評估，使用的指標是準確率。

數據集	mnli_dev_m	mnli_dev_mm	anli_test_r1	anli_test_r2	anli_test_r3
e5-base-v2-mnli-anli	0.812	0.809	0.557	0.460	0.448
e5-large-mnli	0.868	0.869	0.301	0.296	0.294
e5-large-mnli-anli	0.843	0.848	0.646	0.484	0.458
e5-large-v2-mnli	0.875	0.876	0.354	0.298	0.313
e5-large-v2-mnli-anli	0.846	0.848	0.638	0.474	0.479