bart-large-mnli开源零样本分类模型 - 免费部署实现快速文本类别判断

首页

Bart Large Mnli

由 facebook 开发

基于BART-large架构，在MultiNLI数据集上微调的零样本分类模型

大型语言模型开源协议:MIT #零样本分类 #多标签推理 #NLI微调

下载量 3.7M

发布时间 : 3/2/2022

模型简介

该模型是在MultiNLI数据集上微调的BART-large模型，专门用于零样本文本分类任务。通过自然语言推理(NLI)的方式，可将文本分类到任意自定义类别。

模型特点

零样本分类能力

无需微调即可将文本分类到任意自定义类别

基于NLI的灵活分类

通过构建假设语句实现开放式分类

多标签支持

可同时识别文本中的多个相关类别

模型能力

零样本文本分类

自然语言推理

多标签分类

使用案例

文本分类

新闻分类

将新闻自动分类到自定义主题类别

在示例中显示高达99%的准确率

内容审核

识别文本内容所属的敏感类别

🚀 bart-large-mnli

这是 bart-large 在 MultiNLI (MNLI) 数据集上训练后的检查点，可用于零样本分类任务。

🚀 快速开始

本模型是在 MultiNLI (MNLI) 数据集上对 bart-large 进行训练后得到的检查点。

关于此模型的更多信息：

✨ 主要特性

基于NLI的零样本文本分类

Yin 等人提出了一种将预训练的自然语言推理（NLI）模型用作现成的零样本序列分类器的方法。该方法的工作原理是将待分类的序列作为 NLI 的前提，并从每个候选标签构建一个假设。例如，如果我们想评估一个序列是否属于 “政治” 类别，我们可以构建一个假设 This text is about politics.。然后将蕴含和矛盾的概率转换为标签概率。

在许多情况下，这种方法出奇地有效，特别是与像 BART 和 Roberta 这样的大型预训练模型一起使用时。有关此方法和其他零样本方法的更详细介绍，请参阅此博客文章。

💻 使用示例

基础用法

使用零样本分类管道

可以使用 zero-shot-classification 管道加载模型：

from transformers import pipeline
classifier = pipeline("zero-shot-classification",
                      model="facebook/bart-large-mnli")

然后可以使用此管道将序列分类到你指定的任何类别名称中。

sequence_to_classify = "one day I will see the world"
candidate_labels = ['travel', 'cooking', 'dancing']
classifier(sequence_to_classify, candidate_labels)
#{'labels': ['travel', 'dancing', 'cooking'],
# 'scores': [0.9938651323318481, 0.0032737774308770895, 0.002861034357920289],
# 'sequence': 'one day I will see the world'}

如果多个候选标签可能正确，可以传递 multi_label=True 来独立计算每个类别的概率：

candidate_labels = ['travel', 'cooking', 'dancing', 'exploration']
classifier(sequence_to_classify, candidate_labels, multi_label=True)
#{'labels': ['travel', 'exploration', 'dancing', 'cooking'],
# 'scores': [0.9945111274719238,
#  0.9383890628814697,
#  0.0057061901316046715,
#  0.0018193122232332826],
# 'sequence': 'one day I will see the world'}

高级用法

使用手动 PyTorch 代码

# pose sequence as a NLI premise and label as a hypothesis
from transformers import AutoModelForSequenceClassification, AutoTokenizer
nli_model = AutoModelForSequenceClassification.from_pretrained('facebook/bart-large-mnli')
tokenizer = AutoTokenizer.from_pretrained('facebook/bart-large-mnli')

premise = sequence
hypothesis = f'This example is {label}.'

# run through model pre-trained on MNLI
x = tokenizer.encode(premise, hypothesis, return_tensors='pt',
                     truncation_strategy='only_first')
logits = nli_model(x.to(device))[0]

# we throw away "neutral" (dim 1) and take the probability of
# "entailment" (2) as the probability of the label being true 
entail_contradiction_logits = logits[:,[0,2]]
probs = entail_contradiction_logits.softmax(dim=1)
prob_label_is_true = probs[:,1]