deberta-v3-large-zeroshot-v1 open-source model: a practical, efficient solution for zero-shot multi-class classification

Deberta V3 Large Zeroshot V1

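Developed by MoritzLaurer

A DeBERTa-v3 model built specifically for zero-shot classification that performs well across a range of classification tasks

Text classification

Transformers

Language: English | License: MIT | Tags: #zero-shot-classification #multi-task-fine-tuning #NLI-reformulation

Downloads: 10.72k

Released: 10/3/2023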

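Model overview

The model performs zero-shot text classification: it uses natural language inference (NLI) to judge whether a text is associated with a given label

Model features

Zero-shot classification

Classifies text into new categories without task-specific training

Multi-task training

Trained on a mixed dataset covering 27 tasks and 310 classes

Universal task format

Recasts classification as natural language inference (NLI): the model judges whether the text entails the label

Model capabilities

Text classification

Zero-shot learning

Multi-label classification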

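Use cases

Sentiment analysis

Review sentiment classification

Classify product reviews as positive or negative

Performs well on datasets such as AmazonPolarity

Content moderation

Harmful content detection

Identify hate speech, insults, and similar content in text

Trained on datasets such as WikiToxic

Topic classification

News classification

Assign news articles to topics

Trained on datasets such as AGNews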

🚀 deberta-v3-large-zeroshot-v1

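The model is designed for zero-shot classification with the Hugging Face pipeline and is reported to be substantially better at zero-shot classification than the author's other zero-shot models on the Hugging Face hub. It performs one universal task: given a text, decide whether a hypothesis is `true` or `not_true` (also called `entailment` vs. `not_entailment`).

🚀 Quick start

Simple zero-shot classification pipeline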

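from transformers import pipeline

# Load the zero-shot classification pipeline with this model
classifier = pipeline("zero-shot-classification", model="MoritzLaurer/deberta-v3-large-zeroshot-v1")
sequence_to_classify = "Angela Merkel is a politician in Germany and leader of the CDU"
candidate_labels = ["politics", "economy", "entertainment", "environment"]
# multi_label=False: the labels compete and their scores sum to 1
output = classifier(sequence_to_classify, candidate_labels, multi_label=False)
# output is a dict with the keys "sequence", "labels", and "scores"
print(output)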
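
The same pipeline call accepts two further arguments that are often useful: `multi_label=True`, which scores each label independently, and `hypothesis_template`, which sets the sentence each label is inserted into. The sketch below wraps the call in a hypothetical helper named `classify`. The helper name, the template wording, and the optional `classifier` argument are illustrative assumptions, not part of the model card.

```python
MODEL_NAME = "MoritzLaurer/deberta-v3-large-zeroshot-v1"


def classify(text, labels, multi_label=False,
             template="This example is about {}.", classifier=None):
    """Run zero-shot classification and return a {label: score} dict.

    `classifier` is a hypothetical hook: pass an existing pipeline to reuse it,
    or leave it as None to build one on the fly.
    """
    if classifier is None:
        # Imported lazily; building the pipeline downloads the model weights.
        from transformers import pipeline
        classifier = pipeline("zero-shot-classification", model=MODEL_NAME)
    result = classifier(
        text,
        labels,
        hypothesis_template=template,  # each label replaces the "{}" placeholder
        multi_label=multi_label,       # True: every label is scored on its own
    )
    # The pipeline returns parallel "labels" and "scores" lists.
    return dict(zip(result["labels"], result["scores"]))
```

Calling `classify(text, labels, multi_label=True)` would then return one independent score per label. Creating the pipeline once and passing it in as `classifier` avoids reloading the model on every call.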

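✨ Key features

Designed for zero-shot classification and usable through the Hugging Face pipeline.
Reported to outperform the author's other zero-shot models on the Hugging Face hub at zero-shot classification.
Performs one universal task: any classification task can be recast as deciding whether a hypothesis is `true` or `not_true`.

📦 Installation

The source gives no installation steps; see the Hugging Face Transformers documentation for how to install the library.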

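📚 Detailed documentation

Model description

The model is designed for zero-shot classification with the Hugging Face pipeline. It is reported to be substantially better at zero-shot classification than the author's other zero-shot models on the Hugging Face hub (https://huggingface.co/MoritzLaurer).

The model performs one universal task: given a text (the premise), decide whether a hypothesis is `true` or `not_true` (also called `entailment` vs. `not_entailment`). This task format is based on natural language inference (NLI). The task is universal in the sense that any classification task can be reformulated into it.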
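
To make the reformulation concrete, the sketch below turns a text and a set of candidate labels into premise-hypothesis pairs, one per label. The helper name `build_nli_pairs` and the template wording are assumptions chosen for illustration; any template with a `{}` placeholder would work the same way.

```python
def build_nli_pairs(text, labels, template="This example is about {}."):
    """Return one (premise, hypothesis) pair per candidate label."""
    if "{}" not in template:
        raise ValueError("template must contain a '{}' placeholder")
    # The text is the premise; each label is verbalised as a hypothesis.
    return [(text, template.format(label)) for label in labels]


pairs = build_nli_pairs(
    "Angela Merkel is a politician in Germany and leader of the CDU",
    ["politics", "economy"],
)
for premise, hypothesis in pairs:
    print(f"premise: {premise!r} | hypothesis: {hypothesis!r}")
```

The model then scores each pair separately, and the label whose hypothesis is most strongly entailed is taken as the prediction.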

Training data

The model was trained on a mixture of 27 tasks and 310 classes that were reformatted into the universal format.

26 classification tasks, about 400k texts: 'amazonpolarity', 'imdb', 'appreviews', 'yelpreviews', 'rottentomatoes', 'emotiondair', 'emocontext', 'empathetic', 'financialphrasebank', 'banking77', 'massive', 'wikitoxic_toxicaggregated', 'wikitoxic_obscene', 'wikitoxic_threat', 'wikitoxic_insult', 'wikitoxic_identityhate', 'hateoffensive', 'hatexplain', 'biasframes_offensive', 'biasframes_sex', 'biasframes_intent', 'agnews', 'yahootopics', 'trueteacher', 'spam', 'wellformedquery'. Details on each dataset: https://docs.google.com/spreadsheets/d/1Z18tMh02IiWgh6o8pfoMiI_LH4IXpr78wd_nmNd5FaE/edit?usp=sharing
Five NLI datasets, about 885k texts: "mnli", "anli", "fever", "wanli", "ling"

Note that, unlike other NLI models, this model predicts two classes (`entailment` vs. `not_entailment`) rather than three (entailment/neutral/contradiction).
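
With only two logits per premise-hypothesis pair, label scores can be derived in two ways. The sketch below is a simplified rendering of what the Transformers zero-shot pipeline is understood to do: in single-label mode, a softmax across the entailment logits of all labels; in multi-label mode, a softmax over each pair's own two logits. The logit values are made up, and the assumption that index 0 is `entailment` should be checked against `model.config.id2label` before relying on it.

```python
import math


def softmax(values):
    """Numerically stable softmax over a list of floats."""
    peak = max(values)
    exps = [math.exp(v - peak) for v in values]
    total = sum(exps)
    return [e / total for e in exps]


def single_label_scores(logits, entail_idx=0):
    """Labels compete: softmax over the entailment logits of all labels."""
    return softmax([pair[entail_idx] for pair in logits])


def multi_label_scores(logits, entail_idx=0):
    """Labels are independent: softmax within each pair's two logits."""
    return [softmax(pair)[entail_idx] for pair in logits]


# Made-up [entailment, not_entailment] logits, one pair per candidate label.
labels = ["politics", "economy", "entertainment"]
logits = [[3.0, -2.0], [-1.0, 1.5], [-2.5, 2.0]]

print(dict(zip(labels, single_label_scores(logits))))
print(dict(zip(labels, multi_label_scores(logits))))
```

Single-label scores sum to 1 across the labels, while multi-label scores are each an independent probability between 0 and 1.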

Data and training details

The code for preparing the data and for training and evaluating the model is fully open source: https://github.com/MoritzLaurer/zeroshot-classifier/tree/main

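🔧 Technical details

The model is based on the natural language inference (NLI) task, which lets any classification task be recast as deciding whether a hypothesis is `true` or `not_true`. It was trained on a mixture of 27 tasks and 310 classes that were reformatted into this universal format.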
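
One plausible way to reformat a labelled classification example into the universal format is sketched below: the hypothesis for the true class is labelled `entailment`, and the hypotheses for all other classes are labelled `not_entailment`. The helper name `to_nli_examples`, the class names, and the hypothesis wordings are hypothetical. The author's actual preprocessing is in the zeroshot-classifier repository and may differ, for example in how many negative hypotheses it keeps per text.

```python
def to_nli_examples(text, true_label, hypotheses):
    """Expand one labelled text into one NLI row per class hypothesis.

    `hypotheses` maps each class name to its verbalised hypothesis.
    """
    if true_label not in hypotheses:
        raise ValueError(f"unknown label: {true_label!r}")
    rows = []
    for label, hypothesis in hypotheses.items():
        rows.append({
            "premise": text,
            "hypothesis": hypothesis,
            # Only the hypothesis of the true class is entailed by the text.
            "label": "entailment" if label == true_label else "not_entailment",
        })
    return rows


# Hypothetical verbalisations for a binary sentiment task.
sentiment_hypotheses = {
    "positive": "The sentiment of this review is positive.",
    "negative": "The sentiment of this review is negative.",
}
rows = to_nli_examples(
    "Great phone, the battery lasts for days.", "positive", sentiment_hypotheses
)
for row in rows:
    print(row)
```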

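📄 License

The base model (DeBERTa-v3) is published under the MIT license. The datasets used for fine-tuning are published under a variety of licenses. The following spreadsheet gives an overview of the non-NLI datasets used for fine-tuning, including their licenses and underlying papers: https://docs.google.com/spreadsheets/d/1Z18tMh02IiWgh6o8pfoMiI_LH4IXpr78wd_nmNd5FaE/edit?usp=sharing

In addition, the model was trained on the following NLI datasets: MNLI, ANLI, WANLI, LING-NLI, FEVER-NLI.

Citation

If you use this model, please cite: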

@article{laurer_less_2023,
	title = {Less {Annotating}, {More} {Classifying}: {Addressing} the {Data} {Scarcity} {Issue} of {Supervised} {Machine} {Learning} with {Deep} {Transfer} {Learning} and {BERT}-{NLI}},
	issn = {1047-1987, 1476-4989},
	shorttitle = {Less {Annotating}, {More} {Classifying}},
	url = {https://www.cambridge.org/core/product/identifier/S1047198723000207/type/journal_article},
	doi = {10.1017/pan.2023.20},
	language = {en},
	urldate = {2023-06-20},
	journal = {Political Analysis},
	author = {Laurer, Moritz and Van Atteveldt, Wouter and Casas, Andreu and Welbers, Kasper},
	month = jun,
	year = {2023},
	pages = {1--33},
}