deberta-v3-base-zeroshot-v1
A model for zero-shot classification with the Hugging Face pipeline, offering substantially better zero-shot performance than other zero-shot models on the Hugging Face hub.
🚀 Quick Start
The model is designed for zero-shot classification with the Hugging Face pipeline. It should perform substantially better at zero-shot classification than other zero-shot models on the Hugging Face hub: https://huggingface.co/MoritzLaurer.
✨ Features
- The model can handle one universal task: determining whether a hypothesis is `true` or `not_true` given a text (also known as `entailment` vs. `not_entailment`). This task format is based on the Natural Language Inference (NLI) task, and any classification task can be reformulated into it.
- Compared to other NLI models, it predicts two classes (`entailment` vs. `not_entailment`) instead of three (entailment/neutral/contradiction).
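The reformulation works by pairing the input text (the NLI premise) with one hypothesis per candidate label; the zero-shot pipeline builds these pairs internally from a hypothesis template. A minimal sketch of that step (the template string mirrors the pipeline's default, "This example is {}."):

```python
# Reformulate a classification task into NLI (premise, hypothesis) pairs.
# The template mirrors the Hugging Face zero-shot pipeline's default.
HYPOTHESIS_TEMPLATE = "This example is {}."

def to_nli_pairs(text, candidate_labels, template=HYPOTHESIS_TEMPLATE):
    """Return one (premise, hypothesis) pair per candidate label."""
    return [(text, template.format(label)) for label in candidate_labels]

pairs = to_nli_pairs(
    "Angela Merkel is a politician in Germany and leader of the CDU",
    ["politics", "economy"],
)
for premise, hypothesis in pairs:
    print(hypothesis)
# The model then scores each pair as entailment vs. not_entailment.
```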
📦 Installation
No specific installation steps are provided in the original document.
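As an assumption (not from the original card), the standard setup would be installing `transformers`; the DeBERTa-v3 tokenizer additionally depends on `sentencepiece`:

```shell
# Assumed setup: transformers >= 4.13 (see the usage tip below) plus
# sentencepiece, which the DeBERTa-v3 tokenizer depends on.
pip install "transformers>=4.13" sentencepiece
```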
💻 Usage Examples
Basic Usage
from transformers import pipeline

# Load the zero-shot classification pipeline with this model
classifier = pipeline("zero-shot-classification", model="MoritzLaurer/deberta-v3-base-zeroshot-v1")
sequence_to_classify = "Angela Merkel is a politician in Germany and leader of the CDU"
candidate_labels = ["politics", "economy", "entertainment", "environment"]
# multi_label=False: the scores across candidate labels sum to 1
output = classifier(sequence_to_classify, candidate_labels, multi_label=False)
print(output)
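The pipeline returns a dict with the input `sequence`, the candidate `labels` sorted by descending score, and the matching `scores`. A sketch of reading that output (the score values below are illustrative placeholders, not real model output):

```python
# Illustrative pipeline output: labels are sorted by descending score,
# and with multi_label=False the scores sum to 1. Values are placeholders.
output = {
    "sequence": "Angela Merkel is a politician in Germany and leader of the CDU",
    "labels": ["politics", "economy", "environment", "entertainment"],
    "scores": [0.97, 0.01, 0.01, 0.01],
}

# The highest-scoring label always comes first.
top_label, top_score = output["labels"][0], output["scores"][0]
print(f"{top_label}: {top_score:.2f}")
```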
📚 Documentation
Training data
The model was trained on a mixture of 27 tasks and 310 classes reformatted into a universal format:
- 26 classification tasks with ~400k texts:
- 'amazonpolarity', 'imdb', 'appreviews', 'yelpreviews', 'rottentomatoes',
- 'emotiondair', 'emocontext', 'empathetic',
- 'financialphrasebank', 'banking77', 'massive',
- 'wikitoxic_toxicaggregated', 'wikitoxic_obscene', 'wikitoxic_threat', 'wikitoxic_insult', 'wikitoxic_identityhate',
- 'hateoffensive', 'hatexplain', 'biasframes_offensive', 'biasframes_sex', 'biasframes_intent',
- 'agnews', 'yahootopics',
- 'trueteacher', 'spam', 'wellformedquery'.
- See details on each dataset here: https://docs.google.com/spreadsheets/d/1Z18tMh02IiWgh6o8pfoMiI_LH4IXpr78wd_nmNd5FaE/edit?usp=sharing
- Five NLI datasets with ~885k texts: "mnli", "anli", "fever", "wanli", "ling"
Details on data and training
The code for preparing the data and training & evaluating the model is fully open-source here: https://github.com/MoritzLaurer/zeroshot-classifier/tree/main
🔧 Technical Details
The model's task format is based on the Natural Language Inference (NLI) task. It can transform any classification task into the task of determining whether a hypothesis is `true` or `not_true` given a text.
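Under the hood, each (text, hypothesis) pair is run through the model and the two class logits are softmaxed; the probability of `entailment` becomes the label's score. A sketch of that scoring step with illustrative logits (the logit values, and treating index 0 as `entailment`, are assumptions for illustration; a real run would take logits from the fine-tuned model and check its `id2label` mapping):

```python
import math

def softmax(logits):
    """Convert raw logits into probabilities that sum to 1."""
    exps = [math.exp(x) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]

# Illustrative logits for one (text, hypothesis) pair; real values would come
# from the model. Index 0 = entailment is an assumption for this sketch.
entail_logit, not_entail_logit = 2.0, -1.0
entail_prob = softmax([entail_logit, not_entail_logit])[0]
print(f"entailment score: {entail_prob:.3f}")  # ~0.953
```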
📄 License
The base model (DeBERTa-v3) is published under the MIT license. The datasets the model was fine-tuned on are published under a diverse set of licenses. The following spreadsheet provides an overview of the non-NLI datasets used for fine-tuning, containing information on licenses, the underlying papers, etc.: https://docs.google.com/spreadsheets/d/1Z18tMh02IiWgh6o8pfoMiI_LH4IXpr78wd_nmNd5FaE/edit?usp=sharing
In addition, the model was also trained on the following NLI datasets: MNLI, ANLI, WANLI, LING-NLI, FEVER-NLI.
Limitations and bias
⚠️ Important Note
The model can only perform text classification tasks. For potential biases, please consult the original DeBERTa paper and the papers for the respective fine-tuning datasets.
Citation
If you use this model, please cite:
@article{laurer_less_2023,
title = {Less {Annotating}, {More} {Classifying}: {Addressing} the {Data} {Scarcity} {Issue} of {Supervised} {Machine} {Learning} with {Deep} {Transfer} {Learning} and {BERT}-{NLI}},
issn = {1047-1987, 1476-4989},
shorttitle = {Less {Annotating}, {More} {Classifying}},
url = {https://www.cambridge.org/core/product/identifier/S1047198723000207/type/journal_article},
doi = {10.1017/pan.2023.20},
language = {en},
urldate = {2023-06-20},
journal = {Political Analysis},
author = {Laurer, Moritz and Van Atteveldt, Wouter and Casas, Andreu and Welbers, Kasper},
month = jun,
year = {2023},
pages = {1--33},
}
Ideas for cooperation or questions?
If you have questions or ideas for cooperation, contact me at m{dot}laurer{at}vu{dot}nl or [LinkedIn](https://www.linkedin.com/in/moritz-laurer/)
Debugging and issues
💡 Usage Tip
Note that DeBERTa-v3 was released on 06.12.21 and older versions of HF Transformers seem to have issues running the model (e.g., resulting in an issue with the tokenizer). Using Transformers>=4.13 might solve some issues.
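A quick way to check whether an installed version clears that 4.13 floor is to compare version components numerically rather than as strings (e.g., "4.9" sorts after "4.13" alphabetically but is older). A small sketch; in practice you would pass in `transformers.__version__`:

```python
# Check that a dotted version string is at least the 4.13 floor suggested
# above. String comparison is wrong here ("4.9" > "4.13" alphabetically),
# so compare the numeric major/minor components instead.
def version_at_least(version, floor="4.13"):
    parse = lambda v: [int(part) for part in v.split(".")[:2]]
    return parse(version) >= parse(floor)

print(version_at_least("4.35.2"))  # True
print(version_at_least("4.9.0"))   # False: 4.9 is older than 4.13
```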