bart-large-mnli-yahoo-answers
This model fine-tunes facebook/bart-large-mnli on Yahoo Answers topic classification. It can predict whether a topic label can be assigned to a given sequence, regardless of whether the label has been seen before.
🚀 Quick Start
You can play with an interactive demo of this zero-shot technique with this model, as well as the non-fine-tuned facebook/bart-large-mnli, here.
✨ Features
- This model is fine-tuned on topic classification and performs best at zero-shot topic classification.
- It can handle both seen and unseen labels for topic classification.
📦 Installation
Install the Hugging Face `transformers` library with `pip install transformers`; the advanced example below also requires PyTorch (`pip install torch`).
💻 Usage Examples
Basic Usage
The model can be used with the `zero-shot-classification` pipeline like so:
```python
from transformers import pipeline

nlp = pipeline("zero-shot-classification", model="joeddav/bart-large-mnli-yahoo-answers")

sequence_to_classify = "Who are you voting for in 2020?"
candidate_labels = ["Europe", "public health", "politics", "elections"]
hypothesis_template = "This text is about {}."

# multi_label=True scores each label independently rather than normalizing
# across labels (this argument was named multi_class in older transformers versions)
nlp(sequence_to_classify, candidate_labels,
    multi_label=True, hypothesis_template=hypothesis_template)
```
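With `multi_label=True`, each candidate label is scored independently as its own entailment problem, so several labels can receive high scores at once; with `multi_label=False`, the entailment scores are instead normalized across the candidate labels so they sum to one.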
Advanced Usage
```python
import torch
from transformers import BartForSequenceClassification, BartTokenizer

device = "cuda" if torch.cuda.is_available() else "cpu"

nli_model = BartForSequenceClassification.from_pretrained('joeddav/bart-large-mnli-yahoo-answers').to(device)
tokenizer = BartTokenizer.from_pretrained('joeddav/bart-large-mnli-yahoo-answers')

# Pose the sequence as the NLI premise and the candidate label as the hypothesis
premise = sequence  # the text to classify (defined elsewhere)
hypothesis = f'This text is about {label}.'  # label is the candidate topic

# Run the pair through the model; truncate only the premise if the pair is too long
x = tokenizer.encode(premise, hypothesis, return_tensors='pt',
                     truncation='only_first')
logits = nli_model(x.to(device))[0]

# Discard the "neutral" logit (index 1) and softmax "contradiction" (index 0)
# vs. "entailment" (index 2); the entailment probability is the label probability
entail_contradiction_logits = logits[:, [0, 2]]
probs = entail_contradiction_logits.softmax(dim=1)
prob_label_is_true = probs[:, 1]
```
📚 Documentation
Intended Usage
This model was fine-tuned on topic classification and will perform best at zero-shot topic classification. Use `hypothesis_template="This text is about {}."`, as this is the template used during fine-tuning.
For settings other than topic classification, you can use any model pre-trained on MNLI, such as facebook/bart-large-mnli or roberta-large-mnli, with the same code as written above; a sketch follows below.
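For example, a minimal sketch of swapping in the non-fine-tuned base model (only the model name changes; the pipeline call is identical):
```python
from transformers import pipeline

# Any MNLI-pretrained checkpoint can be swapped in, e.g. facebook/bart-large-mnli
# or roberta-large-mnli
nlp = pipeline("zero-shot-classification", model="facebook/bart-large-mnli")

# Without a custom hypothesis_template, the pipeline's default template is used
nlp("Who are you voting for in 2020?",
    candidate_labels=["politics", "public health", "Europe", "elections"])
```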
Training
The model is a pre-trained MNLI classifier further fine-tuned on Yahoo Answers topic classification in the manner originally described in Yin et al. 2019 and this blog post. That is, each sequence is fed to the pre-trained NLI model in place of the premise and each candidate label as the hypothesis, formatted like so: `This text is about {class name}.`
For each example in the training set, a true and a randomly-selected false label hypothesis are fed to the model, which must predict which labels are valid and which are false.
Since this method studies the ability to classify unseen labels after being trained on a different set of labels, the model is only trained on 5 out of the 10 labels in Yahoo Answers. These are "Society & Culture", "Health", "Computers & Internet", "Business & Finance", and "Family & Relationships".
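As an illustration, a hypothetical sketch of how such training pairs could be constructed (the `make_nli_pairs` helper is not the authors' actual training code, just a restatement of the scheme above):
```python
import random

# The five Yahoo Answers labels seen during training
SEEN_LABELS = ["Society & Culture", "Health", "Computers & Internet",
               "Business & Finance", "Family & Relationships"]

def make_nli_pairs(sequence, true_label):
    """Hypothetical helper: build one entailed and one contradicted
    premise/hypothesis pair for a training sequence."""
    false_label = random.choice([l for l in SEEN_LABELS if l != true_label])
    template = "This text is about {}."
    return [
        (sequence, template.format(true_label), "entailment"),
        (sequence, template.format(false_label), "contradiction"),
    ]

pairs = make_nli_pairs("How do I speed up my home wifi?", "Computers & Internet")
```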
Evaluation Results
This model was evaluated with the label-weighted F1 of the seen and unseen labels. That is, for each example the model must predict from one of the 10 corpus labels. The F1 is reported for the labels seen during training as well as the labels unseen during training. We found an F1 score of .68 and .72 for the unseen and seen labels, respectively. In order to adjust for the in- vs. out-of-distribution labels, we subtract a fixed amount of 30% from the normalized probabilities of the seen labels, as described in Yin et al. 2019 and our blog post.
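Concretely, the adjustment subtracts 0.3 from each seen label's normalized probability before taking the argmax over all 10 labels; a minimal sketch with illustrative numbers:
```python
# Illustrative normalized probabilities over (a subset of) the 10 corpus labels
probs = {"Health": 0.45, "Sports": 0.40, "Education & Reference": 0.15}

SEEN = {"Society & Culture", "Health", "Computers & Internet",
        "Business & Finance", "Family & Relationships"}

# Penalize seen (in-distribution) labels by a fixed 30% to offset the
# model's bias toward labels it was fine-tuned on
adjusted = {label: p - 0.3 if label in SEEN else p for label, p in probs.items()}
prediction = max(adjusted, key=adjusted.get)  # -> "Sports"
```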
📄 License
This model is licensed under the Apache 2.0 license.
| Property | Details |
|----------|---------|
| Model Type | Fine-tuned bart-large-mnli for zero-shot topic classification |
| Training Data | Yahoo Answers (trained on 5 of the 10 labels: "Society & Culture", "Health", "Computers & Internet", "Business & Finance", and "Family & Relationships") |
| Base Model | facebook/bart-large-mnli |
| Pipeline Tag | zero-shot-classification |