BERT-Election2020-Twitter-Stance-Biden-KE-MLM Open Source Model - Accurately Detect Stance on Biden in 2020 US Election Tweets

Bert Election2020 Twitter Stance Biden KE MLM

Developed by kornosk

This is a pre-trained language model based on the BERT-base architecture, specifically optimized for detecting stances on Joe Biden in tweets during the 2020 US election.

Text Classification EnglishOpen Source License:Gpl-3.0 #Political Stance Detection #Twitter Text Analysis #Knowledge-Enhanced Pre-training

Downloads 69

Release Time : 3/2/2022

Model Overview

The model is pre-trained using the Knowledge-Enhanced Masked Language Model (KE-MLM) method and fine-tuned on annotated Twitter datasets to detect support, opposition, or neutral stances towards Joe Biden.

Model Features

Knowledge-Enhanced Pre-training

Uses Knowledge-Enhanced Masked Language Model (KE-MLM) for pre-training, improving the accuracy of stance detection.

Domain-Specific Optimization

Specifically optimized for political tweets during the 2020 US election, excelling in political stance detection tasks.

Three-Class Classification

Capable of identifying support, opposition, and neutral stances.

Model Capabilities

Text Classification

Stance Detection

Political Text Analysis

Social Media Content Analysis

Use Cases

Political Analysis

Candidate Support Analysis

Analyzes the distribution of support, opposition, and neutral attitudes towards Joe Biden on social media.

Quantifies the candidate's popularity on social media.

Public Opinion Monitoring

Monitors real-time changes in public opinion trends about political figures on social media.

Helps political teams adjust campaign strategies promptly.

Academic Research

Political Communication Research

Used to study the patterns and effects of political information dissemination on social media.

Provides data support for political communication studies.

🚀 Pre-trained BERT on Twitter US Election 2020 for Stance Detection towards Joe Biden (KE-MLM)

This project provides pre-trained weights for the KE-MLM model, which is used for stance detection towards Joe Biden in the context of the 2020 US Twitter election.

✨ Features

Pre-trained on over 5 million English tweets about the 2020 US Presidential Election.
Fine-tuned using stance-labeled data for stance detection towards Joe Biden.
Initialized with BERT-base and trained with a normal MLM objective, with the classification layer fine-tuned for stance detection.

📦 Installation

No specific installation steps are provided in the original document.

💻 Usage Examples

Basic Usage

from transformers import AutoTokenizer, AutoModelForSequenceClassification
import torch
import numpy as np

# choose GPU if available
device = torch.device("cuda" if torch.cuda.is_available() else "cpu")

# select mode path here
pretrained_LM_path = "kornosk/bert-election2020-twitter-stance-biden-KE-MLM"

# load model
tokenizer = AutoTokenizer.from_pretrained(pretrained_LM_path)
model = AutoModelForSequenceClassification.from_pretrained(pretrained_LM_path)

id2label = {
    0: "AGAINST",
    1: "FAVOR",
    2: "NONE"
}

##### Prediction Neutral #####
sentence = "Hello World."
inputs = tokenizer(sentence.lower(), return_tensors="pt")
outputs = model(**inputs)
predicted_probability = torch.softmax(outputs[0], dim=1)[0].tolist()

print("Sentence:", sentence)
print("Prediction:", id2label[np.argmax(predicted_probability)])
print("Against:", predicted_probability[0])
print("Favor:", predicted_probability[1])
print("Neutral:", predicted_probability[2])

##### Prediction Favor #####
sentence = "Go Go Biden!!!"
inputs = tokenizer(sentence.lower(), return_tensors="pt")
outputs = model(**inputs)
predicted_probability = torch.softmax(outputs[0], dim=1)[0].tolist()

print("Sentence:", sentence)
print("Prediction:", id2label[np.argmax(predicted_probability)])
print("Against:", predicted_probability[0])
print("Favor:", predicted_probability[1])
print("Neutral:", predicted_probability[2])

##### Prediction Against #####
sentence = "Biden is the worst."
inputs = tokenizer(sentence.lower(), return_tensors="pt")
outputs = model(**inputs)
predicted_probability = torch.softmax(outputs[0], dim=1)[0].tolist()

print("Sentence:", sentence)
print("Prediction:", id2label[np.argmax(predicted_probability)])
print("Against:", predicted_probability[0])
print("Favor:", predicted_probability[1])
print("Neutral:", predicted_probability[2])

# please consider citing our paper if you feel this is useful :)

📚 Documentation

Training Data

This model is pre-trained on over 5 million English tweets about the 2020 US Presidential Election. Then fine-tuned using our stance-labeled data for stance detection towards Joe Biden.

Training Objective

This model is initialized with BERT-base and trained with normal MLM objective with classification layer fine-tuned for stance detection towards Joe Biden.

Usage

This pre-trained language model is fine-tuned to the stance detection task specifically for Joe Biden. Please see the official repository for more detail.

📄 License

This project is licensed under the GPL-3.0 license.

📖 Reference

Knowledge Enhance Masked Language Model for Stance Detection, NAACL 2021.

📚 Citation

@inproceedings{kawintiranon2021knowledge,
    title={Knowledge Enhanced Masked Language Model for Stance Detection},
    author={Kawintiranon, Kornraphop and Singh, Lisa},
    booktitle={Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies},
    year={2021},
    publisher={Association for Computational Linguistics},
    url={https://www.aclweb.org/anthology/2021.naacl-main.376}
}

Featured Recommended AI Models

Empowering the Future, Your AI Solution Knowledge Base

English 简体中文繁體中文にほんご