ByT5-base fine-tuned for Hate Speech Detection (on Tweets)
ByT5-base fine-tuned on the tweets_hate_speech_detection dataset for the Sequence Classification downstream task.
Features
Details of ByT5 - Base
ByT5 is a tokenizer-free version of Google's T5 and generally follows the architecture of MT5. It was pre-trained only on mC4, without any supervised training, using an average span mask of 20 UTF-8 characters, so the model needs to be fine-tuned before it can be used for a downstream task. ByT5 performs particularly well on noisy text data: for example, google/byt5-base significantly outperforms mt5-base on TweetQA.
Details of the downstream task (Sequence Classification as Text Generation) - Dataset
The tweets_hate_speech_detection dataset aims to detect hate speech in tweets. For simplicity, a tweet is considered to contain hate speech if it has a racist or sexist sentiment, so the task is to distinguish racist or sexist tweets from all other tweets.
Formally, given a training sample of tweets and labels (where label '1' denotes that the tweet is racist/sexist and label '0' denotes that it is not), the objective is to predict the labels on the given test dataset.
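Since the classification is framed as text generation, each integer label must be rendered as a short target string for the seq2seq objective. A minimal sketch of that preprocessing step (the exact target strings used for this checkpoint are an assumption):

```python
# Hypothetical label-to-text mapping for classification-as-generation;
# the real checkpoint may have been trained with different target strings.
label2text = {0: "no-hate-speech", 1: "hate-speech"}

def to_seq2seq_example(tweet: str, label: int) -> dict:
    # The model is trained to read the tweet and *generate* the label text.
    return {"input_text": tweet, "target_text": label2text[label]}

ex = to_seq2seq_example("@user when a father is dysfunctional...", 0)
```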
- Data Instances:
The dataset contains a label indicating whether a tweet is hate speech or not.
{'label': 0,  # not hate speech
'tweet': ' @user when a father is dysfunctional and is so selfish he drags his kids into his dysfunction. #run'}
- Data Fields:
- label: 1 = hate speech, 0 = not hate speech
- tweet: content of the tweet as a string
- Data Splits:
The dataset provides a training split with 31,962 entries.
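A representative held-out test set can be carved out per label so that it preserves the class imbalance. A minimal stratified-split sketch (the function and field names are illustrative, not from the training code):

```python
import random

def stratified_split(examples, test_frac=0.05, seed=42):
    """Split examples into train/test, sampling test_frac within each label."""
    rng = random.Random(seed)
    by_label = {}
    for ex in examples:
        by_label.setdefault(ex["label"], []).append(ex)
    train, test = [], []
    for items in by_label.values():
        rng.shuffle(items)
        n_test = max(1, int(len(items) * test_frac))  # keep at least one per class
        test.extend(items[:n_test])
        train.extend(items[n_test:])
    return train, test
```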
Test set metrics
A representative test set was created from 5% of the entries. Because the dataset is imbalanced, the F1 score is reported: the model achieved an F1 of 79.8.
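F1 balances precision and recall, which is why it is a better metric than plain accuracy on an imbalanced dataset, where always predicting the majority class would already look deceptively accurate. Binary F1 can be computed by hand:

```python
def f1_binary(y_true, y_pred):
    # F1 = 2PR / (P + R), with precision P = tp/(tp+fp) and recall R = tp/(tp+fn).
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    fp = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 1)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 0)
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    return 2 * precision * recall / (precision + recall) if (precision + recall) else 0.0

f1_binary([0, 0, 0, 1, 1], [0, 0, 1, 1, 1])  # precision 2/3, recall 1.0 -> F1 = 0.8
```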
Installation
git clone https://github.com/huggingface/transformers.git
pip install -q ./transformers
Usage Examples
Basic Usage
from transformers import AutoTokenizer, T5ForConditionalGeneration

ckpt = 'Narrativa/byt5-base-tweet-hate-detection'

tokenizer = AutoTokenizer.from_pretrained(ckpt)
model = T5ForConditionalGeneration.from_pretrained(ckpt).to("cuda")  # requires a CUDA-capable GPU

def classify_tweet(tweet):
    # ByT5 consumes raw bytes, so no subword tokenization is involved.
    inputs = tokenizer([tweet], padding='max_length', truncation=True, max_length=512, return_tensors='pt')
    input_ids = inputs.input_ids.to('cuda')
    attention_mask = inputs.attention_mask.to('cuda')
    # The label is generated as text rather than predicted as a class logit.
    output = model.generate(input_ids, attention_mask=attention_mask)
    return tokenizer.decode(output[0], skip_special_tokens=True)

classify_tweet('here goes your tweet...')
About Narrativa
This model was created by Narrativa. Narrativa focuses on Natural Language Generation (NLG); Gabriele, its machine-learning-based platform, builds and deploys natural language solutions. #NLG #AI