distilbert-base-uncased-finetuned-sst-2-english open-source text classification model

Distilbert Base Uncased Finetuned Sst 2 English

Developed by distilbert

Text classification model fine-tuned on the SST-2 sentiment analysis dataset based on DistilBERT-base-uncased, with 91.3% accuracy

Text Classification EnglishOpen Source License:Apache-2.0 #Sentiment Analysis #Text Classification #English NLP

Downloads 5.2M

Release Time : 3/2/2022

Model Overview

Lightweight BERT variant optimized for English text sentiment analysis tasks

Model Features

Efficient & Lightweight

40% smaller than original BERT while retaining 97% performance

Fast Inference

Distilled architecture provides 60% speed improvement

Domain Adaptation

Specialized optimization for movie review sentiment analysis

Model Capabilities

Text Classification

Sentiment Analysis

Sentence-level Feature Extraction

Use Cases

Content Analysis

Movie Review Sentiment Analysis

Determine sentiment polarity (positive/negative) of movie reviews

91.3% accuracy on development set

Social Media Monitoring

Analyze emotional tendencies in user posts

🚀 DistilBERT base uncased finetuned SST-2

This model is a fine - tuned version of DistilBERT on the SST - 2 dataset. It's designed for text classification tasks and can achieve high accuracy in sentiment analysis.

🚀 Quick Start

Basic Usage

import torch
from transformers import DistilBertTokenizer, DistilBertForSequenceClassification

tokenizer = DistilBertTokenizer.from_pretrained("distilbert-base-uncased-finetuned-sst-2-english")
model = DistilBertForSequenceClassification.from_pretrained("distilbert-base-uncased-finetuned-sst-2-english")

inputs = tokenizer("Hello, my dog is cute", return_tensors="pt")
with torch.no_grad():
    logits = model(**inputs).logits

predicted_class_id = logits.argmax().item()
model.config.id2label[predicted_class_id]

✨ Features

High Accuracy: Reaches an accuracy of 91.3 on the dev set.
Fine - tuned: Based on DistilBERT - base - uncased and fine - tuned on SST - 2.
Text Classification: Suitable for single - label text classification tasks.

📚 Documentation

Model Details

Property	Details
Model Type	Text Classification
Developed by	Hugging Face
Language(s)	English
License	Apache - 2.0
Parent Model	DistilBERT - base - uncased
Resources for more information	Model Documentation, DistilBERT paper

This model is a fine - tune checkpoint of DistilBERT - base - uncased, fine - tuned on SST - 2. It reaches an accuracy of 91.3 on the dev set (for comparison, Bert bert - base - uncased version reaches an accuracy of 92.7).

Uses

Direct Use

This model can be used for topic classification. You can use the raw model for either masked language modeling or next sentence prediction, but it's mostly intended to be fine - tuned on a downstream task. See the model hub to look for fine - tuned versions on a task that interests you.

Misuse and Out - of - scope Use

The model should not be used to intentionally create hostile or alienating environments for people. In addition, the model was not trained to be factual or true representations of people or events, and therefore using the model to generate such content is out - of - scope for the abilities of this model.

Risks, Limitations and Biases

Based on a few experimentations, we observed that this model could produce biased predictions that target underrepresented populations.

For instance, for sentences like This film was filmed in COUNTRY, this binary classification model will give radically different probabilities for the positive label depending on the country (0.89 if the country is France, but 0.08 if the country is Afghanistan) when nothing in the input indicates such a strong semantic shift. In this colab, Aurélien Géron made an interesting map plotting these probabilities for each country.

Map of positive probabilities per country.

We strongly advise users to thoroughly probe these aspects on their use - cases in order to evaluate the risks of this model. We recommend looking at the following bias evaluation datasets as a place to start: WinoBias, WinoGender, Stereoset.

Training

Training Data

The authors use the following Stanford Sentiment Treebank(sst2) corpora for the model.

Training Procedure

Fine - tuning hyper - parameters

learning_rate = 1e - 5
batch_size = 32
warmup = 600
max_seq_length = 128
num_train_epochs = 3.0

📄 License

This model is licensed under the Apache - 2.0 license.

Featured Recommended AI Models

Empowering the Future, Your AI Solution Knowledge Base

English 简体中文繁體中文にほんご