# roberta-base_stress_classification
This model is a fine-tuned version of [roberta-base](https://huggingface.co/roberta-base) for classifying stress levels in text, trained on a Glassdoor dataset of 100,000 employee reviews.
## 🚀 Quick Start

The model achieves the following results on the evaluation set:
- Loss: 0.1800
- Accuracy: 0.9647
- F1: 0.9647
- Precision: 0.9647
- Recall: 0.9647
## 📦 Installation

The model is used through the 🤗 Transformers library; a standard environment with `pip install transformers torch` is all the usage examples below require.
## ✨ Features
- Text Classification: Capable of classifying text as either "Stressed" or "Not Stressed".
- High Performance: Achieves roughly 0.96 accuracy, F1, precision, and recall on the evaluation set.
## 💻 Usage Examples

### Basic Usage
```python
from transformers import AutoTokenizer, AutoModelForSequenceClassification, pipeline

# Load the fine-tuned stress classification checkpoint
tokenizer = AutoTokenizer.from_pretrained("dstefa/roberta-base_stress_classification")
model = AutoModelForSequenceClassification.from_pretrained("dstefa/roberta-base_stress_classification")

# device=0 runs on the first GPU; drop the argument to run on CPU
pipe = pipeline("text-classification", model=model, tokenizer=tokenizer, device=0)

text = "They also caused so much stress because some leaders valued optics over output."
pipe(text)
# [{'label': 'Stressed', 'score': 0.9959163069725037}]
```
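If you prefer not to use the `pipeline` helper, the sketch below shows direct inference with explicit softmax probabilities. It assumes the same checkpoint and relies on the label names stored in the model config:

```python
import torch
from transformers import AutoTokenizer, AutoModelForSequenceClassification

tokenizer = AutoTokenizer.from_pretrained("dstefa/roberta-base_stress_classification")
model = AutoModelForSequenceClassification.from_pretrained("dstefa/roberta-base_stress_classification")
model.eval()

text = "They also caused so much stress because some leaders valued optics over output."
inputs = tokenizer(text, return_tensors="pt", truncation=True)

# Run a forward pass without tracking gradients
with torch.no_grad():
    logits = model(**inputs).logits

# Convert logits to class probabilities and pick the top class
probs = torch.softmax(logits, dim=-1)[0]
pred_id = int(probs.argmax())
print(model.config.id2label[pred_id], round(float(probs[pred_id]), 4))
```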
## 📚 Documentation
### Training data

Training data was labeled as follows:

| Class | Description  |
|-------|--------------|
| 0     | Not Stressed |
| 1     | Stressed     |
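In code, this corresponds to the usual Transformers label mapping (a sketch based on the table above; the checkpoint's own `config.id2label` is the authoritative source):

```python
# Assumed mapping, mirroring the class table above
id2label = {0: "Not Stressed", 1: "Stressed"}
label2id = {"Not Stressed": 0, "Stressed": 1}
```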
### Training procedure

#### Training hyperparameters

The following hyperparameters were used during training (see the `TrainingArguments` sketch after this list):
- learning_rate: 5e-05
- train_batch_size: 8
- eval_batch_size: 8
- seed: 42
- optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
- lr_scheduler_type: linear
- lr_scheduler_warmup_steps: 500
- num_epochs: 5
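For reference, here is a minimal sketch of how these settings map onto `transformers.TrainingArguments`. The output directory is hypothetical, and the original training script is not part of this card:

```python
from transformers import TrainingArguments

# Sketch only: mirrors the hyperparameters listed above.
# Adam betas=(0.9, 0.999) and epsilon=1e-08 are the optimizer defaults.
training_args = TrainingArguments(
    output_dir="roberta-base_stress_classification",  # hypothetical path
    learning_rate=5e-05,
    per_device_train_batch_size=8,
    per_device_eval_batch_size=8,
    seed=42,
    lr_scheduler_type="linear",
    warmup_steps=500,
    num_train_epochs=5,
)
```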
#### Training results

| Training Loss | Epoch | Step  | Validation Loss | Accuracy | F1     | Precision | Recall |
|---------------|-------|-------|-----------------|----------|--------|-----------|--------|
| 0.704         | 1.0   | 8000  | 0.6933          | 0.5      | 0.3333 | 0.25      | 0.5    |
| 0.6926        | 2.0   | 16000 | 0.6980          | 0.5      | 0.3333 | 0.25      | 0.5    |
| 0.0099        | 3.0   | 24000 | 0.1800          | 0.9647   | 0.9647 | 0.9647    | 0.9647 |
| 0.2727        | 4.0   | 32000 | 0.2243          | 0.9526   | 0.9526 | 0.9527    | 0.9526 |
| 0.0618        | 5.0   | 40000 | 0.2128          | 0.9536   | 0.9536 | 0.9546    | 0.9536 |
### Model performance

|              | precision | recall | f1   | support |
|--------------|-----------|--------|------|---------|
| Not Stressed | 0.96      | 0.97   | 0.97 | 10000   |
| Stressed     | 0.97      | 0.96   | 0.97 | 10000   |
| accuracy     |           |        | 0.97 | 20000   |
| macro avg    | 0.97      | 0.97   | 0.97 | 20000   |
| weighted avg | 0.97      | 0.97   | 0.97 | 20000   |
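A report of this shape can be reproduced with scikit-learn's `classification_report`. The sketch below uses dummy labels for illustration; in practice `y_true` and `y_pred` would come from running the model over the 20,000-review test split:

```python
from sklearn.metrics import classification_report

# Illustrative placeholders, not the original evaluation data
y_true = [0, 1, 1, 0, 1, 0]
y_pred = [0, 1, 0, 0, 1, 0]

print(classification_report(y_true, y_pred, target_names=["Not Stressed", "Stressed"]))
```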
### Framework versions
- Transformers 4.32.1
- Pytorch 2.1.0+cu121
- Datasets 2.12.0
- Tokenizers 0.13.2
## 📄 License
This project is licensed under the MIT license.