# 🎭 KoELECTRA Fine-tuned for Korean Emotion Classification
This model is fine-tuned from KoELECTRA for Korean emotion classification, capable of classifying six major emotions: anger, happiness, anxiety, embarrassment, sadness, and heartache.
## 🚀 Quick Start

Install the library below, then run the usage examples to classify Korean text into the six emotion labels.
- Base Model: KoELECTRA (Korean ELECTRA)
- Task: Multi-class Emotion Classification
- Language: Korean
- License: MIT
⨠Features
- Capable of classifying six major emotions: anger, happiness, anxiety, embarrassment, sadness, and heartache.
- Can be used in various applications such as social media emotion analysis, customer review analysis, chatbot emotion recognition, content recommendation, music recommendation, and literary analysis.
## 📦 Installation
To use this model, install the `transformers` library (the examples below also require PyTorch):

```bash
pip install transformers torch
```
## 💻 Usage Examples

### Basic Usage
```python
from transformers import AutoTokenizer, AutoModelForSequenceClassification
import torch

model_name = "Jinuuuu/KoELECTRA_fine_tunning_emotion"
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForSequenceClassification.from_pretrained(model_name)

def analyze_emotion(text):
    # Tokenize with truncation at the model's 512-token limit.
    inputs = tokenizer(
        text,
        return_tensors="pt",
        truncation=True,
        max_length=512,
        padding=True
    )
    with torch.no_grad():
        outputs = model(**inputs)

    # Convert logits to a probability distribution over the six labels.
    probs = torch.softmax(outputs.logits, dim=1)

    emotion_labels = ['angry', 'anxious', 'embarrassed', 'happy', 'heartache', 'sad']
    emotion_probs = {}
    for i, label in enumerate(emotion_labels):
        emotion_probs[label] = float(probs[0][i])
    return emotion_probs

text = "오늘은 정말 행복한 하루였다."  # "Today was a really happy day."
result = analyze_emotion(text)

print("Emotion analysis result:")
for emotion, prob in sorted(result.items(), key=lambda x: x[1], reverse=True):
    print(f"{emotion}: {prob:.3f}")
```
### Advanced Usage
```python
from transformers import pipeline

classifier = pipeline(
    "text-classification",
    model="Jinuuuu/KoELECTRA_fine_tunning_emotion",
    tokenizer="Jinuuuu/KoELECTRA_fine_tunning_emotion"
)

texts = [
    "오늘은 정말 행복한 하루였다.",  # "Today was a really happy day."
    "너무 화가 나서 참을 수 없다.",  # "I'm so angry I can't stand it."
    "내일 시험이 걱정된다."          # "I'm worried about tomorrow's exam."
]

results = classifier(texts)
for text, result in zip(texts, results):
    print(f"Text: {text}")
    print(f"Emotion: {result['label']} (Probability: {result['score']:.3f})")
    print()
```
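By default the pipeline returns only the top label per text. To get the full six-way distribution, recent versions of transformers accept `top_k=None` (older versions used the now-deprecated `return_all_scores=True`); a short sketch:

```python
classifier_all = pipeline(
    "text-classification",
    model="Jinuuuu/KoELECTRA_fine_tunning_emotion",
    top_k=None,  # return scores for all six labels instead of just the top one
)
for scores in classifier_all(["오늘은 정말 행복한 하루였다."]):
    for entry in sorted(scores, key=lambda s: s["score"], reverse=True):
        print(f"{entry['label']}: {entry['score']:.3f}")
```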
## 📚 Documentation

### Emotion Labels
The model classifies the following six emotions:
| Label | Korean | Description |
|---|---|---|
| angry | 분노 | Anger, irritation, annoyance |
| happy | 행복 | Happiness, joy, satisfaction |
| anxious | 불안 | Anxiety, worry, fear |
| embarrassed | 당황 | Embarrassment, confusion, discomfiture |
| sad | 슬픔 | Sadness, depression, disappointment |
| heartache | 상처 | Heartache, betrayal, disappointment |
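The index-to-label mapping used at inference time should match the `id2label` entry in the uploaded config. If the mapping was stored when the model was saved, it can be read directly instead of hard-coding the label list (a sketch; if it was not stored, generic `LABEL_0`…`LABEL_5` names will appear):

```python
from transformers import AutoConfig

config = AutoConfig.from_pretrained("Jinuuuu/KoELECTRA_fine_tunning_emotion")
# id2label maps class index -> label string when it was saved during fine-tuning.
print(config.id2label)
```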
### Model Architecture
- Base Model: KoELECTRA-base
- Model Type: Sequence Classification
- Hidden Size: 768
- Num Attention Heads: 12
- Num Hidden Layers: 12
- Max Sequence Length: 512
- Vocab Size: 35000
- Num Labels: 6
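These figures can be verified against the downloaded checkpoint's configuration; a quick sketch:

```python
from transformers import AutoModelForSequenceClassification

model = AutoModelForSequenceClassification.from_pretrained(
    "Jinuuuu/KoELECTRA_fine_tunning_emotion"
)
cfg = model.config

# Each attribute corresponds to one row of the list above.
print(cfg.hidden_size)              # 768
print(cfg.num_attention_heads)      # 12
print(cfg.num_hidden_layers)        # 12
print(cfg.max_position_embeddings)  # 512
print(cfg.vocab_size)               # 35000
print(cfg.num_labels)               # 6
```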
### Training Details

#### Training Data
- Dataset: Custom Korean Emotion Dataset
- Training Samples: ~50,000 sentences
- Validation Samples: ~10,000 sentences
- Data Source: Korean social media posts, reviews, and literature
#### Training Hyperparameters
- Learning Rate: 2e-5
- Batch Size: 16
- Epochs: 3-5
- Warmup Steps: 500
- Weight Decay: 0.01
- Max Sequence Length: 512
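The card does not include the training script itself; the following is a hypothetical `Trainer` setup that mirrors the hyperparameters above. The base checkpoint name and the `train_dataset`/`eval_dataset` variables are placeholders (`monologg/koelectra-base-v3-discriminator` is a common KoELECTRA checkpoint whose 35,000-token vocabulary matches the figure in Model Architecture):

```python
from transformers import (
    AutoModelForSequenceClassification,
    Trainer,
    TrainingArguments,
)

# Hypothetical base checkpoint: the card does not name the exact one.
model = AutoModelForSequenceClassification.from_pretrained(
    "monologg/koelectra-base-v3-discriminator", num_labels=6
)

args = TrainingArguments(
    output_dir="koelectra-emotion",
    learning_rate=2e-5,
    per_device_train_batch_size=16,
    num_train_epochs=4,        # the card reports 3-5 epochs
    warmup_steps=500,
    weight_decay=0.01,         # applied by the default AdamW optimizer
)

trainer = Trainer(
    model=model,
    args=args,
    train_dataset=train_dataset,  # placeholder: your tokenized training split
    eval_dataset=eval_dataset,    # placeholder: your tokenized validation split
)
trainer.train()
```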
#### Training Environment
- Framework: PyTorch + Transformers
- Hardware: GPU (CUDA enabled)
- Optimizer: AdamW
### Performance

#### Overall Performance
| Metric | Score |
|---|---|
| Accuracy | 0.85+ |
| F1-Score (Macro) | 0.83+ |
| F1-Score (Weighted) | 0.85+ |
#### Per-Class Performance
| Emotion | Precision | Recall | F1-Score |
|---|---|---|---|
| angry | 0.87 | 0.84 | 0.85 |
| happy | 0.89 | 0.91 | 0.90 |
| anxious | 0.82 | 0.79 | 0.80 |
| embarrassed | 0.78 | 0.76 | 0.77 |
| sad | 0.85 | 0.87 | 0.86 |
| heartache | 0.81 | 0.83 | 0.82 |
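The evaluation script is not part of the card; per-class numbers of this kind can be reproduced on a labeled test set with scikit-learn, roughly as follows (the texts and gold labels below are placeholders):

```python
from sklearn.metrics import classification_report
from transformers import pipeline

classifier = pipeline("text-classification", model="Jinuuuu/KoELECTRA_fine_tunning_emotion")

# Placeholders: substitute your own held-out texts and gold label strings.
texts = ["오늘은 정말 행복한 하루였다.", "내일 시험이 걱정된다."]
gold = ["happy", "anxious"]

pred = [classifier(t)[0]["label"] for t in texts]
print(classification_report(gold, pred, digits=2))
```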
### Applications
This model can be used for the following purposes:
- Social Media Emotion Analysis: Understanding the emotions in posts and comments.
- Customer Review Analysis: Classifying the emotions in product/service reviews.
- Chatbot Emotion Recognition: Understanding the user's emotions in a conversation system.
- Content Recommendation: Recommending content based on emotions.
- Music Recommendation: Recommending music based on text emotions.
- Literary Analysis: Analyzing the emotions in novels, poems, etc.
### Limitations
- The model is optimized for Korean text.
- It can process a maximum of 512 tokens per input; for longer texts, see the chunking sketch at the end of this section.
- The accuracy of emotion classification may vary depending on the context.
- The performance on slang, neologisms, and dialects may be limited.
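For inputs beyond the 512-token limit, one common workaround, not part of the released model, is to score overlapping chunks and average the resulting distributions; a rough sketch:

```python
import torch
from transformers import AutoTokenizer, AutoModelForSequenceClassification

model_name = "Jinuuuu/KoELECTRA_fine_tunning_emotion"
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForSequenceClassification.from_pretrained(model_name)

def analyze_long_text(text, window=510, stride=256):
    """Score overlapping 510-token windows and average the distributions."""
    ids = tokenizer(text, add_special_tokens=False)["input_ids"]
    prob_chunks = []
    for start in range(0, max(len(ids), 1), stride):
        chunk = ids[start:start + window]
        # Re-add [CLS] ... [SEP] around each window.
        input_ids = torch.tensor(
            [[tokenizer.cls_token_id] + chunk + [tokenizer.sep_token_id]]
        )
        with torch.no_grad():
            logits = model(input_ids=input_ids).logits
        prob_chunks.append(torch.softmax(logits, dim=1))
        if start + window >= len(ids):
            break
    return torch.cat(prob_chunks).mean(dim=0)  # averaged six-way distribution
```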
### Bias and Fairness
This model may reflect the biases in the training data. It may show biased results for certain topics or expressions. Therefore, sufficient validation and monitoring are required when applying it to real services.
## 🔧 Technical Details
The model is fine-tuned from KoELECTRA, a Korean ELECTRA model. It uses a sequence classification architecture to classify six major emotions in Korean text. The training data consists of Korean social media posts, reviews, and literature. The model is trained using PyTorch and the Transformers library with the AdamW optimizer.
## 📄 License
This model is released under the MIT license.
## Citation
```bibtex
@misc{koelectra_emotion_2024,
  title={KoELECTRA Fine-tuned for Korean Emotion Classification},
  author={Jinuuuu},
  year={2024},
  publisher={Hugging Face},
  howpublished={\url{https://huggingface.co/Jinuuuu/KoELECTRA_fine_tunning_emotion}}
}
```
## Model Card Authors
- Developer: Jinuuuu
- Model Type: Text Classification
- Language: Korean
- License: MIT
## Contact
If you have any questions or suggestions for improvement regarding the model, please contact us through GitHub issues or the Hugging Face model page.
## 💡 Usage Tip
This model was developed for research and educational purposes. Before using it commercially, please conduct sufficient verification and testing.