Koelectra-base-v3 Open-Source Korean Sentiment Analysis Model: Quickly Determine the Positive or Negative Tendency of Texts

Koelectra Base V3 Generalized Sentiment Analysis

Developed by Copycats

Korean sentiment analysis model based on KoELECTRA-v3, used to determine the positive or negative sentiment tendency of text.

Text Classification

Transformers

KoreanOpen Source License:Apache-2.0 #Korean Sentiment Analysis #E-commerce Review Classification #High-Precision Classification

Downloads 3,459

Release Time : 3/2/2022

Model Overview

This model is a Korean sentiment classifier, trained on the KoELECTRA-v3 architecture, specifically designed to analyze the sentiment tendency (positive/negative) of Korean text.

Model Features

Korean Sentiment Analysis

Sentiment analysis capability optimized specifically for Korean text.

High Accuracy

Demonstrates high classification confidence in examples (most scores >0.99).

Mixed Sentiment Handling

Capable of processing text containing mixed sentiments (e.g., 'The taste is good but the price is a bit expensive').

Model Capabilities

Korean text sentiment classification

Positive/Negative sentiment judgment

Short text sentiment analysis

Use Cases

E-commerce Review Analysis

Product Review Sentiment Classification

Analyze whether user reviews of products are positive or negative.

Can accurately identify product strengths and weaknesses (e.g., 'The craftsmanship is terrible' in the example was correctly classified as negative).

Social Media Monitoring

Public Opinion Analysis

Monitor sentiment tendencies towards brands or topics on social media.

🚀 Text Classification with Transformers

This project utilizes the transformers library for text classification, specifically for sentiment analysis of Korean reviews. It provides a simple way to classify reviews as positive or negative.

🚀 Quick Start

To get started with this text classification task, you need to follow these steps. First, import the necessary libraries and load the pre - trained model and tokenizer. Then, you can define a list of reviews and use the classification pipeline to predict the sentiment of each review.

💻 Usage Examples

Basic Usage

# import library
import torch
from transformers import AutoTokenizer, AutoModelForSequenceClassification, TextClassificationPipeline

# load model
tokenizer = AutoTokenizer.from_pretrained("Copycats/koelectra-base-v3-generalized-sentiment-analysis")
model = AutoModelForSequenceClassification.from_pretrained("Copycats/koelectra-base-v3-generalized-sentiment-analysis")
sentiment_classifier = TextClassificationPipeline(tokenizer=tokenizer, model=model)

# target reviews
review_list = [
    '이쁘고 좋아요~~~씻기도 편하고 아이고 이쁘다고 자기방에 갖다놓고 잘써요~^^',
    '아직 입어보진 않았지만 굉장히 가벼워요~~ 다른 리뷰처럼 어깡이 좀 되네요ㅋ 만족합니다. 엄청 빠른발송 감사드려요 :)',
    '재구매 한건데 너무너무 가성비인거 같아요!! 다음에 또 생각나면 3개째 또 살듯..ㅎㅎ',
    '가습량이 너무 적어요. 방이 작지 않다면 무조건 큰걸로구매하세요. 물량도 조금밖에 안들어가서 쓰기도 불편함',
    '한번입었는데 옆에 봉제선 다 풀리고 실밥도 계속 나옵니다. 마감 처리 너무 엉망 아닌가요?',
    '따뜻하고 좋긴한데 배송이 느려요',
    '맛은 있는데 가격이 있는 편이에요'
]

# predict
for idx, review in enumerate(review_list):
    pred = sentiment_classifier(review)
    print(f'{review}\n>> {pred[0]}')

The output of the above code will be as follows:

이쁘고 좋아요~~~씻기도 편하고 아이고 이쁘다고 자기방에 갖다놓고 잘써요~^^
>> {'label': '1', 'score': 0.9945501685142517}
아직 입어보진 않았지만 굉장히 가벼워요~~ 다른 리뷰처럼 어깡이 좀 되네요ㅋ 만족합니다. 엄청 빠른발송 감사드려요 :)
>> {'label': '1', 'score': 0.995430588722229}
재구매 한건데 너무너무 가성비인거 같아요!! 다음에 또 생각나면 3개째 또 살듯..ㅎㅎ
>> {'label': '1', 'score': 0.9959582686424255}
가습량이 너무 적어요. 방이 작지 않다면 무조건 큰걸로구매하세요. 물량도 조금밖에 안들어가서 쓰기도 불편함
>> {'label': '0', 'score': 0.9984619617462158}
한번입었는데 옆에 봉제선 다 풀리고 실밥도 계속 나옵니다. 마감 처리 너무 엉망 아닌가요?
>> {'label': '0', 'score': 0.9991756677627563}
따뜻하고 좋긴한데 배송이 느려요
>> {'label': '1', 'score': 0.6473883390426636}
맛은 있는데 가격이 있는 편이에요
>> {'label': '1', 'score': 0.5128092169761658}

Label Explanation

label 0 : negative review
label 1 : positive review

📄 License

This project is licensed under the Apache 2.0 license.

Featured Recommended AI Models

Empowering the Future, Your AI Solution Knowledge Base

English 简体中文繁體中文にほんご