Fast Emotion-X: an open-source emotion detection model that classifies text into six emotion categories

Deberta V3 Small Base Emotions Classifier

Developed by AnkitAI

Fast Emotion-X is an emotion detection model fine-tuned from Microsoft's DeBERTa V3 Small. It classifies text into six emotion categories.

Text classification

Transformers

English

Open-source license: MIT

#High-accuracy emotion classification #DeBERTa fine-tuning #Emotion analysis across diverse scenarios

Downloads: 518

Release date: June 30, 2024

Model overview

The model builds on DeBERTa and is fine-tuned on an emotion dataset to identify six emotions: anger, disgust, fear, joy, sadness, and surprise.

Model features

High accuracy

The reported evaluation accuracy is 94.6% across the six emotion classes.
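
For reference, accuracy is the share of evaluation examples whose predicted label matches the reference label. The following is a minimal sketch of that calculation; the labels are made up for illustration and are not the model's evaluation data.

```python
def accuracy(predicted, expected):
    """Return the fraction of positions where predicted and expected labels agree."""
    if len(predicted) != len(expected):
        raise ValueError("predicted and expected must have the same length")
    if not expected:
        raise ValueError("cannot compute accuracy on an empty evaluation set")
    correct = sum(p == e for p, e in zip(predicted, expected))
    return correct / len(expected)

# Hypothetical labels for illustration only
predicted = ["joy", "anger", "fear", "joy"]
expected = ["joy", "anger", "sadness", "joy"]
print(accuracy(predicted, expected))  # 0.75
```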

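Efficient fine-tuning

The model is fine-tuned from DeBERTa V3 Small and inherits that model's text-encoding capabilities.

Versatile usage

Supports several modes of use, including single-text classification, batch processing, and visualization.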

Model capabilities

Text emotion classification

Batch text processing

Emotion trend analysis

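Use cases

Emotion analysis

Social media emotion monitoring

Analyze the emotions in user comments on social media to understand how the public feels about a given topic.

Identifying emotions such as anger and joy supports public-opinion analysis.

Customer feedback analysis

Analyze the emotions in customer feedback text to identify dissatisfied and satisfied customers.

Classifying customer emotions quickly helps improve the quality of customer service.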
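
Both use cases come down to aggregating per-text labels into an overall picture. The following is a minimal sketch of that aggregation step, assuming the labels have already been predicted; the `emotion_shares` helper and the sample labels are hypothetical and not part of the package.

```python
from collections import Counter

def emotion_shares(labels):
    """Return {label: share of all labels}, ordered from most to least frequent."""
    counts = Counter(labels)
    total = sum(counts.values())
    return {label: count / total for label, count in counts.most_common()}

# Hypothetical predicted labels for five customer comments
labels = ["anger", "joy", "anger", "sadness", "anger"]
shares = emotion_shares(labels)
print(shares)  # {'anger': 0.6, 'joy': 0.2, 'sadness': 0.2}
```

A high share of "anger" in a batch of feedback could, for example, be used to flag that batch for review.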

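🚀 Fast Emotion-X: Emotion detection based on a fine-tuned DeBERTa V3 Small

Fast Emotion-X is an emotion detection model fine-tuned from Microsoft's DeBERTa V3 Small. It is designed to classify a text into exactly one of six emotion categories. It builds on DeBERTa's capabilities and is fine-tuned on an emotion dataset, with the aim of high accuracy and reliability.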

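✨ Key features

Classifies text into six emotion categories (anger, disgust, fear, joy, sadness, surprise).
Can be used through the provided Python package or directly with the Hugging Face transformers library.
Classifies a single text or several texts at once.
Visualizes the distribution of emotions.
Can be operated from a command-line interface (CLI).
Integrates with pandas DataFrames to classify a text column.
Analyzes and visualizes emotion trends.
Supports fine-tuning the pretrained model on your own dataset.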

📦 Installation

Install the package with pip:

pip install emotionclassifier

💻 Usage examples

Basic usage

The following example classifies a single text with emotionclassifier:

from emotionclassifier import EmotionClassifier

# Initialize the classifier with the default model
classifier = EmotionClassifier()

# Classify a single text
text = "I am very happy today!"
result = classifier.predict(text)
print("Emotion:", result['label'])
print("Confidence:", result['confidence'])
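
Because the result carries a confidence value, a caller may want to treat low-confidence predictions separately. The following is a minimal sketch of that idea. It assumes `confidence` is a float between 0 and 1; the `label_or_uncertain` helper and the 0.6 threshold are hypothetical and not part of the package.

```python
def label_or_uncertain(result, threshold=0.6):
    """Return the predicted label, or "uncertain" when confidence is below threshold."""
    if result["confidence"] >= threshold:
        return result["label"]
    return "uncertain"

# Hypothetical results shaped like the dictionary used in the example above
print(label_or_uncertain({"label": "joy", "confidence": 0.97}))   # joy
print(label_or_uncertain({"label": "fear", "confidence": 0.41}))  # uncertain
```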

Batch processing

Use the predict_batch method to classify several texts at once:

texts = ["I am very happy today!", "I am so sad."]
results = classifier.predict_batch(texts)
print("Batch processing results:", results)
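
For long lists of texts it can help to pass the input in smaller chunks to keep memory use bounded. The following is a minimal sketch of that pattern. It assumes `predict_batch` returns one result per input text, in order, which the source does not state explicitly; `predict_in_chunks` and `fake_predict_batch` are hypothetical names used only for illustration.

```python
def predict_in_chunks(predict_batch, texts, chunk_size=32):
    """Call predict_batch on consecutive slices of texts and concatenate the results."""
    if chunk_size < 1:
        raise ValueError("chunk_size must be at least 1")
    results = []
    for start in range(0, len(texts), chunk_size):
        results.extend(predict_batch(texts[start:start + chunk_size]))
    return results

# Stand-in for classifier.predict_batch so the sketch runs without the model
def fake_predict_batch(batch):
    return [text.upper() for text in batch]

print(predict_in_chunks(fake_predict_batch, ["a", "b", "c"], chunk_size=2))  # ['A', 'B', 'C']
```

With the real classifier, the call would look like `predict_in_chunks(classifier.predict_batch, texts)`.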

Visualization

To visualize the emotion distribution of a text:

from emotionclassifier import plot_emotion_distribution

result = classifier.predict("I am very happy today!")
plot_emotion_distribution(result['probabilities'], classifier.labels.values())
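
When a plot is not needed, the same probabilities can be inspected as text. The following is a minimal sketch that lists the highest-scoring emotions. It assumes the probabilities are a sequence of floats aligned with the labels; the `top_emotions` helper, the label order, and the values are hypothetical.

```python
def top_emotions(probabilities, labels, k=3):
    """Pair each label with its probability and return the k highest-scoring pairs."""
    pairs = zip(labels, probabilities)
    return sorted(pairs, key=lambda pair: pair[1], reverse=True)[:k]

# Hypothetical values; real ones would come from result['probabilities']
# and classifier.labels.values()
labels = ["anger", "disgust", "fear", "joy", "sadness", "surprise"]
probabilities = [0.02, 0.01, 0.03, 0.85, 0.04, 0.05]
for label, p in top_emotions(probabilities, labels):
    print(f"{label:<10}{p:.2f}")
```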

Using the command-line interface (CLI)

The package can also be used from the command line:

emotionclassifier --model deberta-v3-small --text "I am very happy today!"

DataFrame integration

The package integrates with pandas DataFrames to classify a text column:

import pandas as pd
from emotionclassifier import DataFrameEmotionClassifier

df = pd.DataFrame({
    'text': ["I am very happy today!", "I am so sad."]
})

# Use a separate name so the EmotionClassifier instance created earlier stays available
df_classifier = DataFrameEmotionClassifier()
df = df_classifier.classify_dataframe(df, 'text')
print(df)

Emotion trend analysis

Emotion trends can be analyzed and visualized:

from emotionclassifier import EmotionTrends

texts = ["I am very happy today!", "I am feeling okay.", "I am very sad."]
trends = EmotionTrends()
emotions = trends.analyze_trends(texts)
trends.plot_trends(emotions)
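
The source does not describe how EmotionTrends computes a trend. As one possible interpretation, a trend can be read as the dominant emotion over a sliding window of consecutive texts. The following is a minimal sketch of that swapped-in technique, not the package's implementation; `windowed_majority` and the sample labels are hypothetical.

```python
from collections import Counter

def windowed_majority(labels, window=3):
    """For each full window of consecutive labels, return the most frequent label."""
    if window < 1:
        raise ValueError("window must be at least 1")
    majorities = []
    for end in range(window, len(labels) + 1):
        counts = Counter(labels[end - window:end])
        majorities.append(counts.most_common(1)[0][0])
    return majorities

# Hypothetical per-message labels in chronological order
labels = ["joy", "joy", "sadness", "sadness", "sadness", "joy"]
print(windowed_majority(labels))  # ['joy', 'sadness', 'sadness', 'sadness']
```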

Fine-tuning

The pretrained model can be fine-tuned on your own dataset:

from emotionclassifier.fine_tune import fine_tune_model

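# Define the training and validation datasets
train_dataset = ...
val_dataset = ...

# Fine-tune the model (classifier is the EmotionClassifier instance from the basic usage example)
fine_tune_model(classifier.model, classifier.tokenizer, train_dataset, val_dataset, output_dir='fine_tuned_model')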
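
The source does not specify what type of dataset object fine_tune_model expects, so the datasets above are left as they are. Independently of that format, labeled examples usually need to be divided into a training part and a validation part first. The following is a minimal sketch of that split only; `split_examples`, the 20% validation fraction, and the sample data are hypothetical.

```python
import random

def split_examples(examples, val_fraction=0.2, seed=0):
    """Shuffle a copy of examples and split it into (train, validation) lists."""
    if not 0 < val_fraction < 1:
        raise ValueError("val_fraction must be between 0 and 1")
    shuffled = list(examples)
    random.Random(seed).shuffle(shuffled)  # fixed seed keeps the split reproducible
    n_val = max(1, int(len(shuffled) * val_fraction))
    return shuffled[n_val:], shuffled[:n_val]

# Hypothetical labeled examples
examples = [("I am thrilled!", "joy"), ("This is awful.", "anger"),
            ("I miss her.", "sadness"), ("What a shock!", "surprise"),
            ("I am scared.", "fear")]
train, val = split_examples(examples)
print(len(train), len(val))  # 4 1
```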

Using the transformers library

import torch
from transformers import AutoModelForSequenceClassification, AutoTokenizer

model_name = "AnkitAI/deberta-v3-small-base-emotions-classifier"
model = AutoModelForSequenceClassification.from_pretrained(model_name)
tokenizer = AutoTokenizer.from_pretrained(model_name)

# Example usage
def predict_emotion(text):
    inputs = tokenizer(text, return_tensors="pt", truncation=True, padding=True, max_length=128)
    # Gradients are not needed for inference
    with torch.no_grad():
        outputs = model(**inputs)
    logits = outputs.logits
    # Take the highest-scoring class index and map it to the label name in the model config
    prediction = logits.argmax(dim=1).item()
    return model.config.id2label[prediction]

text = "I'm so happy with the results!"
emotion = predict_emotion(text)
print("Detected Emotion:", emotion)
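
The example above keeps only the highest-scoring class. To see how confident the model is, the logits can be converted to probabilities with a softmax. The following is a minimal pure-Python sketch of that step, so that it runs without downloading the model. The logits and the label order are hypothetical; with the real model, label names should be read from `model.config.id2label`.

```python
import math

def softmax(logits):
    """Convert raw scores into probabilities that sum to 1."""
    peak = max(logits)  # subtract the maximum for numerical stability
    exps = [math.exp(x - peak) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]

# Hypothetical logits and label order, for illustration only
labels = ["anger", "disgust", "fear", "joy", "sadness", "surprise"]
logits = [-1.2, -2.0, -0.5, 3.1, 0.2, 0.4]
probs = softmax(logits)
best = max(range(len(probs)), key=probs.__getitem__)
print(labels[best], round(probs[best], 3))
```

With PyTorch tensors, the equivalent call is `torch.softmax(logits, dim=1)`.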

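🔧 Technical details

Model details

Property	Details
Model name	`AnkitAI/deberta-v3-small-base-emotions-classifier`
Base model	`microsoft/deberta-v3-small`
Dataset	dair-ai/emotion
Fine-tuning	Fine-tuned for emotion detection, with a classification head for six emotion categories: anger, disgust, fear, joy, sadness, and surprise.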