bert - sentiment - analisis - indo開源模型 - 免費為印尼語文本做正負情感分類

首頁

Bert Sentiment Analisis Indo

由bibrani開發

這是一個基於BERT架構的印尼語情感分析模型，能夠將文本分類為正面或負面情感。

文本分類

Safetensors

其他開源協議:MIT #印尼語情感分析 #BERT微調 #高精度分類

下載量 39

發布時間 : 3/20/2025

模型概述

該模型經過微調，專門用於印尼語文本的情感分析任務，能夠準確識別文本中的情感傾向。

模型特點

高準確率

在評估數據集上取得了0.91的準確率，正面情感分類的F1分數達到0.93。

印尼語優化

專門針對印尼語文本進行微調，能夠更好地理解印尼語的語言特點。

高效推理

基於BERT架構，能夠在合理時間內完成文本分類任務。

模型能力

印尼語文本情感分析

二元情感分類（正面/負面）

自然語言處理

使用案例

社交媒體分析

評論情感分析

分析社交媒體上用戶評論的情感傾向

準確識別正面和負面評論

客戶反饋分析

產品評價分類

自動分類電商平臺上的產品評價

幫助企業快速瞭解客戶滿意度

🚀 基於BERT的印尼語情感分析模型

本倉庫包含一個經過微調的BERT模型，用於進行情感分析。該模型經過訓練，可將文本分為兩種情感類別：0（負面）和1（正面）。以下是該模型的性能和訓練細節總結。

🚀 快速開始

安裝依賴

確保你已經安裝了必要的庫：

pip install transformers torch

加載模型

你可以使用transformers庫加載經過微調的BERT模型：

from transformers import BertForSequenceClassification, BertTokenizer
## 加載經過微調的模型和分詞器
model = BertForSequenceClassification.from_pretrained("path_to_model")
tokenizer = BertTokenizer.from_pretrained("path_to_tokenizer")

預處理和預測

對你的輸入文本進行預處理並進行預測：

# prompt: use this model to predict a sentence with output sentiment negatif or positif

from transformers import BertTokenizer, BertForSequenceClassification
import torch

# 加載保存的模型和分詞器
model_path = 'bibrani/bert-sentiment-analisis-indo'
tokenizer = BertTokenizer.from_pretrained(model_path)
model = BertForSequenceClassification.from_pretrained(model_path)

# 設置設備
device = torch.device("cuda" if torch.cuda.is_available() else "cpu")
model.to(device)
print(device)

def predict_sentiment(text):
    """預測給定文本的情感。

    參數:
        text (str): 輸入文本。

    返回:
        str: "Negative sentiment" 或 "Positive sentiment"。
    """
    # 對輸入文本進行分詞
    inputs = tokenizer(text, padding="max_length", truncation=True, max_length=512, return_tensors="pt")

    # 將輸入移動到設備上
    input_ids = inputs.input_ids.to(device)
    attention_mask = inputs.attention_mask.to(device)

    # 進行推理
    with torch.no_grad():
        outputs = model(input_ids, attention_mask=attention_mask)
        logits = outputs.logits

    # 獲取預測的類別
    predicted_class = torch.argmax(logits, dim=1).item()

    if predicted_class == 0:
        return "Negative sentiment", inputs
    else:
        return "Positive sentiment", inputs

# 示例用法
text_to_predict = "jadi cerita nya saya sedang ingin makan spaghetti dengan meatball yang kalau menurut ekspektasi saya adalah bakso yang terbuat dari cingcang yang biasa digunakan di menu pasta , setelah sampai , ternyata bakso yang digunakan adalah bakso olahan yang biasa dipakai di tukang bakso , bahkan bentuk nya tidak bulat"
sentiment = predict_sentiment(text_to_predict)
print(f"Text: {text_to_predict}")
print(f"Sentiment: {sentiment}")

✨ 主要特性

該模型能夠對印尼語文本進行情感分析，將其分類為積極或消極情感，在評估數據集上取得了較好的性能。

📦 安裝指南

確保你已經安裝了必要的庫：

pip install transformers torch

💻 使用示例

基礎用法

from transformers import BertForSequenceClassification, BertTokenizer
## 加載經過微調的模型和分詞器
model = BertForSequenceClassification.from_pretrained("path_to_model")
tokenizer = BertTokenizer.from_pretrained("path_to_tokenizer")

高級用法

# prompt: use this model to predict a sentence with output sentiment negatif or positif

from transformers import BertTokenizer, BertForSequenceClassification
import torch

# 加載保存的模型和分詞器
model_path = 'bibrani/bert-sentiment-analisis-indo'
tokenizer = BertTokenizer.from_pretrained(model_path)
model = BertForSequenceClassification.from_pretrained(model_path)

# 設置設備
device = torch.device("cuda" if torch.cuda.is_available() else "cpu")
model.to(device)
print(device)

def predict_sentiment(text):
    """預測給定文本的情感。

    參數:
        text (str): 輸入文本。

    返回:
        str: "Negative sentiment" 或 "Positive sentiment"。
    """
    # 對輸入文本進行分詞
    inputs = tokenizer(text, padding="max_length", truncation=True, max_length=512, return_tensors="pt")

    # 將輸入移動到設備上
    input_ids = inputs.input_ids.to(device)
    attention_mask = inputs.attention_mask.to(device)

    # 進行推理
    with torch.no_grad():
        outputs = model(input_ids, attention_mask=attention_mask)
        logits = outputs.logits

    # 獲取預測的類別
    predicted_class = torch.argmax(logits, dim=1).item()

    if predicted_class == 0:
        return "Negative sentiment", inputs
    else:
        return "Positive sentiment", inputs

# 示例用法
text_to_predict = "jadi cerita nya saya sedang ingin makan spaghetti dengan meatball yang kalau menurut ekspektasi saya adalah bakso yang terbuat dari cingcang yang biasa digunakan di menu pasta , setelah sampai , ternyata bakso yang digunakan adalah bakso olahan yang biasa dipakai di tukang bakso , bahkan bentuk nya tidak bulat"
sentiment = predict_sentiment(text_to_predict)
print(f"Text: {text_to_predict}")
print(f"Sentiment: {sentiment}")