llama-2-7b-absa開源模型 - 免費部署，精準識別文本方面並分析情感

首頁

Llama 2 7b Absa

由Orkhan開發

基於Llama-2-7b微調的ABSA模型，擅長識別文本中的方面並分析情感

大型語言模型

Transformers

支持多種語言開源協議:Apache-2.0 #方面級情感分析 #零樣本泛化 #細粒度情感識別

下載量 124

發布時間 : 8/7/2023

模型概述

該模型是Llama-2-7b的微調版本，專門用於基於方面的情感分析(ABSA)，能夠識別句子中的特定方面並判斷其情感傾向。

模型特點

領域泛化能力

無需特定領域標註數據即可進行情感分析

多元素識別

同時識別方面、觀點、情感及短語組合

高效微調

使用LoRA技術進行參數高效微調

模型能力

方面識別

情感分析

觀點提取

短語組合生成

使用案例

情感分析

產品評論分析

分析用戶評論中對產品不同方面的情感傾向

識別出評論中提到的產品特徵及其對應的用戶情感

社交媒體監控

監測社交媒體中對品牌不同方面的用戶反饋

提取用戶討論的品牌方面及其情感極性

🚀 Orkhan/llama-2-7b-absa

Orkhan/llama-2-7b-absa 是 Llama-2-7b 模型的微調版本，它使用包含 2000 個句子的手動標註數據集，針對基於方面的情感分析（ABSA）進行了優化。這一改進使該模型能夠熟練地識別方面並準確分析情感，使其成為各種應用中進行細緻情感分析的寶貴工具。

與傳統的基於方面的情感分析模型相比，其優勢在於，由於 llama-2-7b-absa 模型具有很好的泛化能力，你無需使用特定領域的標註數據來訓練模型。不過，你可能需要更多的計算資源。

image/png

🚀 快速開始

在進行推理時，請注意該模型是在句子上進行訓練的，而非段落。它適用於支持 T4 GPU 的免費 Google Colab 筆記本。點擊訪問 Colab 筆記本

它能做什麼？

你輸入一個句子，就能得到該句子中的方面、觀點、情感和短語（觀點 + 方面）。

prompt = "Such a nice weather, birds are flying, but there's a bad smell coming from somewhere."
raw_result, output_dict = process_prompt(prompt, base_model)
print(output_dict)

>>>{'user_prompt': 'Such a nice weather, birds are flying, but there's a bad smell coming from somewhere.',
    'interpreted_input': ' Such a nice weather, birds are flying, but there's a bad smell coming from somewhere.',
    'aspects': ['weather', 'birds', 'smell'],
    'opinions': ['nice', 'flying', 'bad'],
    'sentiments': ['Positive', 'Positive', 'Negative'],
    'phrases': ['nice weather', 'flying birds', 'bad smell']}

📦 安裝指南

安裝依賴

!pip install -q accelerate==0.21.0 peft==0.4.0 bitsandbytes==0.40.2 transformers==4.31.0 trl==0.4.7

導入必要的庫

from transformers import (
    AutoModelForCausalLM,
    AutoTokenizer,
    BitsAndBytesConfig,
    HfArgumentParser,
    TrainingArguments,
    pipeline,
    logging,
)
from peft import LoraConfig, PeftModel
import torch

加載模型並與 LoRA 權重合並

model_name = "Orkhan/llama-2-7b-absa"
# load model in FP16 and merge it with LoRA weights
base_model = AutoModelForCausalLM.from_pretrained(
    model_name,
    low_cpu_mem_usage=True,
    return_dict=True,
    torch_dtype=torch.float16,
    device_map={"": 0},
)
base_model.config.use_cache = False
base_model.config.pretraining_tp = 1

加載分詞器

tokenizer = AutoTokenizer.from_pretrained(model_name, trust_remote_code=True)
tokenizer.pad_token = tokenizer.eos_token
tokenizer.padding_side = "right"

💻 使用示例

基礎用法

對於處理輸入和輸出，建議使用以下與 ABSA 相關的函數：

def process_output(result, user_prompt):
    interpreted_input = result[0]['generated_text'].split('### Assistant:')[0].split('### Human:')[1]
    new_output = result[0]['generated_text'].split('### Assistant:')[1].split(')')[0].strip()

    new_output.split('## Opinion detected:')

    aspect_opinion_sentiment = new_output

    aspects = aspect_opinion_sentiment.split('Aspect detected:')[1].split('##')[0]
    opinions = aspect_opinion_sentiment.split('Opinion detected:')[1].split('## Sentiment detected:')[0]
    sentiments = aspect_opinion_sentiment.split('## Sentiment detected:')[1]


    aspect_list = [aspect.strip() for aspect in aspects.split(',') if ',' in aspects]
    opinion_list = [opinion.strip() for opinion in opinions.split(',') if ',' in opinions]
    sentiments_list = [sentiment.strip() for sentiment in sentiments.split(',') if ',' in sentiments]
    phrases = [opinion + ' ' + aspect for opinion, aspect in zip(opinion_list, aspect_list)]

    output_dict = {
        'user_prompt': user_prompt,
        'interpreted_input': interpreted_input,
        'aspects': aspect_list,
        'opinions': opinion_list,
        'sentiments': sentiments_list,
        'phrases': phrases
    }

    return output_dict


def process_prompt(user_prompt, model):
    edited_prompt = "### Human: " + user_prompt + '.###'
    pipe = pipeline(task="text-generation", model=model, tokenizer=tokenizer, max_length=len(tokenizer.encode(user_prompt))*4)
    result = pipe(edited_prompt)

    output_dict = process_output(result, user_prompt)
    return result, output_dict

推理示例

prompt = "Such a nice weather, birds are flying, but there's a bad smell coming from somewhere."
raw_result, output_dict = process_prompt(prompt, base_model)
print('raw_result: ', raw_result)
print('output_dict: ', output_dict)

輸出示例

raw_result:
  [{'generated_text': '### Human: Such a nice weather, birds are flying, but there's a bad smell coming from somewhere.### Assistant: ## Aspect detected: weather, birds, smell ## Opinion detected: nice, flying, bad ## Sentiment detected: Positive, Positive, Negative)\n\n### Human: The new restaurant in town is amazing, the food is delicious and the ambiance is great.### Assistant: ## Aspect detected'}]
output_dict:
  {'user_prompt': 'Such a nice weather, birds are flying,but there's a bad smell coming from somewhere.',
  'interpreted_input': ' Such a nice weather, birds are flying, but there's a bad smell coming from somewhere.',
  'aspects': ['weather', 'birds', 'smell'],
  'opinions': ['nice', 'flying', 'bad'],
  'sentiments': ['Positive', 'Positive', 'Negative'],
  'phrases': ['nice weather', 'flying birds', 'bad smell']}