KeywordGen-v2開源關鍵詞生成模型 - 免費部署助力產品評論分析

首頁

Keywordgen V2

由mrutyunjay-patil開發

KeywordGen-v2是基於T5的模型，專門用於從文本中生成關鍵詞，特別適用於產品評論分析。

文本生成

Transformers

支持多種語言開源協議:Apache-2.0 #產品評論關鍵詞提取 #T5微調 #電商文本分析

下載量 83

發布時間 : 8/6/2023

模型概述

該模型通過微調T5基礎模型，能夠從輸入文本中提取2到8個單詞的關鍵詞，尤其擅長處理產品評論，幫助用戶快速獲取文本的核心主題。

模型特點

產品評論優化

特別針對產品評論進行優化，能夠高效提取關鍵點或主題。

多關鍵詞生成

能夠生成2到8個單詞的關鍵詞，覆蓋文本的多個核心主題。

前綴優化

通過在輸入前添加'Keyword: '前綴，顯著提升生成效果。

模型能力

文本生成

關鍵詞提取

產品評論分析

使用案例

產品評論分析

電子產品評論關鍵詞提取

從電子產品評論中提取關鍵詞，幫助快速瞭解用戶反饋的核心點。

生成如'屏幕色彩鮮豔'、'電池續航出色'等關鍵詞。

多語言評論分析

支持從英文評論中提取關鍵詞，適用於國際化產品分析。

生成與評論內容高度相關的英文關鍵詞。

🚀 KeywordGen-v2模型

KeywordGen-v2是一款基於T5架構微調的模型，專門用於從文本中提取關鍵詞。輸入一段文本，模型就能輸出與之相關的關鍵詞，為信息提取提供便利。

🚀 快速開始

你可以直接使用文本生成管道來調用這個模型。使用時，為了獲得最佳效果，請在輸入文本前加上 "Keyword: "。

以下是使用Hugging Face Transformers庫在Python中調用該模型的示例：

💻 使用示例

基礎用法

from transformers import T5Tokenizer, T5ForConditionalGeneration

# Initialize the tokenizer and model
tokenizer = T5Tokenizer.from_pretrained("mrutyunjay-patil/keywordGen-v2")
model = T5ForConditionalGeneration.from_pretrained("mrutyunjay-patil/keywordGen-v2")

# Define your input sequence, prefixing with "Keyword: "
input_sequence = "Keyword: I purchased the new Android smartphone last week and I've been thoroughly impressed. The display is incredibly vibrant and sharp, and the battery life is surprisingly good, easily lasting a full day with heavy usage."

# Encode the input sequence
input_ids = tokenizer.encode(input_sequence, return_tensors="pt")

# Generate output
outputs = model.generate(input_ids)
output_sequence = tokenizer.decode(outputs[0], skip_special_tokens=True)

print(output_sequence)

高級用法

from transformers import T5Tokenizer, T5ForConditionalGeneration

# Initialize the tokenizer and model
tokenizer = T5Tokenizer.from_pretrained("mrutyunjay-patil/keywordGen-v2")
model = T5ForConditionalGeneration.from_pretrained("mrutyunjay-patil/keywordGen-v2")

# Define the prefix
task_prefix = "Keyword: "

# Define your list of input sequences
inputs = [
    "Absolutely love this tablet. It has a clear, sharp screen and runs apps smoothly without any hiccups.",
    "The headphones are fantastic with great sound quality, but the build quality could be better.",
    "Bought this smartwatch last week, and I'm thrilled with its performance. Battery life is impressive.",
    "This laptop exceeded my expectations. Excellent speed, plenty of storage, and light weight. Perfect for my needs.",
    "The camera quality on this phone is exceptional. It captures detailed and vibrant photos. However, battery life is not the best."
]

# Loop through each input and generate keywords
for sample in inputs:
    input_sequence = task_prefix + sample
    input_ids = tokenizer.encode(input_sequence, return_tensors="pt")
    outputs = model.generate(input_ids)
    output_sequence = tokenizer.decode(outputs[0], skip_special_tokens=True)
    print(sample, "\n --->", output_sequence)

✨ 主要特性

針對性微調：該模型 "KeywordGen-v2" 是 "KeywordGen" 系列的第二代產品，基於T5基礎模型進行微調，專注於從文本輸入中生成關鍵詞，尤其擅長處理產品評論。
輸出規範：模型輸出的關鍵詞長度在2到8個單詞之間，能有效提取產品評論中的關鍵點或主題。
輸入要求：當輸入文本至少有2 - 3個句子時，模型性能更佳。