開源intent-classifier意圖分類模型 - 免費部署快速將客戶問題歸類

首頁

Intent Classifier

由Serj開發

基於Flan-T5-Base微調的意圖分類模型，用於將客戶問題歸類到預定義類別

文本分類

Transformers

#動態意圖分類 #多領域適配 #小樣本微調

下載量 364

發布時間 : 4/2/2024

模型概述

該模型通過使用合成數據對T5模型進行微調，能夠動態地將客戶請求分類到預定義的主題類別中，適用於客戶服務場景的意圖識別。

模型特點

動態分類

通過將所有類別添加到提示中，實現動態意圖分類

多場景適用

支持不同業務場景（如披薩餐廳、在線銀行等）的意圖分類

小樣本微調

在少量樣本（每類10-20個）上微調即可獲得良好性能

模型能力

客戶意圖識別

主題分類

文本分類

客戶服務自動化

使用案例

客戶服務

退款請求處理

自動識別客戶關於退款請求的意圖

準確分類到'退款請求'類別

訂閱管理

識別客戶取消或恢復訂閱的請求

準確分類到'取消訂閱'或'恢復訂閱'類別

在線服務

銀行服務諮詢

分類客戶關於在線銀行服務的諮詢問題

🚀 意圖分類模型

本模型可對客戶請求進行意圖分類，通過微調T5模型，利用包含合成數據的提示，能動態地將客戶請求分類到預定義的類別中。

🚀 快速開始

模型使用示例

class IntentClassifier:
    def __init__(self, model_name="serj/intent-classifier", device="cuda"):
        self.model = T5ForConditionalGeneration.from_pretrained(model_name).to(device)
        self.tokenizer = T5Tokenizer.from_pretrained(model_name)
        self.device = device


def build_prompt(text, prompt="", company_name="", company_specific=""):
    if company_name == "Pizza Mia":
        company_specific = "This company is a pizzeria place."
    if company_name == "Online Banking":
        company_specific = "This company is an online banking."

    return f"Company name: {company_name} is doing: {company_specific}\nCustomer: {text}.\nEND MESSAGE\nChoose one topic that matches customer's issue.\n{prompt}\nClass name: "


def predict(self, text, prompt_options, company_name, company_portion) -> str:
    input_text = build_prompt(text, prompt_options, company_name, company_portion)
    # print(input_text)
    # Tokenize the concatenated inp_ut text
    input_ids = self.tokenizer.encode(input_text, return_tensors="pt", max_length=512, truncation=True).to(self.device)

    # Generate the output
    output = self.model.generate(input_ids)

    # Decode the output tokens
    decoded_output = self.tokenizer.decode(output[0], skip_special_tokens=True)

    return decoded_output


m = IntentClassifier("serj/intent-classifier")
print(m.predict("Hey, after recent changes, I want to cancel subscription, please help.",
                "OPTIONS:\n refund\n cancel subscription\n damaged item\n return item\n", "Company",
                "Products and subscriptions"))

提示結構說明

Topic %% Customer: text. END MESSAGE OPTIONS: each class separated by % Choose one topic that matches customer's issue. Class name:

你必須在文本末尾加上句號，否則會得到奇怪的結果，這是模型的訓練要求。

✨ 主要特性

本模型通過微調Flan - T5 - Base模型，利用包含合成數據的提示對客戶請求進行意圖分類，可動態地將客戶請求分類到預定義的類別中。

📦 安裝指南

文檔未提供具體安裝步驟，暫不展示。

📚 詳細文檔

模型詳情

模型描述

這是一個🤗 transformers模型的模型卡片，已推送到Hub，此模型卡片是自動生成的。

開發者：Serj Smorodinsky
模型類型：Flan - T5 - Base
語言（NLP）：[待補充更多信息]
許可證：[待補充更多信息]
微調基礎模型：Flan - T5 - Base

模型來源

倉庫地址：https://github.com/SerjSmor/intent_classification

訓練詳情

訓練數據

訓練數據倉庫：https://github.com/SerjSmor/intent_classification
未來將添加HF數據集。

訓練過程

訓練腳本地址：https://github.com/SerjSmor/intent_classification/blob/main/t5_generator_trainer.py
使用HF trainer進行訓練：

training_args = TrainingArguments(
    output_dir='./results',
    num_train_epochs=epochs,
    per_device_train_batch_size=batch_size,
    per_device_eval_batch_size=batch_size,
    warmup_steps=500,
    weight_decay=0.01,
    logging_dir='./logs',
    logging_steps=10,
    evaluation_strategy="epoch"
)

trainer = Trainer(
    model=model,
    args=training_args,
    train_dataset=train_dataset,
    eval_dataset=val_dataset,
    tokenizer=tokenizer,
    # compute_metrics=compute_metrics
)