t5-base-qa-summary-emotion開源模型 - 支持問答、摘要與情感檢測功能

首頁

T5 Base Qa Summary Emotion

由kiri-ai開發

基於T5架構的多功能模型，整合了問答系統、文本摘要和情感檢測功能，在多個數據集上進行了微調。

大型語言模型

Transformers

英語開源協議:Apache-2.0 #多任務問答 #上下文感知 #情感檢測

下載量 45

發布時間 : 3/2/2022

模型概述

該模型基於T5架構，通過CoQA、SQuAD 2、GoEmotions和CNN/DailyMail數據集微調，能夠執行問答、文本摘要和情感分析任務。

模型特點

多功能集成

單一模型同時支持問答、摘要生成和情感分析三種功能

對話式問答支持

能夠處理多輪對話上下文，理解前後問題關聯

多數據集微調

在CoQA、SQuAD 2等多個權威數據集上進行優化

模型能力

問答系統

文本摘要

情感檢測

對話理解

上下文關聯分析

使用案例

智能客服

多輪對話支持

處理用戶連續提問，理解問題上下文關聯

在SQuAD 2開發集上F1 79.5分，CoQA開發集F1 70.6分

內容分析

新聞摘要生成

自動生成新聞文章的關鍵摘要

用戶評論情感分析

識別文本中表達的情感傾向

🚀 T5 Base 模型：問答 + 摘要 + 情感分析

本模型融合了問答、文本摘要和情感檢測功能，在多個權威數據集上進行微調訓練，能為文本處理任務提供高效準確的解決方案。

🚀 快速開始

依賴項

需要 transformers>=4.0.0

✨ 主要特性

多任務支持：支持問答、文本摘要和情感檢測三種任務。
微調訓練：在 CoQa、Squad 2、GoEmotions 和 CNN/DailyMail 數據集上進行微調。
優秀表現：在 Squad 2 開發集上達到 F1 79.5 的分數，在 CoQa 開發集上達到 F1 70.6 的分數。

📦 安裝指南

確保安裝 transformers 庫，版本需大於等於 4.0.0：

pip install transformers>=4.0.0

💻 使用示例

基礎用法

問答任務

使用 Transformers 庫

from transformers import T5ForConditionalGeneration, T5Tokenizer
model = T5ForConditionalGeneration.from_pretrained("kiri-ai/t5-base-qa-summary-emotion")
tokenizer = T5Tokenizer.from_pretrained("kiri-ai/t5-base-qa-summary-emotion")

def get_answer(question, prev_qa, context):
    input_text = [f"q: {qa[0]} a: {qa[1]}" for qa in prev_qa]
    input_text.append(f"q: {question}")
    input_text.append(f"c: {context}")
    input_text = " ".join(input_text)
    features = tokenizer([input_text], return_tensors='pt')
    tokens = model.generate(input_ids=features['input_ids'], 
            attention_mask=features['attention_mask'], max_length=64)
    return tokenizer.decode(tokens[0], skip_special_tokens=True)

print(get_answer("Why is the moon yellow?", "I'm not entirely sure why the moon is yellow.")) # unknown

context = "Elon Musk left OpenAI to avoid possible future conflicts with his role as CEO of Tesla."

print(get_answer("Why not?", [("Does Elon Musk still work with OpenAI", "No")], context)) # to avoid possible future conflicts with his role as CEO of Tesla

使用 Kiri 庫

from kiri.models import T5QASummaryEmotion

context = "Elon Musk left OpenAI to avoid possible future conflicts with his role as CEO of Tesla."
prev_qa = [("Does Elon Musk still work with OpenAI", "No")]
model = T5QASummaryEmotion()

# Leave prev_qa blank for non conversational question-answering
model.qa("Why not?", context, prev_qa=prev_qa)
> "to avoid possible future conflicts with his role as CEO of Tesla"

文本摘要任務

使用 Transformers 庫

from transformers import T5ForConditionalGeneration, T5Tokenizer
model = T5ForConditionalGeneration.from_pretrained("kiri-ai/t5-base-qa-summary-emotion")
tokenizer = T5Tokenizer.from_pretrained("kiri-ai/t5-base-qa-summary-emotion")

def summary(context):
    input_text = f"summarize: {context}"
    features = tokenizer([input_text], return_tensors='pt')
    tokens = model.generate(input_ids=features['input_ids'], 
            attention_mask=features['attention_mask'], max_length=64)
    return tokenizer.decode(tokens[0], skip_special_tokens=True)

使用 Kiri 庫

from kiri.models import T5QASummaryEmotion

model = T5QASummaryEmotion()

model.summarise("Long text to summarise")
> "Short summary of long text"

情感檢測任務

使用 Transformers 庫

from transformers import T5ForConditionalGeneration, T5Tokenizer
model = T5ForConditionalGeneration.from_pretrained("kiri-ai/t5-base-qa-summary-emotion")
tokenizer = T5Tokenizer.from_pretrained("kiri-ai/t5-base-qa-summary-emotion")

def emotion(context):
    input_text = f"emotion: {context}"
    features = tokenizer([input_text], return_tensors='pt')
    tokens = model.generate(input_ids=features['input_ids'], 
            attention_mask=features['attention_mask'], max_length=64)
    return tokenizer.decode(tokens[0], skip_special_tokens=True)

使用 Kiri 庫

from kiri.models import T5QASummaryEmotion

model = T5QASummaryEmotion()

model.emotion("I hope this works!")
> "optimism"

📚 詳細文檔

描述

該模型在 CoQa、Squad 2、GoEmotions 和 CNN/DailyMail 數據集上進行了微調。

在 Squad 2 開發集上達到了 F1 79.5 的分數，在 CoQa 開發集上達到了 F1 70.6 的分數。

文本摘要和情感檢測功能尚未進行評估。

📄 許可證

本項目採用 Apache-2.0 許可證。

關於我們

Kiri 讓使用最先進的模型變得簡單、便捷且可擴展。

官網 | 自然語言引擎

📦 相關信息

屬性	詳情
模型類型	文本到文本生成
訓練數據	CoQa、Squad 2、GoEmotions、CNN/DailyMail
評估指標	F1

精選推薦AI模型

Llama 3 Typhoon V1.5x 8b Instruct

專為泰語設計的80億參數指令模型，性能媲美GPT-3.5-turbo，優化了應用場景、檢索增強生成、受限生成和推理任務

Cadet-Tiny是一個基於SODA數據集訓練的超小型對話模型，專為邊緣設備推理設計，體積僅為Cosmo-3B模型的2%左右。

Roberta Base Chinese Extractive Qa

基於RoBERTa架構的中文抽取式問答模型，適用於從給定文本中提取答案的任務。

智啟未來，您的人工智能解決方案智庫