extended-mind-mpt-7b開源模型 - 支持外部記憶庫檢索，擴展思維必備工具

首頁

Extended Mind Mpt 7b

由normalcomputing開發

基於Mosaic ML架構調整的擴展思維Transformer模型，支持外部記憶庫檢索與關注功能

大型語言模型

Transformers

#外部記憶檢索 #無微調知識增強 #動態記憶更新

下載量 111

發布時間 : 10/20/2023

模型概述

該模型實現了論文中描述的擴展思維方法，能夠檢索並關注外部鍵值對存儲（記憶庫），無需微調即可使用原始模型權重

模型特點

外部記憶庫集成

支持通過token id序列傳遞外部記憶庫，自動完成記憶生成與緩存

動態記憶更新

可通過clear_memories()方法和memory_ids屬性即時更新記憶內容

引用檢索功能

可輸出生成過程中調用的具體記憶索引，增強結果可解釋性

靈活配置

提供記憶類型選擇、相似度屏蔽、特殊token處理等多種參數配置

模型能力

文本生成

外部記憶檢索

上下文感知推理

多輪對話支持

使用案例

知識問答

基於外部知識的問答

通過注入維基百科等外部知識庫，回答需要專業領域知識的問題

示例顯示能準確回答關於數學家格羅滕迪克的入籍時間等具體問題

研究輔助

學術文獻分析

通過注入論文摘要等學術內容，輔助進行文獻綜述和知識關聯

🚀 Extended-Mind-Mpt-7b模型卡片

Extended-Mind-Mpt-7b模型屬於Extended Mind Transformers集合的一部分，實現了我們在論文中描述的方法。該模型能夠檢索並關注外部的鍵值對緩存（即記憶），且未經過微調（原始模型權重未被修改）。

Github：https://github.com/normal-computing/extended-mind-transformers/
ArXiv：https://arxiv.org/abs/2406.02332

原架構和代碼作者：Mosaic ML 開發者：Normal Computing，基於Mosacic ML進行適配 許可證：Apache 2.0

🚀 快速開始

本模型可按以下步驟使用，通過傳入外部記憶，實現特定的文本生成功能。

✨ 主要特性

外部記憶支持：能夠接收並利用外部記憶進行文本生成。
記憶檢索與關注：在生成文本時自動檢索並關注外部記憶。
可配置參數豐富：提供多個參數用於調整模型的行為。

📦 安裝指南

文檔未提及安裝步驟，故跳過此章節。

💻 使用示例

基礎用法

向模型傳遞外部記憶非常簡單。只需在實例化模型時將令牌ID傳遞給模型即可，如以下示例所示。在首次調用model.generate()時，模型會在內部處理記憶的生成和緩存。你可以使用以下命令序列更新記憶：

model.clear_memories()
model.memory_ids = list_of_new_token_ids

設置trust_remote_code=True以避免警告。將記憶作為令牌ID列表傳遞給模型。

from transformers import AutoModelForCausalLM, AutoTokenizer

ag_wiki_entry = """Alexander Grothendieck (/ˈɡroʊtəndiːk/; German pronunciation: [ˌalɛˈksandɐ ˈɡʁoːtn̩ˌdiːk] (listen); French: [ɡʁɔtɛndik]; 28 March 1928 – 13 November 2014) was a stateless (and then, since 1971, French) mathematician who became the leading figure in the creation of modern algebraic geometry.[7][8] His research extended the scope of the field and added elements of commutative algebra, homological algebra, sheaf theory, and category theory to its foundations, while his so-called "relative" perspective led to revolutionary advances in many areas of pure mathematics.[7][9] He is considered by many to be the greatest mathematician of the twentieth century.[10][11]"""

tokenizer_hf = AutoTokenizer.from_pretrained("normalcomputing/extended-mind-mpt-7b")
memories = tokenizer_hf(ag_wiki_entry).input_ids

model_hf = AutoModelForCausalLM.from_pretrained("normalcomputing/extended-mind-mpt-7b", external_memories=memories, trust_remote_code=True)

之後，你可以像往常一樣使用模型生成文本。模型在生成過程中會自動使用記憶。你可以通過向model.generate()方法傳遞新值來更新任何配置參數（如下例中設置了topk）：

inputs = "When did Alexander Grothendieck become a French citizen?"
inputs = tokenizer(inputs, return_tensors="pt").input_ids

outputs = model.generate(inputs, max_length=40, topk=2)
tokenizer.decode(outputs_hf['sequences'][0], skip_special_tokens=True)

高級用法

引用

通過在model.generate()方法中簡單設置output_retrieved_memory_idx=True，你可以檢索生成過程中使用的記憶索引。我們在演示筆記本中詳細介紹了一個示例。

額外配置

LongLLaMA還有其他幾個參數：

屬性	詳情
`memory_type`	字符串類型，可選，默認為`manual`，表示是手動存儲外部記憶還是存儲在向量數據庫中。
`mask_by_sim`	布爾類型，可選，默認為`True`，表示是否根據相似度屏蔽檢索到的記憶。
`sim_threshold`	浮點類型，可選，默認為`0.25`，是屏蔽檢索到的記憶的閾值。
`tokenizer_all_special_ids`	列表類型，可選，默認為`[0, 50278]`，是要從記憶中移除的特殊令牌的ID。
`remove_special_tokens`	布爾類型，可選，默認為`True`，表示是否移除與分詞器特殊ID對應的記憶。