🚀 Extended-Mind-MPT-7b Model Card
This model is part of the Extended Mind Transformers collection and implements the methods described in our paper. It retrieves and attends to an external cache of key-value pairs (or memories); the original model weights have not been edited.
Original architecture and code by MosaicML.
- Developed by: Normal Computing, adapted from MosaicML
- License: Apache 2.0
🚀 Quick Start
Load the model with `trust_remote_code=True`, pass your external memories as a list of token ids, and generate as usual. Complete snippets are given in the Usage Examples section below.
✨ Features
- This model is part of the Extended Mind Transformers collection.
- It can retrieve and attend to an external cache of key-value pairs (memories).
- The original model weights remain unedited.
📦 Installation
The model loads through the Hugging Face `transformers` library with `trust_remote_code=True`; beyond `transformers` and `torch`, no additional packages should be required.
💻 Usage Examples
Basic Usage
External Memory
Passing external memories to the model is straightforward: pass the token ids to the model during instantiation, as shown in the examples below. Generating and caching the memories is handled internally during the first `model.generate()` call. You can update the memories using the following commands:
```python
model.clear_memories()
model.memory_ids = list_of_new_token_ids
```
Set `trust_remote_code=True` to avoid warnings. Pass the memories to the model as a list of token ids.
```python
from transformers import AutoModelForCausalLM, AutoTokenizer

ag_wiki_entry = """Alexander Grothendieck (/ˈɡroʊtəndiːk/; German pronunciation: [ˌalɛˈksandɐ ˈɡʁoːtn̩ˌdiːk] (listen); French: [ɡʁɔtɛndik]; 28 March 1928 – 13 November 2014) was a stateless (and then, since 1971, French) mathematician who became the leading figure in the creation of modern algebraic geometry.[7][8] His research extended the scope of the field and added elements of commutative algebra, homological algebra, sheaf theory, and category theory to its foundations, while his so-called "relative" perspective led to revolutionary advances in many areas of pure mathematics.[7][9] He is considered by many to be the greatest mathematician of the twentieth century.[10][11]"""

tokenizer = AutoTokenizer.from_pretrained("normalcomputing/extended-mind-mpt-7b")
memories = tokenizer(ag_wiki_entry).input_ids

model = AutoModelForCausalLM.from_pretrained("normalcomputing/extended-mind-mpt-7b", external_memories=memories, trust_remote_code=True)
```
After this, you can generate text with the model as usual, and the model will automatically use the memories during generation. You can update any config parameters (we set `topk` below) by passing new values to the `model.generate()` method.
inputs = "When did Alexander Grothendieck become a French citizen?"
inputs = tokenizer(inputs, return_tensors="pt").input_ids
outputs = model.generate(inputs, max_length=40, topk=2)
tokenizer.decode(outputs_hf['sequences'][0], skip_special_tokens=True)
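To condition generation on a different document, reset the cache and assign new memories before generating again. The snippet below is a minimal sketch using the update commands shown above; the replacement text and question are purely illustrative.

```python
# Swap in a new external memory (illustrative text), then generate again.
new_entry = "Some other reference text to use as external memory."
model.clear_memories()
model.memory_ids = tokenizer(new_entry).input_ids

inputs = tokenizer("A question about the new document?", return_tensors="pt").input_ids
outputs = model.generate(inputs, max_length=40, topk=2)
```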
Advanced Usage
Citations
By simply setting `output_retrieved_memory_idx=True` in the `model.generate()` method, you can retrieve the memory indices used during generation. We walk through an example in the demo notebook.
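As a rough sketch, the flag is passed like any other generation parameter. The exact name of the field holding the retrieved indices in the returned output is an assumption here; the demo notebook is authoritative.

```python
# Ask the model to also report which memory indices it attended to.
outputs = model.generate(
    inputs,
    max_length=40,
    topk=2,
    output_retrieved_memory_idx=True,
)
# Hypothetical key name; see the demo notebook for the exact output structure.
retrieved_idx = outputs.get('retrieved_memory_idx')
```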
Additional configuration
The extended-mind models accept several other configuration parameters:
| Property | Details |
|----------|---------|
| `memory_type` | (`string`, optional, defaults to `manual`): Whether to store external memories manually or in a vector database. |
| `mask_by_sim` | (`bool`, optional, defaults to `True`): Whether or not to mask retrieved memories by similarity. |
| `sim_threshold` | (`float`, optional, defaults to `0.25`): Threshold for masking retrieved memories. |
| `tokenizer_all_special_ids` | (`list`, optional, defaults to `[0, 50278]`): Ids for special tokens to remove from memories. |
| `remove_special_tokens` | (`bool`, optional, defaults to `True`): Remove memories that correspond to tokenizer special ids. |
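These parameters can be overridden at generation time in the same way as `topk` above. A minimal sketch; the values shown are simply the documented defaults.

```python
# Override retrieval behaviour for a single generate() call
# (values shown are the documented defaults).
outputs = model.generate(
    inputs,
    max_length=40,
    topk=2,
    mask_by_sim=True,
    sim_threshold=0.25,
)
```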
Additionally, the stride used to compute the memory representations can be set within the `generate_cache()` method. Smaller strides produce higher-quality representations, while larger strides require less computation.
📚 Documentation
The model's basic and advanced usage, as well as its additional configuration parameters, are described in the "Usage Examples" section.
🔧 Technical Details
At inference time, each query retrieves the most similar key-value pairs (memories) from the external cache and attends to them alongside the local context. The number of retrieved memories is controlled by `topk`, and retrieved memories can optionally be masked by similarity (`mask_by_sim`, `sim_threshold`). The original model weights are not modified.
📄 License
This model is licensed under the Apache 2.0 license.
Limitations
This model is part of ongoing research at Normal Computing.