
M2 BERT 2k Retrieval Encoder V1

Developed by hazyresearch
An 80M-parameter M2-BERT-2k model checkpoint designed for long-context retrieval tasks, supporting a context length of 2048 tokens.
Downloads: 80
Release Time: 5/22/2024

Model Overview

M2-BERT is an improved model based on the BERT architecture, optimized specifically for long-context retrieval. It generates 768-dimensional embedding vectors, making it suitable for scenarios such as information retrieval and semantic search.
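As a minimal sketch of how a fixed-size retrieval embedding is typically derived from per-token model outputs, the snippet below mean-pools token embeddings over non-padding positions and L2-normalizes the result. The pooling strategy and the random stand-in for model outputs are assumptions for illustration; the checkpoint's own inference code may pool differently.

```python
import numpy as np

def mean_pool(token_embeddings: np.ndarray, attention_mask: np.ndarray) -> np.ndarray:
    """Average token embeddings over non-padding positions,
    then L2-normalize to obtain one sentence-level embedding."""
    mask = attention_mask[:, None].astype(token_embeddings.dtype)  # (seq_len, 1)
    summed = (token_embeddings * mask).sum(axis=0)
    pooled = summed / mask.sum()
    return pooled / np.linalg.norm(pooled)

# Illustrative shapes only: 16 tokens, hidden size 768 (matching the
# model's 768-dimensional embedding output).
rng = np.random.default_rng(0)
tokens = rng.normal(size=(16, 768))   # stand-in for model token outputs
mask = np.ones(16, dtype=np.int64)    # no padding in this toy example
embedding = mean_pool(tokens, mask)
print(embedding.shape)  # (768,)
```

The resulting unit-length vector can be compared with other document embeddings via cosine similarity, which reduces to a dot product after normalization.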

Model Features

Long Context Support
Supports processing of long contexts up to 2048 tokens, ideal for long-document retrieval tasks
Efficient Retrieval Embeddings
Generates high-quality 768-dimensional embedding vectors optimized for retrieval performance
Lightweight Architecture
Lightweight design with only 80M parameters, reducing computational resource requirements while maintaining performance

Model Capabilities

Text Embedding Generation
Long Text Processing
Information Retrieval

Use Cases

Information Retrieval
Document Retrieval
Use model-generated embeddings to retrieve similar documents
Effectively handles documents up to 2048 tokens in length
Semantic Search
Content search system based on semantic similarity