# Hypencoder Model
Hypencoder is a hypernetwork-based model for information retrieval, offering efficient and effective text encoding and relevance scoring.
## Quick Start
This is the official model from the paper *Hypencoder: Hypernetworks for Information Retrieval*. To use this model, please refer to the GitHub repository, which contains the necessary code and details on how to run the model.
## Features
- Dual Encoder Architecture: A text encoder converts text into vectors, and a Hypencoder converts queries into small neural networks (q-nets) used for relevance scoring.
- Multiple Model Variants: Four models are available, differing only in the number of hidden layers in their q-nets.
## Installation
Installation instructions are available in the GitHub repository linked above.
## Usage Examples
### Basic Usage
```python
from hypencoder_cb.modeling.hypencoder import Hypencoder, HypencoderDualEncoder, TextEncoder
from transformers import AutoTokenizer

dual_encoder = HypencoderDualEncoder.from_pretrained("jfkback/hypencoder.6_layer")
tokenizer = AutoTokenizer.from_pretrained("jfkback/hypencoder.6_layer")

query_encoder: Hypencoder = dual_encoder.query_encoder
passage_encoder: TextEncoder = dual_encoder.passage_encoder

queries = [
    "how many states are there in india",
    "when do concussion symptoms appear",
]
passages = [
    "India has 28 states and 8 union territories.",
    "Concussion symptoms can appear immediately or up to 72 hours after the injury.",
]

query_inputs = tokenizer(queries, return_tensors="pt", padding=True, truncation=True)
passage_inputs = tokenizer(passages, return_tensors="pt", padding=True, truncation=True)

# The query encoder returns one q-net (a small neural network) per query;
# the passage encoder returns one 768-dimensional embedding per passage.
q_nets = query_encoder(input_ids=query_inputs["input_ids"], attention_mask=query_inputs["attention_mask"]).representation
passage_embeddings = passage_encoder(input_ids=passage_inputs["input_ids"], attention_mask=passage_inputs["attention_mask"]).representation

# Q-nets expect inputs of shape (num_queries, num_items_per_query, 768).
# Score one passage per query: (2, 768) -> (2, 1, 768); scores have shape (2, 1, 1).
passage_embeddings_single = passage_embeddings.unsqueeze(1)
scores = q_nets(passage_embeddings_single)

# Score both passages with every query: (2, 2, 768); scores have shape (2, 2, 1).
passage_embeddings_double = passage_embeddings.repeat(2, 1).reshape(2, 2, -1)
scores = q_nets(passage_embeddings_double)
```
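The returned `scores` tensor holds one relevance score per (query, passage) pair. As a minimal sketch of how it might be used for ranking, assuming higher scores indicate greater relevance:

```python
import torch

# scores: (num_queries, num_passages, 1) -> drop the trailing dim and sort
# each query's passages from most to least relevant.
ranked = torch.argsort(scores.squeeze(-1), dim=1, descending=True)
for qi, query in enumerate(queries):
    best = ranked[qi, 0].item()
    print(f"{query!r} -> best passage: {passages[best]!r}")
```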
## Documentation
### Model Details
This is a Hypencoder Dual Encoder. It contains two trunks: a text encoder and a Hypencoder. The text encoder converts items into 768-dimensional vectors, while the Hypencoder converts text into a small neural network that takes the 768-dimensional vector from the text encoder as input and outputs a relevance score.
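For intuition, the scoring step can be pictured as applying a query-specific multi-layer perceptron to the passage vector. The sketch below is illustrative only; the layer shapes, activation function, and weight layout are assumptions, not the exact released architecture:

```python
import torch

def apply_q_net(weights: list[torch.Tensor], biases: list[torch.Tensor],
                passage_embedding: torch.Tensor) -> torch.Tensor:
    # weights/biases are generated by the Hypencoder from the query text.
    # Hidden layers (the ReLU activation here is an assumption).
    h = passage_embedding
    for w, b in zip(weights[:-1], biases[:-1]):
        h = torch.relu(h @ w + b)
    # Final layer maps to a single scalar relevance score.
    return h @ weights[-1] + biases[-1]
```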
### Model Variants
We released the four models used in the paper. The models are identical except that the small neural networks, which we refer to as q-nets, have different numbers of hidden layers.
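Loading a different variant should only require changing the checkpoint name. The name below follows the pattern of the 6-layer model used above and should be checked against the actual released checkpoints:

```python
# Hypothetical checkpoint name patterned on "jfkback/hypencoder.6_layer";
# see the GitHub repository for the exact list of released models.
dual_encoder = HypencoderDualEncoder.from_pretrained("jfkback/hypencoder.2_layer")
```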
## License
The model is released under the MIT license.
## Citation
BibTeX:

```bibtex
@misc{killingback2025hypencoderhypernetworksinformationretrieval,
      title={Hypencoder: Hypernetworks for Information Retrieval},
      author={Julian Killingback and Hansi Zeng and Hamed Zamani},
      year={2025},
      eprint={2502.05364},
      archivePrefix={arXiv},
      primaryClass={cs.IR},
      url={https://arxiv.org/abs/2502.05364},
}
```
## Model Information

| Property | Details |
|----------|---------|
| Base Model | google-bert/bert-base-uncased |
| Datasets | microsoft/ms_marco |
| Language | en |
| Library Name | transformers |
| Pipeline Tag | feature-extraction |