Splade-Disco-Human-Mistral Open-source Conversational Search Model - Enhancing Semantic Understanding of Multi-turn Dialog Queries

Splade Disco Human Mistral

Developed by slupart

A conversational search model built on SPLADE++ and optimized for understanding multi-turn dialogue queries through multi-teacher distillation.

Text Embedding

PyTorch

English #Conversational Search #Multi-turn Retrieval Optimization #Knowledge Distillation

Downloads 27

Release Time: 4/17/2025

Model Overview

This is a sparse retrieval model for conversational search. It keeps the original SPLADE++ document encoder and fine-tunes the query encoder on the QReCC dataset so that it can handle multi-turn conversational queries.

Model Features

Multi-teacher Knowledge Distillation

Distills from two teachers, human query rewrites and rewrites generated by the Mistral large language model, to improve the understanding of conversational queries

Dialogue History Processing

Accepts the dialogue history as a single flattened input sequence, with turns joined by [SEP] separators

Asymmetric Architecture

The query encoder and document encoder are separate models and can be used independently, so different query and document representation models can be combined

Model Capabilities

Conversational Query Understanding

Multi-turn Context Retrieval

Sparse Vector Generation

Semantic Expansion Retrieval

Use Cases

Conversational Search Systems

Multi-turn Q&A Systems

Handles continuous Q&A scenarios with contextual dependencies

Better understands dialogue context compared to traditional retrieval models

Customer Service Bots

Provides accurate knowledge base retrieval based on dialogue history

Reduces the need for users to repeatedly explain their needs

🚀 DiSCo: LLM Knowledge Distillation for Efficient Sparse Retrieval in Conversational Search

DiSCo is a conversational search model that adapts SPLADE++ by finetuning the query encoder on QReCC. It uses knowledge distillation from multiple teachers to better capture conversational query semantics.

🚀 Quick Start

This model is a conversational search adaptation of the original SPLADE++ (CoCondenser-EnsembleDistil) model. It retains the original document encoder and finetunes the query encoder on QReCC, a dataset designed for multi-turn conversational search.

Training is performed via distillation from multiple teachers: human and Mistral rewrites, allowing the model to better capture the semantics of conversational queries. For more details, see the original paper:

DiSCo SPLADE - SIGIR 2025 full paper: https://arxiv.org/abs/2410.14609
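
As a rough illustration of what score-level distillation from several teachers can look like, the sketch below computes a KL-divergence loss between a student's score distribution over candidate passages and the average of the teachers' distributions. Everything in it is an assumption made for illustration: the helper names (`softmax`, `multi_teacher_kl`), averaging as the way to combine teachers, and the toy scores. The paper linked above describes the actual training objective.

```python
import math

def softmax(scores):
    """Turn raw relevance scores into a probability distribution."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def multi_teacher_kl(student_scores, teacher_scores_list):
    """KL(target || student), with target = mean of the teachers' distributions.

    Hypothetical helper: averaging the teachers is an assumption,
    not necessarily the objective used to train this checkpoint.
    """
    teacher_dists = [softmax(t) for t in teacher_scores_list]
    n = len(teacher_dists)
    target = [sum(d[i] for d in teacher_dists) / n
              for i in range(len(student_scores))]
    student = softmax(student_scores)
    return sum(p * math.log(p / q) for p, q in zip(target, student) if p > 0)

# Toy scores for three candidate passages (made-up numbers).
human_teacher = [4.0, 1.0, 0.5]    # assumed: scores obtained with the human rewrite
mistral_teacher = [3.5, 1.5, 0.2]  # assumed: scores obtained with the Mistral rewrite
student = [2.0, 2.0, 1.0]          # assumed: scores from the conversational query encoder

loss = multi_teacher_kl(student, [human_teacher, mistral_teacher])
print(f"distillation loss: {loss:.4f}")
```

The loss is zero when the student's ranking distribution matches the combined teacher distribution and grows as they diverge.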

⚠️ Important Note

This checkpoint is the query encoder only. For inference, you also need the corresponding document encoder, which is unchanged from the original SPLADE++ checkpoint. SPLADE can use an asymmetric architecture, with separate models for query and document representation.
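
To make the asymmetric setup concrete, the sketch below ranks passages against a query when both are represented as sparse term-weight mappings, scored with a dot product over shared vocabulary terms, as is typical for SPLADE-style retrieval. The dictionaries, weights, and the `sparse_dot` helper are hypothetical; in practice the query vector would come from this checkpoint and the document vectors from the original SPLADE++ document encoder.

```python
def sparse_dot(query_vec, doc_vec):
    """Dot product of two sparse vectors stored as {term: weight} dicts."""
    # Iterate over the smaller dict so fewer lookups are needed.
    small, large = sorted((query_vec, doc_vec), key=len)
    return sum(w * large[t] for t, w in small.items() if t in large)

# Made-up sparse representations (term -> weight).
query_vec = {"sunglasses": 1.8, "cost": 1.2, "price": 0.9}
docs = {
    "d1": {"sunglasses": 1.5, "price": 1.1, "optician": 0.7},
    "d2": {"weather": 1.3, "sunny": 1.0},
}

# Rank documents by descending score.
ranking = sorted(docs, key=lambda d: sparse_dot(query_vec, docs[d]), reverse=True)
print(ranking)
```

Because only terms present in both vectors contribute, this kind of scoring maps naturally onto an inverted index, which is one reason the document side can stay fixed while the query encoder is swapped.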

💻 Usage Examples

Basic Usage

Please refer to the DiSCo GitHub repository for complete usage instructions.

The input is a flattened version of the conversation history, with the current question first, followed by earlier turns from most recent to oldest:

q_n [SEP] a_{n-1} [SEP] q_{n-1} [SEP] ... [SEP] a_0 [SEP] q_0

Below is an example script for encoding a conversation:

from transformers import AutoTokenizer, AutoModelForMaskedLM
import torch.nn.functional as F
import torch

# Load the fine-tuned query encoder and its tokenizer.
model = AutoModelForMaskedLM.from_pretrained("slupart/splade-disco-human-mistral")
tokenizer = AutoTokenizer.from_pretrained("slupart/splade-disco-human-mistral")
model.eval()

# Conversation as (question, answer) turns, oldest first.
# The last turn is the current question and has no answer yet.
conv = [
    ("what's the weather like today?", "it's sunny."),
    ("should I wear sunscreen?", "yes, UV index is high."),
    ("do I need sunglasses?", "definitely."),
    ("where can I buy sunglasses?", "try the optician nearby."),
    ("how much do they cost?", None)
]

# Flatten: current question first, then earlier turns from newest to oldest,
# each contributing its answer followed by its question.
parts = [conv[-1][0]] + [x for q, a in reversed(conv[:-1]) for x in (a, q) if x]
text = " [SEP] ".join(parts)

inputs = tokenizer(text, return_tensors="pt")
with torch.no_grad():
    logits = model(**inputs).logits  # shape: (1, seq_len, vocab_size)
# ReLU, then max-pool over the sequence dimension: one weight per vocabulary term.
sparse = F.relu(logits).max(1).values.squeeze(0)

# Print the activated vocabulary terms, highest weight first.
scores = [(tokenizer.convert_ids_to_tokens([i.item()])[0], sparse[i].item())
          for i in torch.nonzero(sparse).squeeze(1)]
for token, score in sorted(scores, key=lambda x: -x[1]):
    print(f"Token: {token:15} | Score: {score:.4f}")
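
The pooling step in the script can be read as: clip negative logits to zero, then keep, for each vocabulary entry, the largest value seen at any token position. The sketch below reproduces that in plain Python on a made-up 3×4 logit matrix. The `splade_pool` helper and its `log_saturation` option are additions for illustration and are not part of the script: the SPLADE papers describe applying log(1 + ReLU(x)) before pooling, which leaves the set of activated terms unchanged but compresses the weights.

```python
import math

def splade_pool(logits, log_saturation=False):
    """Max-pool ReLU-activated logits over token positions.

    logits: list of rows, one per input token, each of length vocab_size.
    Returns one non-negative weight per vocabulary entry.
    """
    vocab_size = len(logits[0])
    pooled = []
    for j in range(vocab_size):
        # ReLU on each position, then max over positions.
        best = max(max(row[j], 0.0) for row in logits)
        # log1p is monotonic, so applying it after the max gives the same
        # result as applying it to every position before pooling.
        pooled.append(math.log1p(best) if log_saturation else best)
    return pooled

# Made-up logits: 3 token positions, vocabulary of 4 terms.
toy_logits = [
    [-1.0, 2.0, 0.5, -3.0],
    [0.3, -0.5, 1.5, -2.0],
    [-0.2, 1.0, -1.0, -0.1],
]
weights = splade_pool(toy_logits)
print(weights)  # [0.3, 2.0, 1.5, 0.0]
```

The last vocabulary entry never receives a positive logit, so its weight is exactly zero; this is what makes the resulting vector sparse.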

📄 License

This model is released under the CC BY-NC-SA 4.0 license.

📚 Documentation

If you use our checkpoint, please cite our work:

@article{lupart2024disco,
  title={DiSCo Meets LLMs: A Unified Approach for Sparse Retrieval and Contextual Distillation in Conversational Search},
  author={Lupart, Simon and Aliannejadi, Mohammad and Kanoulas, Evangelos},
  journal={arXiv preprint arXiv:2410.14609},
  year={2024}
}

Property	Details
Model Type	A conversational search adaptation of the original SPLADE++ (CoCondenser-EnsembleDistil) model
Training Data	QReCC dataset for multi-turn conversational search, with distillation from human and Mistral rewrites
Pipeline Tag	fill-mask
Tags	splade, conversational-search, multi-turn retrieval, query-expansion, document-expansion, passage-retrieval, knowledge-distillation
