Dense Encoder Distilbert Frozen Emb
Developed by vocab-transformers
A dense retrieval model based on the DistilBERT architecture, trained on the MS MARCO dataset with a frozen word embedding layer
Downloads: 26
Release Date: 4/5/2022
Model Overview
This model is a DistilBERT variant optimized for information retrieval tasks. It is trained with the MarginMSE loss function and produces dense vector representations of queries and documents.
Model Features
Frozen Word Embeddings Training
Keeps the pre-trained word embedding parameters unchanged during training, which can improve training stability (see the sketch after this list)
MarginMSE Optimization
Trained with the MarginMSE loss function, which directly optimizes ranking quality for retrieval tasks
Lightweight Architecture
Based on the DistilBERT architecture, which is smaller and faster than the original BERT model while maintaining good performance
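Both training-time features above can be reproduced with standard PyTorch tooling. The following sketch is illustrative only, assuming the distilbert-base-uncased checkpoint and dot-product scoring (neither is confirmed by this page): it freezes the word-embedding matrix and defines a MarginMSE loss that matches the student's positive-vs-negative score margin to a cross-encoder teacher's margin.

```python
import torch.nn.functional as F
from transformers import AutoModel

# Assumed base checkpoint; the actual base model is not stated on this page.
model = AutoModel.from_pretrained("distilbert-base-uncased")

# Frozen word embeddings: exclude the embedding matrix from gradient updates
# so it keeps its pre-trained values throughout training.
for param in model.embeddings.word_embeddings.parameters():
    param.requires_grad = False

def margin_mse_loss(q_emb, pos_emb, neg_emb, teacher_margin):
    """MarginMSE: match the student's score margin (dot-product score of the
    positive passage minus the negative passage) to the teacher's margin."""
    student_margin = (q_emb * pos_emb).sum(-1) - (q_emb * neg_emb).sum(-1)
    return F.mse_loss(student_margin, teacher_margin)
```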
Model Capabilities
Text Vector Representation
Semantic Similarity Calculation
Information Retrieval
Document Ranking
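As a concrete example of these capabilities, the snippet below encodes a query and two documents and scores them with a dot product, the scoring function typically paired with MarginMSE training. The model ID is inferred from the page title and should be verified on the hub.

```python
from sentence_transformers import SentenceTransformer, util

# Model ID inferred from the page title; verify before use.
model = SentenceTransformer("vocab-transformers/dense_encoder-distilbert-frozen_emb")

query_emb = model.encode("how do solar panels work", convert_to_tensor=True)
doc_embs = model.encode(
    [
        "Photovoltaic cells convert sunlight directly into electricity.",
        "The stock market closed higher on Friday.",
    ],
    convert_to_tensor=True,
)

# Dot-product scores: the first document should score clearly higher.
scores = util.dot_score(query_emb, doc_embs)
print(scores)
```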
Use Cases
Search Engines
Web Search Result Ranking
Generates dense vector representations of queries and documents that search engines can use for relevance ranking
Performs well in standard retrieval evaluations such as TREC-DL
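A minimal ranking sketch for this use case, assuming the model is published in sentence-transformers format (model ID inferred from the title) and using util.semantic_search as a stand-in for a production ANN index:

```python
from sentence_transformers import SentenceTransformer, util

model = SentenceTransformer("vocab-transformers/dense_encoder-distilbert-frozen_emb")  # assumed ID

pages = [
    "Official guide to hiking trails in the North Cascades.",
    "Quarterly earnings report for a retail chain.",
    "Trail conditions and difficulty ratings near Seattle.",
]
page_embs = model.encode(pages, convert_to_tensor=True)
query_emb = model.encode("best hiking trails near Seattle", convert_to_tensor=True)

# Rank every page for the query; dot product matches MarginMSE-style training.
hits = util.semantic_search(query_emb, page_embs, top_k=3, score_function=util.dot_score)[0]
for hit in hits:
    print(f'{hit["score"]:.2f}  {pages[hit["corpus_id"]]}')
```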
Question Answering Systems
Answer Passage Retrieval
Quickly retrieves passages related to questions from a large corpus of documents
Demonstrates stable performance on financial QA datasets like FiQA
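A sketch of the retrieve-then-read pattern under the same assumptions (model ID inferred from the title; the passages are made-up examples): the corpus is embedded once offline, and each incoming question is answered with a top-k dot-product lookup.

```python
import torch
from sentence_transformers import SentenceTransformer

model = SentenceTransformer("vocab-transformers/dense_encoder-distilbert-frozen_emb")  # assumed ID

# Offline step: embed the passage corpus once; a real system would store this
# matrix in an ANN index (e.g. FAISS) instead of keeping it in memory.
passages = [
    "Index funds track a market benchmark at low cost.",
    "The central bank raised interest rates by 25 basis points.",
    "Diversification spreads risk across many assets.",
]
passage_matrix = model.encode(passages, convert_to_tensor=True)

# Online step: embed the question and take the top-k passages by dot product.
question_emb = model.encode("What are index funds?", convert_to_tensor=True)
scores = passage_matrix @ question_emb
top = torch.topk(scores, k=2)
for score, idx in zip(top.values.tolist(), top.indices.tolist()):
    print(f"{score:.2f}  {passages[idx]}")
```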