B

Bi Encoder Msmarco Bert Base German

Developed by PM-AI
Semantic search model trained on German version of MSMARCO dataset, optimized with hard negatives and Margin MSE loss function
Downloads 20.53k
Release Time : 11/23/2022

Model Overview

This model is specifically designed for German semantic search and document retrieval, capable of finding relevant passages based on queries. Trained on machine-translated German MSMARCO dataset with advanced training techniques for efficient retrieval.

Model Features

Hard Negative Training
Uses multi-system retrieval results as negative samples to enhance the model's ability to distinguish relevant passages
Margin MSE Loss Function
Guides bi-encoder training with cross-encoder to optimize similarity margin calculation
Asymmetric Search Optimization
Specifically optimized for query-passage asymmetric search scenarios
Cross-domain Applicability
Trained on multi-domain MSMARCO data to meet retrieval needs across different domains

Model Capabilities

Semantic search
Passage retrieval
Query-passage matching
Cross-domain information retrieval

Use Cases

Information Retrieval
Q&A Systems
Retrieve the most relevant answer passages based on user questions
Achieves NDCG@10 of 0.7196 on germandpr-beir test set
Document Search
Locate relevant content from large document collections
Outperforms traditional BM25 algorithm by approximately 34%
Enterprise Applications
Knowledge Base Retrieval
Quickly locate relevant information in enterprise knowledge bases
Featured Recommended AI Models
ยฉ 2025AIbase