M

Mmlw Retrieval E5 Base

Developed by sdadas
MMLW (I Must Get Better Messages) is a Polish neural text encoder optimized for information retrieval tasks, capable of converting queries and passages into 768-dimensional vectors.
Downloads 144
Release Time : 10/18/2023

Model Overview

This model is a Polish sentence transformer primarily used for feature extraction and sentence similarity calculation, especially suitable for information retrieval tasks.

Model Features

Multilingual knowledge distillation
Trained using multilingual knowledge distillation method, utilizing English FlagEmbeddings as the teacher model
Contrastive loss fine-tuning
Fine-tuned on the Polish MS MARCO training set using contrastive loss with large batch size training
Specific prefix handling
Queries require adding 'query:' prefix and passages require adding 'passage:' prefix for optimal performance

Model Capabilities

Text encoding
Sentence similarity calculation
Information retrieval

Use Cases

Information retrieval
Q&A systems
Used to match user queries with relevant answer passages
Capable of accurately finding the most relevant answers to queries
Document retrieval
Searching for documents most relevant to specific queries in large document collections
Achieved NDCG@10 of 56.09 on the Polish Information Retrieval Benchmark (PIRB)
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase