M

Mmlw Retrieval Roberta Base

Developed by sdadas
MMLW (I Must Get Better News) is a Polish neural text encoder optimized for information retrieval tasks, capable of converting queries and passages into 768-dimensional vectors.
Downloads 408
Release Time : 10/18/2023

Model Overview

This model is a Polish sentence transformer primarily used for feature extraction and sentence similarity calculation, especially suitable for information retrieval tasks.

Model Features

Multilingual knowledge distillation
Trained on 60 million Polish-English text pairs using multilingual knowledge distillation, with English FlagEmbeddings (BGE) as the teacher model.
Contrastive loss fine-tuning
Fine-tuned on the Polish MS MARCO training set using contrastive loss, employing large batch sizes to improve training efficiency.
Specific prefix requirements
Encoding text requires specific prefixes and suffixes; queries must begin with the 'query: ' prefix.

Model Capabilities

Text encoding
Sentence similarity calculation
Information retrieval

Use Cases

Information retrieval
Health information retrieval
Retrieve the most relevant answers for health-related queries
Can accurately match healthy diet recommendations
Political information retrieval
Retrieve information related to political commitments
Can identify politically relevant texts
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase