S

Stella Pl Retrieval

Developed by sdadas
This is a text encoder based on stella_en_1.5B_v5 and further fine-tuned for Polish information retrieval tasks, specifically optimized for Polish information retrieval.
Downloads 913
Release Time : 9/28/2024

Model Overview

The model is adapted to Polish through multilingual knowledge distillation and fine-tuned with contrastive loss, converting text into 1024-dimensional vectors, making it particularly suitable for Polish information retrieval tasks.

Model Features

Polish optimization
Specifically optimized for Polish information retrieval tasks through multilingual knowledge distillation and contrastive loss fine-tuning.
Efficient retrieval
Uses 1024-dimensional vector representations for efficient information retrieval tasks.
Large-scale training
Trained with 20 million Polish-English text pairs for knowledge distillation and 1.4 million query data points for fine-tuning.

Model Capabilities

Text encoding
Information retrieval
Sentence similarity calculation

Use Cases

Information retrieval
Polish document retrieval
Retrieve relevant documents from a Polish document library
Achieved an NDCG@10 score of 62.32 in the Polish Information Retrieval Benchmark (PIRB)
Semantic analysis
Polish semantic similarity calculation
Calculate semantic similarity between Polish texts
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase