Labse En Ru
A streamlined version of the LaBSE model specialized for English and Russian, significantly reducing model size while preserving original embedding quality
Downloads 375.34k
Release Time : 3/2/2022
Model Overview
This model is a lightweight version of LaBSE supporting only English and Russian, with vocabulary reduced to 10% of the original and parameters retained at 27%, fully maintaining original embedding vector quality, suitable for tasks like sentence similarity calculation
Model Features
Bilingual specialization
Retains only English and Russian tokens, reducing vocabulary to 10% of original with significant model size reduction
Quality-preserving compression
Maintains full original embedding quality for English and Russian while reducing model scale
Multilingual adaptation solution
Provides pruning solutions adaptable to other language combinations (refer to provided Colab notebook)
Model Capabilities
Generate sentence embeddings
Calculate sentence similarity
Support English and Russian text processing
Use Cases
Text similarity
Cross-lingual document retrieval
Establish semantic connections between English and Russian documents for cross-lingual retrieval
Maintains retrieval accuracy comparable to original LaBSE
Bilingual content matching
Identify semantic correspondences between English and Russian content
Feature extraction
Downstream task feature input
Provide pre-trained embedding features for classification, clustering and other tasks
Featured Recommended AI Models
ยฉ 2025AIbase