A

Arabic English Sts Matryoshka V2.0

Developed by omarelshehy
A bilingual sentence transformer model fine-tuned based on FacebookAI/xlm-roberta-large, supporting semantic text similarity calculation for Arabic and English.
Downloads 1,072
Release Time : 10/16/2024

Model Overview

This is a bilingual (Arabic-English) sentence-transformers model, fine-tuned based on FacebookAI/xlm-roberta-large. It maps sentences and paragraphs to a 1024-dimensional dense vector space, suitable for tasks like semantic text similarity, semantic search, paraphrase mining, text classification, clustering, etc.

Model Features

Bilingual Support
Supports bilingual processing for Arabic and English, including cross-lingual semantic similarity calculation.
Matryoshka Embedding
Supports truncating embeddings to smaller sizes (1024, 768, 512, 256, 128, and 64) to optimize performance and memory usage.
High Performance
Excels on MTEB evaluation metrics, especially for Arabic-English (ar-en) benchmarks.

Model Capabilities

Semantic Text Similarity Calculation
Semantic Search
Paraphrase Mining
Text Classification
Text Clustering

Use Cases

Natural Language Processing
Cross-Lingual Document Retrieval
Perform semantic search and retrieval between Arabic and English documents.
Bilingual Text Classification
Classify Arabic and English texts.
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase