X

Xlm Roberta Ua Distilled

Developed by panalexeu
This is a fine-tuned sentence transformer model based on xlm-roberta-base, supporting English and Ukrainian, suitable for tasks like semantic textual similarity and semantic search.
Downloads 121
Release Time : 4/13/2025

Model Overview

The model maps sentences and paragraphs into a 768-dimensional dense vector space, applicable for tasks such as semantic textual similarity, semantic search, paraphrase mining, text classification, and clustering.

Model Features

Multilingual Support
Supports semantic understanding and similarity calculation for English and Ukrainian.
High-dimensional Vector Representation
Maps text to a 768-dimensional dense vector space, capturing rich semantic information.
Knowledge Distillation Training
Optimizes model performance through knowledge distillation methods.

Model Capabilities

Semantic textual similarity calculation
Cross-lingual semantic search
Text vectorization representation
Multilingual text classification
Text clustering analysis

Use Cases

Cross-lingual Information Retrieval
English-Ukrainian Document Search
Use English queries to retrieve Ukrainian documents.
Pearson similarity 0.5926 (sts17-en-ua dataset)
Semantic Similarity Analysis
Same-language Text Similarity Evaluation
Evaluate semantic similarity of English or Ukrainian text pairs.
English-English Spearman similarity 0.7308 (sts17-en-en dataset)
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase