Ruroberta Large Paraphrase V1
A Russian sentence similarity classification model based on ruRoberta-large, used to determine if two sentences are paraphrases
Downloads 942
Release Time : 7/2/2022
Model Overview
This model predicts the semantic equivalence of two Russian sentences, determining whether a text pair is a paraphrase (1) or not (0). Suitable for evaluating content preservation in text rewriting or style transfer.
Model Features
Multi-dataset joint training
Combines RuPAWS, ru_paraphraser, and detoxification datasets, covering various text rewriting scenarios
High-performance semantic matching
Achieves ROC AUC scores above 0.85 on multiple test sets, with a peak of 0.906
Robust architecture
Based on the powerful ruRoberta-large model, with excellent Russian language understanding capabilities
Model Capabilities
Russian sentence similarity calculation
Semantic equivalence judgment
Text rewriting content preservation evaluation
Style transfer effectiveness verification
Use Cases
Text processing
Paraphrase detection
Determine if two Russian sentences are paraphrases
Accurately identifies semantically equivalent expressions
Detoxification evaluation
Assess whether detoxified text retains original meaning
ROC AUC of 0.857
Quality assessment
Machine translation evaluation
Evaluate semantic consistency between different translation versions
Featured Recommended AI Models