rubert-base-cased-conversational-paraphrase-v1 Open-Source Model - Free Evaluation of Semantic Similarity of Russian Sentences

Rubert Base Cased Conversational Paraphrase V1

Developed by s-nlp

Russian sentence semantic similarity evaluation model based on ruBERT-conversational, trained on multiple paraphrase detection datasets

Text Classification

Transformers

Other#Russian sentence similarity #Multi-dataset fusion training #Semantic matching evaluation

Downloads 281

Release Time : 7/3/2022

Model Overview

This model is specifically designed for evaluating semantic similarity of Russian sentences, supporting tasks such as paraphrase detection and text classification. Based on the ruBERT-conversational architecture, fine-tuned on three Russian paraphrase detection datasets.

Model Features

Multi-dataset training

Jointly trained on three Russian paraphrase detection datasets: ru_paraphraser, RuPAWS, and detoxification dataset

Semantic similarity evaluation

Accurately evaluates semantic similarity between Russian sentences

Optimized training

Uses Adam optimizer with learning rate 1e-5, batch size 32, trained for 3 epochs

Model Capabilities

Russian sentence similarity calculation

Paraphrase detection

Text classification

Semantic analysis

Use Cases

Natural Language Processing

Paraphrase detection

Identifies Russian sentences with different expressions but the same meaning

Achieves ROC AUC of 0.848 on ru_paraphraser dataset

Q&A systems

Evaluates semantic similarity between user questions and knowledge base questions

Achieves ROC AUC of 0.761 on RuPAWS QQP dataset

Content moderation

Detects semantically similar but differently expressed inappropriate content

Achieves ROC AUC of 0.822 on detoxification dataset

Featured Recommended AI Models

Empowering the Future, Your AI Solution Knowledge Base

English 简体中文繁體中文にほんご

Rubert Base Cased Conversational Paraphrase V1

Model Overview

Model Features

Model Capabilities

Use Cases

🚀 Russian Paraphrase Detection Model

🚀 Quick Start

📦 Installation

💻 Usage Examples

📚 Documentation

Training Information

Performance

Related Model

🔧 Technical Details

📄 License