P

Pd Robert

Developed by viswadarshan06
A RoBERTa-base fine-tuned paraphrase detection model trained on four benchmark datasets: MRPC, QQP, PAWS-X, and PIT. Suitable for duplicate content detection, Q&A systems, and semantic similarity analysis.
Downloads 357
Release Time : 2/9/2025

Model Overview

This model is a Transformer-based sentence pair classifier specifically designed for detecting paraphrase relationships in English text, excelling in diverse linguistic structures.

Model Features

Multi-dataset joint training
Combines four benchmark datasets (MRPC, QQP, PAWS-X, and PIT) covering various scenarios including news, Q&A, and adversarial texts.
High robustness
Excellent performance on the adversarial PAWS-X dataset (F1 score 94.13%).
Strong domain adaptability
Supports further fine-tuning with domain-specific data (e.g., medical, legal).

Model Capabilities

Duplicate question detection
Semantic similarity analysis
Document deduplication
Q&A system optimization

Use Cases

Customer service
FAQ duplicate question identification
Automatically identifies semantically duplicate questions in user queries.
Reduces manual review workload.
Content moderation
Plagiarism detection
Identifies paraphrased plagiarized content.
Accuracy exceeds 90%.
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
Š 2025AIbase