S

SGPT 125M Weightedmean Nli Bitfit

Developed by Muennighoff
SGPT-125M is a sentence transformer model based on weighted mean and fine-tuned with Natural Language Inference (NLI), designed for sentence similarity calculation and feature extraction.
Downloads 326
Release Time : 3/2/2022

Model Overview

This model is primarily used for sentence similarity calculation and text feature extraction, with optimized multilingual text processing capabilities through weighted mean and NLI fine-tuning.

Model Features

Multi-task Evaluation Capability
Performs well on various tasks in the MTEB (Multi-task Evaluation Benchmark), including classification, clustering, and retrieval.
Multilingual Support
Supports text processing in multiple languages, including English, German, Spanish, French, Japanese, and Chinese.
Weighted Mean Optimization
Uses weighted mean method to optimize sentence representations, improving the accuracy of similarity calculations.
NLI Fine-tuning
Fine-tuned with Natural Language Inference (NLI) tasks to enhance semantic understanding capabilities.

Model Capabilities

Sentence similarity calculation
Text feature extraction
Multilingual text classification
Document clustering
Information retrieval
Search result reranking
Semantic textual similarity evaluation
Bilingual text mining

Use Cases

E-commerce
Amazon Review Classification
Classify multilingual product reviews on Amazon
English review classification accuracy 35.098%, German 24.516%, Spanish 29.098%
Counterfactual Classification
Identify counterfactual statements in Amazon reviews
English accuracy 65.88%, German 59.08%, Japanese 56.42%
Academic Research
arXiv Paper Clustering
Perform point-to-point and sentence-to-sentence clustering on arXiv academic papers
Point-to-point V-measure 34.74, sentence-to-sentence V-measure 24.68
biorxiv Paper Clustering
Cluster analysis on biorxiv biology papers
Point-to-point V-measure 28.93, sentence-to-sentence V-measure 23.08
Q&A Systems
AskUbuntu Duplicate Question Detection
Identify duplicate questions in the AskUbuntu forum
Average precision 52.63%, mean reciprocal rank 65.76%
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase