S

SGPT 125M Weightedmean Msmarco Specb Bitfit

Developed by Muennighoff
SGPT-125M is a sentence transformer model optimized with weighted mean and bitfit techniques, focusing on sentence similarity tasks.
Downloads 4,086
Release Time : 3/2/2022

Model Overview

This model is primarily used for sentence similarity calculation and feature extraction, supporting multilingual text processing tasks.

Model Features

Multilingual support
Supports processing multiple languages including English, German, Spanish, French, Japanese, and Chinese.
Weighted mean technique
Uses weighted mean method to optimize sentence representation and improve similarity calculation performance.
Bitfit optimization
Employs bitfit technology for model fine-tuning to enhance performance on specific tasks.

Model Capabilities

Sentence similarity calculation
Text feature extraction
Multilingual text processing
Classification tasks
Clustering tasks
Retrieval tasks

Use Cases

E-commerce
Product review classification
Classify product reviews on platforms like Amazon.
Achieved 31.17% accuracy in English for MTEB Amazon review classification task
Counterfactual classification
Identify counterfactual statements in Amazon product descriptions.
Achieved 61.24% accuracy in English for MTEB Amazon counterfactual classification task
Academic research
Paper clustering
Cluster academic papers from arXiv and biorxiv.
Achieved V-measure of 39.71 in MTEB Arxiv clustering P2P task
Q&A systems
Duplicate question identification
Identify duplicate questions on AskUbuntu forums.
Achieved average precision of 55.84% in MTEB AskUbuntu duplicate questions task
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase