SGPT 5.8B Weightedmean Nli Bitfit
S
SGPT 5.8B Weightedmean Nli Bitfit
Developed by Muennighoff
SGPT-5.8B is a sentence transformer model based on a 5.8B parameter scale, optimized through weighted mean and NLI (Natural Language Inference) fine-tuning, specifically designed for sentence similarity calculation and feature extraction tasks.
Downloads 86
Release Time : 3/2/2022
Model Overview
This model is primarily used for sentence similarity calculation and feature extraction, excelling in multiple tasks on the MTEB (Massive Text Embedding Benchmark), supporting multiple languages and cross-lingual tasks.
Model Features
Multilingual Support
Supports sentence similarity calculation and feature extraction in multiple languages, including English, German, French, Spanish, Japanese, and Chinese.
High Performance
Performs excellently in multiple MTEB tasks, especially achieving high scores in sentence similarity and classification tasks.
NLI Fine-tuning
Fine-tuned through Natural Language Inference (NLI) tasks, enhancing the model's capabilities in sentence similarity and semantic understanding.
Model Capabilities
Sentence similarity calculation
Feature extraction
Text classification
Cross-lingual text mining
Semantic retrieval
Use Cases
E-commerce
Product review classification
Used for sentiment analysis and classification of product reviews on e-commerce platforms like Amazon.
Achieved 82.31% accuracy in the MTEB Amazon polarity classification task.
Information Retrieval
Q&A systems
Used for similar question retrieval and answer matching in Q&A systems.
Achieved an average precision of 55.90 in the AskUbuntu duplicate question task.
Cross-lingual Applications
Bilingual text mining
Used for cross-lingual text matching and alignment tasks.
Achieved 75.49% accuracy in the BUCC German-English task.
Featured Recommended AI Models