SGPT 1.3B Weightedmean Nli Bitfit
S
SGPT 1.3B Weightedmean Nli Bitfit
Developed by Muennighoff
SGPT is a sentence embedding model based on the GPT architecture, specifically designed for semantic search tasks, generating sentence representations through weighted mean pooling.
Downloads 206
Release Time : 3/2/2022
Model Overview
This model adopts the GPTNeo architecture and generates sentence embeddings via weighted mean pooling, primarily used for sentence similarity computation and semantic search tasks.
Model Features
Weighted Mean Pooling
Uses weighted mean pooling to generate sentence representations, better capturing semantic information.
Large-scale Pretraining
Fine-tuned on the 1.3B-parameter GPTNeo model, offering robust semantic understanding capabilities.
Efficient Fine-tuning
Employs bitfit technology for parameter-efficient fine-tuning, training only bias parameters.
Model Capabilities
Sentence Embedding Generation
Semantic Similarity Computation
Semantic Search
Use Cases
Information Retrieval
Document Retrieval
Find semantically relevant documents based on query statements.
Question Answering Systems
Question Matching
Match user questions with similar ones in the knowledge base.
Featured Recommended AI Models