S

SGPT 1.3B Weightedmean Nli Bitfit

Developed by Muennighoff
SGPT is a sentence embedding model based on the GPT architecture, specifically designed for semantic search tasks, generating sentence representations through weighted mean pooling.
Downloads 206
Release Time : 3/2/2022

Model Overview

This model adopts the GPTNeo architecture and generates sentence embeddings via weighted mean pooling, primarily used for sentence similarity computation and semantic search tasks.

Model Features

Weighted Mean Pooling
Uses weighted mean pooling to generate sentence representations, better capturing semantic information.
Large-scale Pretraining
Fine-tuned on the 1.3B-parameter GPTNeo model, offering robust semantic understanding capabilities.
Efficient Fine-tuning
Employs bitfit technology for parameter-efficient fine-tuning, training only bias parameters.

Model Capabilities

Sentence Embedding Generation
Semantic Similarity Computation
Semantic Search

Use Cases

Information Retrieval
Document Retrieval
Find semantically relevant documents based on query statements.
Question Answering Systems
Question Matching
Match user questions with similar ones in the knowledge base.
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase