GIST Large Embedding V0
A text embedding model fine-tuned based on BAAI/bge-large-en-v1.5, trained using the MEDI dataset and mined triplets from the MTEB classification task training set, capable of encoding retrieval queries directly without instructions.
Downloads 110.09k
Release Time : 2/14/2024
Model Overview
This model is primarily used for text embedding tasks, converting text into high-dimensional vector representations, suitable for scenarios such as information retrieval and semantic similarity computation.
Model Features
Instruction-Free
Directly encodes retrieval queries without the need for constructing prompt templates.
High Performance
Significant improvement in performance for most retrieval tasks.
Based on GISTEmbed Technology
Utilizes Guided In-batch Selection for Training Negatives technology to optimize embedding effectiveness.
Model Capabilities
Text Embedding
Semantic Similarity Computation
Information Retrieval
Use Cases
Information Retrieval
Document Retrieval
Used to retrieve documents semantically similar to the query.
Significant improvement in performance for most retrieval tasks.
Semantic Similarity Computation
Text Similarity Comparison
Computes the semantic similarity between two pieces of text.
Featured Recommended AI Models