Gte Qwen2 1.5B Instruct
A general-purpose text embedding model based on Qwen2-1.5B, supporting multilingual and long-text processing
Downloads 242.12k
Release Time : 6/29/2024
Model Overview
This model is the latest addition to the General Text Embedding (gte) series, focusing on generating high-quality text embeddings suitable for various NLP tasks such as information retrieval and semantic similarity calculation.
Model Features
Bidirectional Attention Mechanism
Enhances contextual understanding and improves embedding quality
Efficient Instruction Fine-Tuning
Only fine-tunes the query side for improved efficiency
Multilingual Support
Trained on cross-domain, multi-scenario multilingual text corpora
Long-Text Processing
Supports input lengths of up to 32k tokens
Model Capabilities
Text Embedding Generation
Semantic Similarity Calculation
Information Retrieval
Multilingual Text Processing
Use Cases
Information Retrieval
Web Search Query
Retrieves relevant document passages based on user queries
Achieved a score of 67.16 in MTEB evaluation
Semantic Similarity
Document Similarity Calculation
Computes semantic similarity between different documents
Featured Recommended AI Models