E5 Base
E5-base is a general-purpose text embedding model suitable for various natural language processing tasks such as classification, retrieval, clustering, and semantic similarity calculation.
Downloads 30.85k
Release Time : 12/26/2022
Model Overview
E5-base is a Transformer-based text embedding model capable of converting text into high-dimensional vector representations, suitable for various downstream tasks.
Model Features
Multitask Support
Supports various natural language processing tasks, including classification, retrieval, clustering, and semantic similarity calculation.
High Performance
Performs excellently on multiple benchmark datasets, such as the MTEB dataset.
Versatility
Applicable to various text processing scenarios without requiring extensive task-specific adjustments.
Model Capabilities
Text classification
Text retrieval
Text clustering
Semantic similarity calculation
Text reranking
Use Cases
E-commerce
Product Review Classification
Classify Amazon product reviews to identify positive and negative feedback.
Achieves an accuracy of 87.96% on the MTEB AmazonPolarityClassification dataset.
Product Retrieval
Retrieve relevant products based on user queries.
Achieves an F1 score of 42.23 on the MTEB AmazonReviewsClassification dataset.
Academic Research
Paper Clustering
Cluster academic papers from arXiv and BioRxiv.
Achieves a V-measure of 44.57 on the MTEB ArxivClusteringP2P dataset.
Q&A Systems
Duplicate Question Detection
Detect duplicate questions in Q&A communities.
Achieves a MAP of 59.66 on the MTEB AskUbuntuDupQuestions dataset.
Featured Recommended AI Models