Instructor Xl
I
Instructor Xl
Developed by retrainai
A T5 architecture-based sentence embedding model focused on semantic similarity and information retrieval tasks for English text.
Downloads 22
Release Time : 12/28/2023
Model Overview
This model is a T5 architecture-based sentence embedding model primarily used for computing sentence similarity, information retrieval, text classification, and clustering in natural language processing tasks. It performs exceptionally well on multiple standard datasets, particularly in semantic similarity and retrieval tasks.
Model Features
Multitask Performance
Excellent performance across various tasks including sentence similarity, information retrieval, text classification, and clustering.
Powerful Semantic Understanding
Based on the T5 architecture, it deeply understands text semantics and generates high-quality sentence embeddings.
Comprehensive Evaluation
Thoroughly evaluated on multiple standard datasets such as MTEB, validating its effectiveness.
Model Capabilities
Sentence similarity calculation
Information retrieval
Text classification
Text clustering
Feature extraction
Text reranking
Prompt retrieval
Use Cases
Information Retrieval
Question Answering System
Used to retrieve the most relevant answers to user questions.
Achieved map@100 of 38.79 on the CQADupstack dataset.
Document Retrieval
Retrieves relevant content from a large collection of documents.
Achieved ndcg@100 of 58.88 on the ArguAna dataset.
Text Classification
Sentiment Analysis
Classifies text into positive/negative sentiments.
Achieved an accuracy of 86.54% on the AmazonPolarity dataset.
Intent Recognition
Identifies the intent category of user queries.
Achieved an accuracy of 82.66% on the Banking77 dataset.
Semantic Similarity
Duplicate Question Detection
Identifies semantically similar questions.
Achieved a map of 65.35 on the AskUbuntuDupQuestions dataset.
Semantic Search
Search based on semantics rather than keyword matching.
Achieved a Spearman correlation of 84.15 for cosine similarity on the BIOSSES dataset.
Featured Recommended AI Models