I

Instructor Base

Developed by hkunlp
A text embedding model based on the T5 architecture, focusing on sentence similarity calculation and text retrieval tasks, with excellent performance in multiple benchmark tests.
Downloads 13.22k
Release Time : 12/20/2022

Model Overview

This model is a text embedding model based on the T5 architecture, primarily used to generate high-quality sentence embedding vectors, supporting various natural language processing tasks such as information retrieval, text classification, clustering, and semantic similarity calculation.

Model Features

Excellent Multi-task Performance
Performs well in multiple tasks of the MTEB benchmark, including classification, clustering, and retrieval tasks.
Efficient Text Embedding
Capable of generating high-quality sentence embedding vectors, suitable for large-scale information retrieval scenarios.
Broad Applicability
Supports various downstream NLP tasks, including similarity calculation, classification, and clustering.

Model Capabilities

Sentence Similarity Calculation
Text Embedding Generation
Information Retrieval
Text Classification
Text Clustering
Semantic Search
Text Re-ranking

Use Cases

E-commerce
Product Review Classification
Sentiment analysis classification of Amazon product reviews
Achieved 88.36% accuracy in the AmazonPolarity classification task
Counterfactual Detection
Identifying counterfactual statements in Amazon product reviews
Achieved 86.21% accuracy in the AmazonCounterfactual classification task
Finance
Bank Customer Service Classification
Classification of bank customer inquiries
Achieved 77.04% accuracy in the Banking77 classification task
Academic Research
Paper Clustering
Topic clustering of arXiv and biorxiv papers
Achieved a 39.68 v_measure score in the ArxivClusteringP2P task
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase