P

Pearl Small

Developed by Lihuchen
The Pearl Small Model is a lightweight string embedding model specifically designed for string semantic similarity calculation, generating high-quality embedding vectors for tasks such as string matching and entity retrieval.
Downloads 1,824
Release Time : 2/4/2024

Model Overview

This model integrates phrase type information and morphological features to more accurately capture variations in string forms. Fine-tuned based on E5-small, it can generate superior vector representations for phrases and strings.

Model Features

High-quality phrase representation
Learns high-quality universal phrase representations, outperforming traditional sentence embedding models.
Lightweight design
Only 34 million parameters, with low memory usage and fast inference speed.
Morphology-aware
Incorporates morphological features to accurately capture variations in string forms.

Model Capabilities

Calculate string semantic similarity
Generate phrase embedding vectors
Entity retrieval
String matching
Entity clustering
Fuzzy joining

Use Cases

Information retrieval
Entity linking
Link entities mentioned in text to standard entities in a knowledge base.
Achieved 48.1 points on the YAGO dataset.
String matching
Match strings from different sources that are semantically similar.
Achieved 97.0 points on the PPDB dataset.
Data integration
Fuzzy joining
Join records from different data sources that represent the same entity.
Achieved 75.2 points on the AutoFJ task.
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase