Lt Wikidata Comp Multi

Developed by dell-research-harvard
A multilingual sentence similarity model fine-tuned from sentence-transformers/paraphrase-multilingual-mpnet-base-v2, supporting semantic matching in 12 languages.
Downloads 415
Release Time: 8/29/2023

Model Overview

This model is designed for record linkage (entity matching) and suits clustering, deduplication, and linking tasks. It computes sentence similarity in 12 languages, including German, English, and Chinese.
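As a rough illustration of what sentence similarity calculation looks like in practice, the sketch below embeds two strings and compares them with cosine similarity. The model ID is an assumption inferred from the developer and model name shown on this page, and `load_encoder` and `similarity` are hypothetical helper names, not part of any published API. The encoder is loaded lazily, so the similarity logic can be tried with any embedding function.

```python
import math
from typing import Callable, List, Sequence

Vector = Sequence[float]

# Assumption: Hugging Face model ID inferred from the developer and model name on this page.
MODEL_ID = "dell-research-harvard/lt-wikidata-comp-multi"


def load_encoder(model_id: str = MODEL_ID) -> Callable[[List[str]], List[List[float]]]:
    """Return a function that maps strings to embeddings (hypothetical helper).

    The import is done lazily so the rest of the sketch runs without the library.
    """
    from sentence_transformers import SentenceTransformer

    model = SentenceTransformer(model_id)
    return lambda texts: model.encode(texts).tolist()


def cosine_similarity(a: Vector, b: Vector) -> float:
    """Cosine of the angle between two vectors; 0.0 if either has zero length."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def similarity(encode: Callable[[List[str]], List[List[float]]], left: str, right: str) -> float:
    """Embed two strings with `encode` and return their cosine similarity."""
    vec_left, vec_right = encode([left, right])
    return cosine_similarity(vec_left, vec_right)
```

With the sentence-transformers package installed, `similarity(load_encoder(), "Siemens AG", "Siemens Aktiengesellschaft")` would score one pair of company names. The example strings are illustrative only.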

Model Features

Multilingual Support: Computes sentence similarity in 12 languages, covering major European and Asian languages.
Entity Matching Optimization: Optimized for entity linking tasks such as company alias matching.
Efficient Inference: Computes sentence embeddings quickly through the sentence-transformers framework.

Model Capabilities

Multilingual sentence similarity calculation
Entity matching and linking
Text clustering analysis
Semantic search
Record deduplication
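Clustering and record deduplication can both be built on pairwise embedding similarity. The sketch below is one simple way to do it, not the developers' method: it links any two records whose cosine similarity reaches a threshold and returns the connected groups (single-link clustering with union-find). The function name and the 0.8 default threshold are assumptions for illustration. The embeddings are expected to come from the model.

```python
import math
from typing import Dict, List, Sequence

Vector = Sequence[float]


def cosine_similarity(a: Vector, b: Vector) -> float:
    """Cosine of the angle between two vectors; 0.0 if either has zero length."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def cluster_by_threshold(embeddings: Sequence[Vector], threshold: float = 0.8) -> List[List[int]]:
    """Group record indices connected by similarity >= threshold.

    The default threshold is an arbitrary illustrative value and should be
    tuned on labeled pairs from the target data.
    """
    parent = list(range(len(embeddings)))

    def find(i: int) -> int:
        # Follow parent links to the root, shortening the path along the way.
        while parent[i] != i:
            parent[i] = parent[parent[i]]
            i = parent[i]
        return i

    # Compare every pair once and merge the groups of sufficiently similar records.
    for i in range(len(embeddings)):
        for j in range(i + 1, len(embeddings)):
            if cosine_similarity(embeddings[i], embeddings[j]) >= threshold:
                root_i, root_j = find(i), find(j)
                if root_i != root_j:
                    # Keep the smaller index as root so output order is stable.
                    parent[max(root_i, root_j)] = min(root_i, root_j)

    clusters: Dict[int, List[int]] = {}
    for i in range(len(embeddings)):
        clusters.setdefault(find(i), []).append(i)
    return list(clusters.values())
```

Each returned list holds the indices of records judged to be duplicates of one another. The all-pairs comparison is quadratic in the number of records, so larger datasets would likely need an approximate nearest-neighbor index in front of it.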

Use Cases

Enterprise Data Management
Company Name Standardization: Matches company name variants from different sources to standard names, improving the cleanliness and consistency of enterprise databases.
Multilingual Applications
Cross-language Document Retrieval: Finds semantically similar content across documents in different languages, supporting knowledge discovery in multilingual environments.
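Both use cases reduce to the same step: rank a set of candidates by similarity to a query embedding. A minimal sketch under that assumption follows. `rank_candidates` is a hypothetical helper name, and the vectors are expected to be embeddings produced by the model for the query and for each candidate (standard company names, or documents in other languages).

```python
import math
from typing import List, Sequence, Tuple

Vector = Sequence[float]


def cosine_similarity(a: Vector, b: Vector) -> float:
    """Cosine of the angle between two vectors; 0.0 if either has zero length."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def rank_candidates(
    query_vec: Vector, candidate_vecs: Sequence[Vector], top_k: int = 3
) -> List[Tuple[int, float]]:
    """Return up to top_k (candidate_index, score) pairs, best match first."""
    scored = [
        (index, cosine_similarity(query_vec, vec))
        for index, vec in enumerate(candidate_vecs)
    ]
    # Sort by score, highest first; ties keep their original order.
    scored.sort(key=lambda pair: pair[1], reverse=True)
    return scored[:top_k]
```

For name standardization, `top_k=1` gives the closest standard name for a variant, and a minimum score cutoff can be added to leave poor matches unassigned. For retrieval, a larger `top_k` returns a shortlist of documents to review.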