Sentence Luke Japanese Base Lite
A Japanese sentence embedding model based on the LUKE architecture. In internal testing, it performed on par with or better than Japanese Sentence-BERT models.
Downloads 2,690
Release Time: 3/19/2023
Model Overview
This model generates embedding vectors for Japanese sentences. It is suited to tasks such as sentence similarity calculation and feature extraction.
Model Features
Performance on par with or better than Japanese Sentence-BERT
In internal testing, this model scored approximately 0.5 percentage points higher in quantitative accuracy than Japanese Sentence-BERT models, and its results in qualitative evaluation were rated better still.
Based on LUKE architecture
Uses studio-ousia/luke-japanese-base-lite as the pre-trained base model, which is intended to give better contextual understanding.
Sentence-level embedding
Optimized for sentence-level representation, which makes it well suited to sentence similarity calculation tasks.
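To illustrate what "sentence-level embedding" means in practice, here is a minimal sketch of masked mean pooling, a common way to collapse per-token vectors into one sentence vector. The page does not state which pooling this model uses, so the technique is an assumption; the function name `mean_pool` and the toy vectors are hypothetical and stand in for real encoder output.

```python
def mean_pool(token_vectors, attention_mask):
    """Average the token vectors whose mask value is 1, ignoring padding.

    Assumption: mean pooling is used here for illustration only; the
    source does not say how this model pools token outputs.
    """
    dim = len(token_vectors[0])
    total = [0.0] * dim
    count = 0
    for vector, keep in zip(token_vectors, attention_mask):
        if keep:
            for i, value in enumerate(vector):
                total[i] += value
            count += 1
    if count == 0:
        raise ValueError("attention_mask selects no tokens")
    return [value / count for value in total]


# Toy input (hypothetical): three token vectors, the last one is padding.
tokens = [[1.0, 2.0], [3.0, 4.0], [9.0, 9.0]]
mask = [1, 1, 0]
sentence_vector = mean_pool(tokens, mask)
```

The padding vector is excluded from the average, so sentence length differences within a batch do not distort the result.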
Model Capabilities
Japanese sentence embedding
Sentence similarity calculation
Feature extraction
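Sentence similarity between two embeddings is often measured with cosine similarity. The sketch below assumes that metric (the page does not name one) and uses short hypothetical vectors in place of real model output.

```python
import math


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        raise ValueError("cosine similarity is undefined for a zero vector")
    return dot / (norm_a * norm_b)


# Hypothetical embeddings: parallel vectors score near 1, orthogonal ones 0.
same_direction = cosine_similarity([1.0, 2.0], [2.0, 4.0])
orthogonal = cosine_similarity([1.0, 0.0], [0.0, 1.0])
```

Because the score depends only on direction, it is insensitive to the overall magnitude of the embedding vectors.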
Use Cases
Text similarity
Semantic search
Improves search results by calculating semantic similarity between queries and documents
Enhances relevance of search results
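The semantic search use case can be sketched as ranking documents by their similarity to a query embedding. This is a minimal illustration under stated assumptions: cosine similarity as the metric, precomputed embeddings, and the hypothetical names `rank_documents`, `doc_vectors`, and `query_vector`.

```python
import math


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length, non-zero vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


def rank_documents(query_vector, doc_vectors, top_k=3):
    """Return the top_k (doc_id, score) pairs, highest similarity first."""
    scored = [
        (doc_id, cosine_similarity(query_vector, vector))
        for doc_id, vector in doc_vectors.items()
    ]
    scored.sort(key=lambda item: item[1], reverse=True)
    return scored[:top_k]


# Hypothetical 2-dimensional embeddings standing in for model output.
doc_vectors = {"doc_a": [0.9, 0.1], "doc_b": [0.1, 0.9], "doc_c": [0.7, 0.7]}
query_vector = [1.0, 0.0]
results = rank_documents(query_vector, doc_vectors, top_k=2)
```

For a large collection, the linear scan would normally be replaced by an approximate nearest-neighbor index, but the ranking principle is the same.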
Duplicate content detection
Identifies texts with different expressions but similar semantics
Effectively detects duplicate or highly similar content
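Duplicate detection can be approximated by flagging every pair of texts whose embeddings exceed a similarity threshold. The sketch below is an assumption-laden illustration: the 0.95 threshold is arbitrary and would need tuning on real data, and `find_near_duplicates` and the sample vectors are hypothetical.

```python
import math
from itertools import combinations


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length, non-zero vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


def find_near_duplicates(vectors, threshold=0.95):
    """Return id pairs whose similarity is at or above the threshold."""
    pairs = []
    for (id_a, vec_a), (id_b, vec_b) in combinations(vectors.items(), 2):
        if cosine_similarity(vec_a, vec_b) >= threshold:
            pairs.append((id_a, id_b))
    return pairs


# Hypothetical embeddings: t1 and t2 point in almost the same direction.
vectors = {"t1": [1.0, 0.0], "t2": [0.99, 0.05], "t3": [0.0, 1.0]}
duplicates = find_near_duplicates(vectors)
```

Comparing all pairs is quadratic in the number of texts, which is acceptable for small batches but not for large corpora.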
Information retrieval
Document clustering
Automatically groups documents based on semantic similarity
Achieves more accurate document classification and organization
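One simple way to group documents by semantic similarity is a greedy pass that assigns each document to the first cluster whose seed it resembles. This is a deliberately basic stand-in, not a method described on this page; the 0.8 threshold, the function name `cluster_by_similarity`, and the sample vectors are all hypothetical.

```python
import math


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length, non-zero vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


def cluster_by_similarity(vectors, threshold=0.8):
    """Greedy grouping: join the first cluster whose seed is similar enough."""
    clusters = []  # each entry is (seed_vector, [member ids])
    for doc_id, vector in vectors.items():
        for seed, members in clusters:
            if cosine_similarity(seed, vector) >= threshold:
                members.append(doc_id)
                break
        else:
            # No existing cluster matched, so this document seeds a new one.
            clusters.append((vector, [doc_id]))
    return [members for _, members in clusters]


# Hypothetical embeddings forming two obvious groups.
vectors = {"a": [1.0, 0.0], "b": [0.9, 0.1], "c": [0.0, 1.0], "d": [0.1, 0.9]}
groups = cluster_by_similarity(vectors)
```

The result depends on input order and on the threshold; algorithms such as k-means or agglomerative clustering are the usual choice when cluster quality matters.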