Glucose Base Ja
G
Glucose Base Ja
Developed by pkshatech
GLuCoSE is a Japanese text embedding model based on LUKE, suitable for sentence similarity and semantic search tasks.
Downloads 70.71k
Release Time : 7/16/2023
Model Overview
GLuCoSE is a versatile and user-friendly Japanese text embedding model, constructed through mixed training on web data and various natural language inference and search-related datasets.
Model Features
Multi-task Training
Mixed training on web data and various natural language inference and search-related datasets enhances the model's versatility.
High-dimensional Output
Output dimension of 768, capable of capturing rich semantic information.
Long Text Support
Supports up to 512 tokens, suitable for processing longer Japanese texts.
Mean Pooling
Uses mean pooling to generate sentence embeddings, improving the stability of sentence representations.
Model Capabilities
Sentence Vector Similarity Calculation
Semantic Search
Japanese Text Feature Extraction
Use Cases
Information Retrieval
Document Search
Using semantic similarity to find relevant content in a document library
Achieves a Top-1 accuracy of 36.1% on the AIO3 development set
Natural Language Processing
Sentence Similarity Calculation
Calculating the semantic similarity between two Japanese sentences
Achieves a Spearman correlation coefficient of 0.864 on the JSTS development set
Featured Recommended AI Models