G

Glucose Base Ja V2

Developed by pkshatech
General-purpose Japanese text embedding model, optimized for retrieval tasks with excellent performance on CPUs
Downloads 25.25k
Release Time : 8/22/2024

Model Overview

A universal embedding model specialized in Japanese text processing, particularly excelling in retrieval tasks and sentence similarity calculations, suitable for query-based passage retrieval systems

Model Features

Retrieval task optimization
Demonstrates top performance among same-size models in retrieval tasks like MIRACL
Japanese-specific optimization
Specially optimized and trained for Japanese text processing
Lightweight and efficient
Supports CPU operation, suitable for resource-limited environments
Multi-stage training
Fine-tuned through integrated distillation and multi-stage contrastive learning

Model Capabilities

Sentence similarity calculation
Semantic retrieval
Feature extraction
Passage retrieval

Use Cases

Information retrieval
Enterprise knowledge base retrieval
Used for semantic retrieval systems in corporate knowledge bases
Achieves 85.5 Recall@5 on MIRACL-ja dataset
Question answering system
Building retrieval-based question answering systems
Achieves 60.6 nDCG@10 on JQaRA dataset
Text analysis
Text clustering
Semantic clustering analysis for Japanese texts
Semantic similarity calculation
Calculating semantic similarity between sentences
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase