Ruri Small V2
Ruri is a Japanese universal text embedding model focused on sentence similarity calculation and feature extraction, trained based on the cl-nagoya/ruri-pt-small-v2 foundation model.
Downloads 55.95k
Release Time : 12/5/2024
Model Overview
This model is primarily used for sentence similarity calculation and feature extraction of Japanese text, supporting the addition of query prefixes for semantic search tasks.
Model Features
Optimized Japanese Text Processing
Specially optimized for Japanese text, capable of accurately capturing Japanese semantic features
Prefix Awareness
Supports distinguishing between query and document text by adding 'クエリ:' (query:) and '文章:' (document:) prefixes
Efficient Performance
Achieves performance comparable to larger models with a parameter size of 68M
Model Capabilities
Japanese text embedding
Sentence similarity calculation
Semantic search
Feature extraction
Use Cases
Information Retrieval
Q&A System
Used to build Japanese Q&A systems, matching questions with relevant answers
Scored 73.94 in retrieval tasks on JMTEB evaluation
Text Analysis
Semantic Similarity Analysis
Calculates the semantic similarity between two Japanese text segments
Scored 82.91 in semantic similarity tasks on JMTEB
Featured Recommended AI Models