Ruri Large V2
Ruri is a Japanese universal text embedding model, focusing on sentence similarity calculation and feature extraction, with support for long text processing.
Downloads 3,672
Release Time : 12/6/2024
Model Overview
This model is primarily used for Japanese sentence similarity calculation and text feature extraction, capable of generating high-quality text embeddings suitable for tasks such as information retrieval and cluster analysis.
Model Features
Long Text Support
Supports sequences up to 512 tokens, suitable for processing longer texts
High Performance
Excellent performance in JMTEB benchmark tests, with an average score of 74.55
Prefix Awareness
Can distinguish between query text and paragraph text, optimizing similarity calculation through specific prefixes
Model Capabilities
Japanese sentence similarity calculation
Text feature extraction
Information retrieval
Text clustering
Semantic search
Use Cases
Information Retrieval
Q&A System
Used to find the most relevant answer passages for user queries
Achieved a high score of 93.21 in reranking tasks
Text Analysis
Document Clustering
Automatically groups semantically similar documents
Scored 52.14 in clustering tasks
Featured Recommended AI Models