R

Ruri Base V2

Developed by cl-nagoya
Ruri is a general-purpose text embedding model optimized for Japanese, based on the Sentence Transformers architecture, specifically designed for sentence similarity computation and feature extraction tasks.
Downloads 12.77k
Release Time : 12/5/2024

Model Overview

This model is primarily used for semantic similarity computation and feature extraction of Japanese text, supporting various natural language processing tasks such as retrieval, classification, and clustering.

Model Features

Japanese Optimization
Specially optimized for Japanese text, excelling in Japanese semantic understanding tasks
Prefix Awareness
Supports differentiated processing of queries and documents by adding prefixes (クエリ/文章) to enhance semantic understanding accuracy
Efficient Inference
Designed with a lightweight architecture, achieving fast inference while maintaining high performance
Multi-task Support
Delivers balanced performance across various tasks including retrieval, semantic similarity, classification, and re-ranking

Model Capabilities

Japanese Text Feature Extraction
Sentence Similarity Computation
Semantic Retrieval
Text Classification
Information Re-ranking
Text Clustering

Use Cases

Information Retrieval
Question Answering System
Used to match user queries with relevant answers in a knowledge base
Achieved 72.33 points in the JMTEB retrieval task
Document Similarity Analysis
Computes semantic similarity between documents for deduplication or recommendation
Achieved 83.03 points in the JMTEB semantic similarity task
Content Organization
Text Clustering
Automatically groups large volumes of text based on semantic similarity
Achieved 51.38 points in the JMTEB clustering task
Content Classification
Classifies text based on semantic features
Achieved 75.34 points in the JMTEB classification task
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase