R

Ruri Small

Developed by cl-nagoya
Ruri is a model specialized in Japanese text embedding, capable of efficiently calculating sentence similarity and extracting text features.
Downloads 11.75k
Release Time : 8/28/2024

Model Overview

This model is a general-purpose Japanese text embedding model, primarily used for sentence similarity calculation and feature extraction. Based on the DistilBert architecture, it supports a maximum sequence length of 512 tokens with an output dimension of 768.

Model Features

Efficient Japanese Processing
Optimized specifically for Japanese text, accurately understanding Japanese semantic features
High Performance
Outperforms similar models in JMTEB evaluations
Lightweight
A small model with only 68M parameters, suitable for resource-limited environments
Long Text Support
Supports a maximum sequence length of 512 tokens

Model Capabilities

Japanese Text Feature Extraction
Sentence Similarity Calculation
Semantic Search
Text Clustering

Use Cases

Information Retrieval
Semantic Search
Find relevant documents based on query semantics
Achieved a score of 69.41 in the JMTEB retrieval task
Text Analysis
Text Clustering
Group semantically similar texts together
Achieved a score of 51.19 in the JMTEB clustering task
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase