A

Amber Base

Developed by retrieva-jp
Amber Base is a Japanese-English sentence encoder model based on modernbert-ja-130m, specializing in sentence similarity calculation and feature extraction tasks.
Downloads 213
Release Time : 3/7/2025

Model Overview

This model is primarily used for Japanese and English sentence similarity calculation, feature extraction, as well as text classification and clustering tasks. It performs well on the MTEB benchmark.

Model Features

Bilingual support
Supports sentence processing in both Japanese and English
Multi-task processing
Capable of handling various tasks including sentence similarity, feature extraction, classification, and clustering
MTEB benchmark validation
Demonstrates strong performance across multiple MTEB benchmarks

Model Capabilities

Sentence similarity calculation
Feature extraction
Text classification
Text clustering
Bilingual processing

Use Cases

Information retrieval
Cross-language document retrieval
Searching for similar content between Japanese and English documents
Achieved ndcg@10 of 48.068 in MTEB ArguAna test
Text analysis
Academic paper clustering
Topic clustering analysis of academic papers
Achieved v_measure of 55.655 in MTEB ArXivHierarchicalClusteringP2P test
Content classification
Counterfactual classification
Counterfactual classification of Amazon reviews
Achieved accuracy of 68.164% in MTEB AmazonCounterfactualClassification test
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase