Modernbert Embed Base Legal Matryoshka 2

Developed by manishh16
A legal-domain embedding model built on the ModernBERT architecture, supporting embeddings at multiple dimensions and sentence similarity calculation
Downloads 14
Release Time: 3/28/2025

Model Overview

This model is a legal text embedding model built on the ModernBERT architecture, designed for sentence similarity calculation and feature extraction on legal documents. It is trained with MatryoshkaLoss, so its embeddings can be used at several different dimensions.

Model Features

Multi-dimensional Embedding Support
Supports embedding dimensions such as 768, 512, 256, 128, and 64, so the size can be chosen to fit the application
Legal Domain Optimization
Optimized for legal texts to better capture legal terminology and document structure
Matryoshka Training Method
Trained with MatryoshkaLoss so that embeddings remain useful at each of the supported dimensions
Efficient Retrieval Capability
Performs well on information retrieval tasks, particularly legal document retrieval
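
The multi-dimensional support described above can be illustrated with a minimal sketch, assuming the usual Matryoshka convention that a smaller embedding is the leading slice of the full vector, rescaled to unit length. The function name `truncate_embedding` and the placeholder vector are hypothetical and are not part of the model's own API; a real vector would come from the model.

```python
import math

# Dimensions listed in the model card.
MATRYOSHKA_DIMS = (768, 512, 256, 128, 64)


def truncate_embedding(vector, dim):
    """Keep the first `dim` components and rescale the result to unit length."""
    if dim not in MATRYOSHKA_DIMS:
        raise ValueError(f"unsupported dimension: {dim}")
    if len(vector) < dim:
        raise ValueError("vector is shorter than the requested dimension")
    head = vector[:dim]
    norm = math.sqrt(sum(x * x for x in head))
    if norm == 0.0:
        return list(head)  # nothing to rescale for an all-zero vector
    return [x / norm for x in head]


# Placeholder vector standing in for a real 768-dimensional model output.
full = [math.sin(i + 1) for i in range(768)]
small = truncate_embedding(full, 256)
print(len(small))
```

Smaller dimensions reduce storage and speed up similarity search, usually at some cost in retrieval quality. With the Sentence Transformers library, the same truncation can typically be requested through the `truncate_dim` argument when loading a model.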

Model Capabilities

Legal text feature extraction
Sentence similarity calculation
Information retrieval
Multi-dimensional embedding representation
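
Sentence similarity and retrieval over embeddings usually come down to cosine similarity followed by a sort. The sketch below assumes that convention and uses toy 3-dimensional vectors in place of real model output. The identifiers `clause_a`, `clause_b`, and `clause_c` are made up for illustration.

```python
import math


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def rank_by_similarity(query_vec, doc_vecs):
    """Return (doc_id, score) pairs sorted from most to least similar."""
    scored = [
        (doc_id, cosine_similarity(query_vec, vec))
        for doc_id, vec in doc_vecs.items()
    ]
    return sorted(scored, key=lambda pair: pair[1], reverse=True)


# Toy vectors; real embeddings would have 64 to 768 components.
query = [1.0, 0.0, 0.0]
docs = {
    "clause_a": [0.9, 0.1, 0.0],
    "clause_b": [0.0, 1.0, 0.0],
    "clause_c": [0.5, 0.5, 0.0],
}
ranking = rank_by_similarity(query, docs)
print([doc_id for doc_id, _ in ranking])
```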

Use Cases

Legal Document Processing
  Legal Case Retrieval
    Retrieve relevant legal cases based on query statements
    Achieves 0.59 accuracy@1 at 768 dimensions
  Contract Clause Matching
    Match similar clauses in contracts
    Achieves 0.69 accuracy@5 at 512 dimensions
Legal Research Assistance
  Case Law Analysis
    Analyze similar judgments in case law
    Achieves 0.72 recall@10 at 256 dimensions
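
The accuracy@k and recall@k figures quoted above are standard retrieval metrics. The sketch below shows how they are commonly computed, assuming the usual definitions: accuracy@k counts a query as a hit if any relevant document appears in the top k results, and recall@k is the fraction of relevant documents found in the top k. The queries and document ids are invented for illustration and do not reproduce the evaluation data behind the reported numbers.

```python
def accuracy_at_k(ranked_ids, relevant_ids, k):
    """1.0 if any relevant id appears in the top k results, else 0.0."""
    return 1.0 if any(doc_id in relevant_ids for doc_id in ranked_ids[:k]) else 0.0


def recall_at_k(ranked_ids, relevant_ids, k):
    """Fraction of the relevant ids that appear in the top k results."""
    if not relevant_ids:
        return 0.0
    hits = len(set(ranked_ids[:k]) & set(relevant_ids))
    return hits / len(relevant_ids)


def mean(values):
    values = list(values)
    return sum(values) / len(values) if values else 0.0


# Each entry: (ranked result ids for one query, set of relevant ids).
queries = [
    (["d1", "d2", "d3"], {"d1"}),
    (["d4", "d5", "d6"], {"d6", "d9"}),
]

acc_at_1 = mean(accuracy_at_k(ranked, rel, 1) for ranked, rel in queries)
rec_at_3 = mean(recall_at_k(ranked, rel, 3) for ranked, rel in queries)
print(acc_at_1, rec_at_3)
```

Averaging the per-query values over an evaluation set gives a single score per metric and embedding dimension, which is the form in which the figures above are reported.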