ModernBERT Base ColBERT

Developed by Y-J-Ju
This is a PyLate model fine-tuned from answerdotai/ModernBERT-base on the MS-MARCO dataset, designed for sentence similarity calculation and document retrieval.
Downloads 88
Release Time: 1/3/2025

Model Overview

The model maps sentences and paragraphs to sequences of 128-dimensional dense vectors (one vector per token rather than one per text) and scores semantic similarity between two texts with the MaxSim operator. This makes it suitable for information retrieval and re-ranking tasks.
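As a rough illustration of the MaxSim operator mentioned above, the sketch below scores a query against a document by taking, for each query vector, its best dot product with any document vector and summing those maxima. The helper names (`dot`, `maxsim`) and the toy 2-dimensional vectors are assumptions for illustration only; the model itself emits 128-dimensional vectors, and this is not PyLate's implementation.

```python
from typing import Sequence

Vector = Sequence[float]


def dot(u: Vector, v: Vector) -> float:
    """Dot product of two equal-length vectors."""
    return sum(a * b for a, b in zip(u, v))


def maxsim(query_vecs: Sequence[Vector], doc_vecs: Sequence[Vector]) -> float:
    """Sum, over query vectors, of the best match among document vectors."""
    return sum(max(dot(q, d) for d in doc_vecs) for q in query_vecs)


# Toy 2-dimensional embeddings standing in for real 128-dimensional ones.
query = [[1.0, 0.0], [0.0, 1.0]]
doc_a = [[1.0, 0.0], [0.0, 1.0], [0.5, 0.5]]  # matches both query vectors
doc_b = [[0.5, 0.5]]                          # partial match only

score_a = maxsim(query, doc_a)
score_b = maxsim(query, doc_b)
print(score_a, score_b)
```

Because each query vector picks its own best match, a document that covers every part of the query scores higher than one that is only loosely related overall.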

Model Features

Efficient Retrieval
Uses a Voyager HNSW index for fast document retrieval
Multi-vector Representation
Generates a sequence of 128-dimensional dense vectors for each text instead of a single vector, preserving more semantic information
Distillation Training
Trained with a distillation loss function to enhance model performance
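The page does not say which distillation loss was used. One common form, shown here purely as an assumption, is the KL divergence between the teacher's and the student's softmax-normalized relevance scores over the same candidate documents. The function names and the example scores below are hypothetical.

```python
import math
from typing import List, Sequence


def softmax(scores: Sequence[float]) -> List[float]:
    """Numerically stable softmax over a list of scores."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def kl_distillation_loss(
    student_scores: Sequence[float], teacher_scores: Sequence[float]
) -> float:
    """KL(teacher || student) over the candidate documents of one query."""
    p = softmax(teacher_scores)  # target distribution from the teacher
    q = softmax(student_scores)  # distribution predicted by the student
    return sum(pi * math.log(pi / qi) for pi, qi in zip(p, q) if pi > 0.0)


# Hypothetical scores for three candidate documents of a single query.
teacher = [9.0, 2.0, 1.0]
aligned_student = [9.0, 2.0, 1.0]
misaligned_student = [1.0, 2.0, 9.0]

loss_aligned = kl_distillation_loss(aligned_student, teacher)
loss_misaligned = kl_distillation_loss(misaligned_student, teacher)
print(loss_aligned, loss_misaligned)
```

Under this assumed loss, the student is penalized in proportion to how far its ranking distribution departs from the teacher's, and the loss vanishes when the two agree.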

Model Capabilities

Semantic Similarity Calculation
Document Retrieval
Query Re-ranking
Feature Extraction

Use Cases

Information Retrieval
Document Search
Retrieve the most relevant documents from a collection based on queries
Performs well on standard retrieval datasets like MS-MARCO
Search Result Re-ranking
Refine the ranking of initial retrieval results
Can improve the accuracy and relevance of retrieval systems
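The re-ranking use case can be sketched as follows: given candidates from a first-stage retriever, rescore each with MaxSim against the query and sort by score. The `rerank` function, the document ids, and the toy 2-dimensional embeddings are all hypothetical; in practice the embeddings would come from the model, and this is not PyLate's own re-ranking API.

```python
from typing import Dict, List, Sequence, Tuple

Vector = Sequence[float]


def maxsim(query_vecs: Sequence[Vector], doc_vecs: Sequence[Vector]) -> float:
    """Sum over query vectors of the maximum dot product with any doc vector."""
    return sum(
        max(sum(a * b for a, b in zip(q, d)) for d in doc_vecs)
        for q in query_vecs
    )


def rerank(
    query_vecs: Sequence[Vector], candidates: Dict[str, List[Vector]]
) -> List[Tuple[str, float]]:
    """Return (doc_id, score) pairs sorted from most to least relevant."""
    scored = [(doc_id, maxsim(query_vecs, vecs)) for doc_id, vecs in candidates.items()]
    return sorted(scored, key=lambda pair: pair[1], reverse=True)


# Hypothetical first-stage candidates with toy token embeddings.
query = [[1.0, 0.0], [0.0, 1.0]]
candidates = {
    "d1": [[0.2, 0.1]],
    "d2": [[0.9, 0.1], [0.1, 0.8]],
    "d3": [[0.5, 0.5]],
}

ranked = rerank(query, candidates)
ranked_ids = [doc_id for doc_id, _ in ranked]
print(ranked_ids)
```

Re-ranking only touches the short candidate list, so the more expensive token-level scoring stays affordable even when the full collection is large.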