M

Modernce Large Sts

Developed by dleemiller
High-performance semantic similarity evaluation model optimized for text comparison tasks
Downloads 25
Release Time : 1/13/2025

Model Overview

This model is a cross-encoder based on the ModernBERT-large architecture, specifically designed for evaluating semantic similarity of text pairs, supporting long text processing with outstanding evaluation accuracy.

Model Features

Outstanding Performance
Achieves Pearson coefficient of 0.9256 and Spearman coefficient of 0.9215 on the STS-Benchmark test set
Efficient Architecture
Designed based on ModernBERT-large, with faster inference speed
Long Text Support
Supports processing sequences up to 8192 tokens, especially suitable for large language model output evaluation
Composite Training
Pre-trained on wiki-sim dataset followed by fine-tuning on stsb dataset

Model Capabilities

Semantic similarity evaluation
Text pair scoring
Long text processing

Use Cases

Natural Language Processing
Large Language Model Output Evaluation
Evaluates semantic similarity between LLM-generated text and reference text
Provides precise similarity scores in the 0-1 range
Question Answering Systems
Determines matching degree between questions and candidate answers
Improves accuracy of QA systems
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
Š 2025AIbase