
GTE ModernBERT Base

Developed by Alibaba-NLP
A text embedding model built on the ModernBERT pre-trained encoder. It accepts inputs of up to 8192 tokens and reports strong results on the MTEB, LoCO, and COIR benchmarks.
Downloads: 74.52k
Release Time: 1/20/2025

Model Overview

This is a text embedding model developed by Alibaba Group's Tongyi Lab. It specializes in English text and is suited to tasks such as information retrieval and semantic similarity calculation.

Model Features

Long Text Processing Capability
Accepts inputs of up to 8192 tokens, which makes it suitable for long documents.
High Efficiency
Supports Flash Attention 2 acceleration for efficient inference on GPUs.
Multi-scenario Applicability
Performs well across the MTEB, LoCO, and COIR benchmarks.
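
Documents longer than the 8192-token limit have to be truncated or split before embedding. The sketch below shows one possible way to split a tokenized document into windows that fit the limit. The function name split_into_windows and the overlap parameter are illustrative assumptions, not part of the model's API, and a real pipeline may need to reserve a few positions per window for special tokens.

```python
from typing import List, Sequence

MAX_TOKENS = 8192  # input limit stated in the feature list above


def split_into_windows(
    token_ids: Sequence[int],
    max_tokens: int = MAX_TOKENS,
    overlap: int = 0,
) -> List[List[int]]:
    """Split a token sequence into windows of at most max_tokens tokens.

    Consecutive windows share `overlap` tokens so that text near a
    boundary appears in both neighbours. Hypothetical helper for
    illustration only.
    """
    if max_tokens <= 0:
        raise ValueError("max_tokens must be positive")
    if not 0 <= overlap < max_tokens:
        raise ValueError("overlap must satisfy 0 <= overlap < max_tokens")

    step = max_tokens - overlap
    windows: List[List[int]] = []
    start = 0
    while start < len(token_ids):
        windows.append(list(token_ids[start:start + max_tokens]))
        # Stop once the current window already reaches the end of the input.
        if start + max_tokens >= len(token_ids):
            break
        start += step
    return windows
```

Each window can then be embedded separately and the resulting vectors pooled or indexed individually, depending on the application.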

Model Capabilities

Text Embedding
Semantic Similarity Calculation
Information Retrieval
Long Document Processing
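
As a rough illustration of how these capabilities fit together, the sketch below embeds texts and ranks candidates by cosine similarity. It assumes the model is published on the Hugging Face Hub as Alibaba-NLP/gte-modernbert-base and can be loaded through the third-party sentence-transformers package. Both are assumptions not stated on this page, and embed_texts and rank_by_similarity are hypothetical helper names.

```python
import math
from typing import List, Sequence, Tuple


def embed_texts(texts: List[str]) -> List[List[float]]:
    """Hypothetical helper: embed texts with sentence-transformers.

    Requires `pip install sentence-transformers` and a model download;
    the Hub ID below is an assumption.
    """
    from sentence_transformers import SentenceTransformer  # lazy third-party import

    model = SentenceTransformer("Alibaba-NLP/gte-modernbert-base")
    return model.encode(texts).tolist()


def cosine_similarity(a: Sequence[float], b: Sequence[float]) -> float:
    """Cosine of the angle between two vectors; 0.0 if either is all zeros."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def rank_by_similarity(
    query_vec: Sequence[float],
    candidate_vecs: Sequence[Sequence[float]],
) -> List[Tuple[int, float]]:
    """Return (candidate index, score) pairs, most similar first."""
    scored = [
        (i, cosine_similarity(query_vec, vec))
        for i, vec in enumerate(candidate_vecs)
    ]
    return sorted(scored, key=lambda pair: pair[1], reverse=True)
```

In use, the first vector returned by embed_texts for a query and its candidates would be passed as query_vec and the rest as candidate_vecs. The ranking functions work on any vectors, so they can be tried without downloading the model.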

Use Cases

Information Retrieval
Document Retrieval: Quickly retrieves relevant content from large-scale document libraries.
Result: NDCG@10 of 88.88 on the LoCO evaluation.
Semantic Similarity
Question-Answer Matching: Calculates the semantic similarity between questions and candidate answers.
Result: 81.57 on MTEB semantic similarity tasks.
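
For readers unfamiliar with the NDCG@10 metric quoted for LoCO above, the following is a minimal sketch of how such a score is commonly computed for a single query. It uses the linear-gain form of DCG and, as a simplifying assumption, derives the ideal ordering from the retrieved list only. It is not the LoCO evaluation code, and benchmark tables usually report the value multiplied by 100.

```python
import math
from typing import Sequence


def dcg_at_k(relevances: Sequence[float], k: int) -> float:
    """Discounted cumulative gain over the top-k results.

    relevances[i] is the relevance label of the result at rank i + 1;
    each gain is discounted by log2(rank + 1).
    """
    return sum(
        rel / math.log2(position + 2)
        for position, rel in enumerate(relevances[:k])
    )


def ndcg_at_k(relevances: Sequence[float], k: int = 10) -> float:
    """DCG@k divided by the DCG@k of the best possible ordering.

    Simplification: the ideal ordering is built from the same list of
    labels, so relevant documents that were never retrieved are ignored.
    """
    ideal = dcg_at_k(sorted(relevances, reverse=True), k)
    if ideal == 0.0:
        return 0.0
    return dcg_at_k(relevances, k) / ideal
```

A ranking that places every relevant document first scores 1.0, and the score falls as relevant documents move down the list.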