C

Colbert ModernBERT Base Turkish Uncased

Developed by 99eren99
This is a Turkish language model fine-tuned from ModernBERT-base-Turkish-uncased-mlm using PyLate, designed for sentence similarity calculation and document reranking.
Downloads 74
Release Time : 2/14/2025

Model Overview

The model maps sentences and paragraphs into 128-dimensional dense vector sequences, supporting semantic text similarity computation using the MaxSim operator, suitable for Turkish text retrieval and reranking tasks.

Model Features

Long-context processing
Supports document processing up to 8192 tokens, suitable for long-text retrieval scenarios.
Efficient retrieval
Utilizes Voyager HNSW indexing for fast document retrieval.
Multi-granular representation
Generates 128-dimensional dense vector sequences, preserving fine-grained semantic information of text.

Model Capabilities

Semantic text similarity calculation
Document retrieval
Query-document matching
Search result reranking

Use Cases

Information retrieval
Document search engine
Building a Turkish document search engine to improve search result relevance
Improvement in nDCG and recall metrics
QA systems
Used for reranking answer candidates in question-answering systems
Increased answer accuracy
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
Š 2025AIbase