T

Turkish Base Bert Uncased Mean Nli Stsb Tr

Developed by atasoglu
This is a sentence embedding model based on the Turkish BERT model, suitable for sentence similarity calculation and semantic search tasks.
Downloads 744
Release Time : 2/17/2024

Model Overview

This model can map Turkish sentences and paragraphs into a 768-dimensional dense vector space, suitable for tasks like clustering or semantic search. Fine-tuned from ytu-ce-cosmos/turkish-base-bert-uncased, trained on datasets including nli_tr and emrecan/stsb-mt-turkish.

Model Features

Turkish Language Optimization
Specially optimized and fine-tuned for Turkish, suitable for processing Turkish text.
Sentence Embedding
Can map sentences and paragraphs into a 768-dimensional dense vector space, preserving semantic information.
Lowercase Processing
Requires manual conversion of input text to lowercase, including special characters like 'I' to 'ı'.
High Performance
Excellent performance on the STS-b test set, achieving a cosine similarity Pearson score of 0.8401.

Model Capabilities

Sentence similarity calculation
Semantic search
Text clustering
Feature extraction

Use Cases

Information Retrieval
Semantic Search
Used to build a Turkish semantic search engine, returning documents semantically similar to the query.
Improves the relevance of search results
Text Analysis
Document Clustering
Automatically clusters Turkish documents to discover groups of similar documents.
Improves document organization efficiency
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase