T

Tooka SBERT V2 Large

Developed by PartAI
A semantic text similarity and embedding model specifically designed for Persian, capable of mapping sentences into a dense vector space where semantically similar texts are positioned close to each other.
Downloads 127
Release Time : 5/13/2025

Model Overview

This model is a Sentence Transformers model for semantic text similarity and embedding tasks, available in both small and large sizes.

Model Features

Bilingual Support
Optimized specifically for Persian while also supporting English tasks
Two-stage Training
Adopts a two-stage training strategy of pre-training and fine-tuning to enhance model performance
Efficient Similarity Calculation
Capable of quickly calculating semantic similarity scores between sentences

Model Capabilities

Sentence similarity calculation
Text feature extraction
Semantic search
Information retrieval

Use Cases

Information Retrieval
Document Similarity Search
Finding semantically similar documents in Persian document collections
Achieved a retrieval task score of 59.80 on the PTEB benchmark
Text Classification
Sentiment Analysis
Performing sentiment classification on Persian texts
Achieved an average score of 74.73 on PTEB classification tasks
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase