R

Ruropebert E5 Base 2k

Developed by Tochka-AI
A Russian sentence encoder model based on the RoPEBert architecture, supporting a context length of 2048 tokens and excelling in the encodechka benchmark tests.
Downloads 2,422
Release Time : 2/22/2024

Model Overview

A Russian sentence embedding model developed by Tochka AI, utilizing the RoPEBert architecture, primarily for feature extraction and sentence similarity calculation in Russian text.

Model Features

Long context support
Supports processing contexts up to 2048 tokens and can be extended to longer contexts.
Efficient attention mechanism
Supports SDPA efficient attention implementation to enhance processing speed.
RoPE scaling
Supports linear and dynamic RoPE scaling types, allowing for extended model context windows.
Built-in poolers
Includes built-in implementations of mean and first_token_transform poolers for direct sentence embedding extraction.

Model Capabilities

Russian text feature extraction
Sentence similarity calculation
Text classification
Long text processing

Use Cases

Text similarity
Sentence similarity calculation
Calculates semantic similarity between Russian sentences.
Measures sentence similarity via cosine similarity scores.
Text classification
Russian text classification
Can perform text classification tasks by adding a classification head.
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase