U

USER Base

Developed by deepvk
A sentence embedding extraction model specifically designed for Russian, capable of mapping sentences and paragraphs into a 768-dimensional dense vector space, suitable for tasks like clustering or semantic search.
Downloads 2,337
Release Time : 6/10/2024

Model Overview

USER is a Russian universal sentence encoder based on sentence-transformers, specifically trained for Russian and applicable to various natural language processing tasks.

Model Features

Russian optimization
Specifically trained for Russian, excelling in Russian language tasks
Multi-stage training
Adopts a two-stage training process combining contrastive pre-training and model fusion techniques
Prompt optimization
Distinguishes different task types through query and passage prompts
Lightweight and efficient
Only 85M parameters, achieving optimal performance among models of similar scale

Model Capabilities

Sentence embedding extraction
Semantic similarity calculation
Text clustering
Information retrieval
Feature extraction

Use Cases

Information retrieval
Q&A systems
Used to match user queries with relevant document passages
Achieves a recall@100 of 0.763 on the MIRACL dataset
Text analysis
Semantic similarity calculation
Calculates semantic similarity between two sentences or paragraphs
Scores an average of 0.772 on the Encodechka benchmark
Text clustering
Automatically groups texts with similar content
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase