R

Ru Longformer Tiny 16384

Developed by kazzand
A tiny Longformer model specifically designed for Russian, supporting a context length of 16,384 tokens, initialized with rubert-tiny2 weights, suitable for Russian and English text processing.
Downloads 263
Release Time : 7/12/2023

Model Overview

This model is a Russian text processing model based on the Longformer architecture, optimized for long text processing, and can be used for generating text embeddings or fine-tuning for specific tasks.

Model Features

Long Text Processing Capability
Supports context lengths of up to 16,384 tokens, suitable for processing long documents and book content.
Bilingual Support
Initialized with rubert-tiny2 weights, capable of processing both Russian and English texts.
Lightweight Architecture
Adopts a tiny design with 12 attention heads and 3 hidden layers, computationally efficient.

Model Capabilities

Text embedding generation
Long text processing
Russian text understanding
English text understanding

Use Cases

Text Processing
Russian Book Content Analysis
Processing and analyzing long text content in Russian books.
News Article Summarization
Summarizing and extracting key information from Russian news articles.
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
Š 2025AIbase