Rugpt3medium Sum Gazeta
Russian abstractive summarization model based on rugpt3medium_based_on_gpt2, specifically trained on the Gazeta dataset
Downloads 1,228
Release Time: 3/2/2022
Model Overview
This is a Russian causal language model for abstractive summarization of news articles. It is fine-tuned on the Gazeta dataset from rugpt3medium, a model based on the GPT-2 architecture.
Model Features
Russian language optimization
Trained on Russian text and intended for Russian summarization tasks
Gazeta dataset adaptation
Fine-tuned using the Gazeta news dataset, making it particularly suitable for generating summaries of news articles
Abstractive summarization capability
Generates abstractive summaries that rephrase the article, rather than only extracting key sentences
Model Capabilities
Russian text comprehension
News article summarization
Long text processing (up to 600 tokens)
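A minimal sketch of how the 600-token limit might be applied when calling the model through the Hugging Face transformers library. The repository id IlyaGusev/rugpt3medium_sum_gazeta, the use of the tokenizer's separator token to mark the end of the article, and the generation settings (max_new_tokens, no_repeat_ngram_size) are assumptions, not details given on this page. Whether the limit counts the separator is also not stated, so the helper build_input_ids (a hypothetical name) truncates the article first and then appends it.

```python
from typing import List


def build_input_ids(article_ids: List[int], sep_token_id: int, max_tokens: int = 600) -> List[int]:
    """Truncate the article to max_tokens token ids, then append the separator id."""
    if max_tokens < 0:
        raise ValueError("max_tokens must be non-negative")
    return list(article_ids[:max_tokens]) + [sep_token_id]


def summarize(article_text: str, model_name: str = "IlyaGusev/rugpt3medium_sum_gazeta") -> str:
    """Generate a summary. model_name is an assumed Hugging Face repository id."""
    # Imported lazily so build_input_ids can be used without the heavy dependencies.
    import torch
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(model_name)
    model = AutoModelForCausalLM.from_pretrained(model_name)

    # Assumption: the tokenizer defines a separator token that ends the article.
    if tokenizer.sep_token_id is None:
        raise ValueError("tokenizer has no separator token")

    article_ids = tokenizer(article_text, add_special_tokens=False)["input_ids"]
    input_ids = build_input_ids(article_ids, tokenizer.sep_token_id)
    input_tensor = torch.tensor([input_ids], dtype=torch.long)

    with torch.no_grad():
        output_ids = model.generate(
            input_ids=input_tensor,
            max_new_tokens=200,      # assumed cap on summary length
            no_repeat_ngram_size=4,  # assumed setting to reduce repetition
        )

    # A causal model returns prompt + continuation; keep only the continuation.
    summary_ids = output_ids[0][len(input_ids):]
    return tokenizer.decode(summary_ids, skip_special_tokens=True).strip()
```

Calling summarize(article_text) downloads the model weights on first use. Articles longer than the limit are cut off at the end, so any content past the first 600 tokens does not influence the summary.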
Use Cases
News media
Automated news summarization
Automatically generates concise summaries for news articles
Achieved a ROUGE-1 F-score (R-1-f) of 24.1 on the Gazeta test set
Content aggregation
Multi-source news summarization
Generates uniformly formatted summaries for Russian news from different sources
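The R-1-f figure quoted above is a ROUGE-1 F-score, which measures unigram overlap between a generated summary and a reference summary. The sketch below shows the basic computation on pre-tokenized text. It is a simplified illustration, not the evaluation script behind the reported number: the function name rouge1_f is hypothetical, tokenization and any stemming are left to the caller, and the result is on a 0 to 1 scale, whereas the reported 24.1 is presumably on a 0 to 100 scale.

```python
from collections import Counter
from typing import List


def rouge1_f(reference: List[str], candidate: List[str]) -> float:
    """ROUGE-1 F-score: harmonic mean of unigram precision and recall."""
    if not reference or not candidate:
        return 0.0

    # Clipped overlap: each token counts at most as often as it appears in both.
    overlap = sum((Counter(reference) & Counter(candidate)).values())
    if overlap == 0:
        return 0.0

    precision = overlap / len(candidate)  # share of candidate tokens found in the reference
    recall = overlap / len(reference)     # share of reference tokens found in the candidate
    return 2 * precision * recall / (precision + recall)
```

For example, a 3-token candidate that shares 2 tokens with a 4-token reference has precision 2/3 and recall 1/2, which gives an F-score of 4/7.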
© 2025 AIbase