VikhrT5-3b
Developed by Vikhrmodels
A Russian-optimized model based on FLAN-T5 3B that outperforms FRED-T5-XL
Downloads 35
Release date: 12/17/2023
Model Overview
VikhrT5-3b is a Russian large language model built on FLAN-T5 3B and tuned for Russian natural language processing tasks. It is reported to outperform FRED-T5-XL and FLAN-T5-XL on multiple Russian benchmarks.
Model Features
Russian optimization
Tuned specifically for Russian-language tasks
Performance advantage
Outperforms the FRED-T5-XL and FLAN-T5-XL models on multiple Russian benchmarks
Large parameter scale
3 billion parameters, the same scale as the FLAN-T5 3B base model, for language understanding and generation
Model Capabilities
Russian text understanding
Russian text generation
Russian question answering
Russian reasoning
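The capabilities above can be tried with the Hugging Face transformers library. The following is a minimal sketch, not an official usage recipe: the repository id Vikhrmodels/VikhrT5-3b is an assumption and should be verified before use, and the build_prompt helper with its "Вопрос / Ответ" format is hypothetical, since this page does not document a prompt format.

```python
# Assumed Hugging Face repository id; verify that it exists before use.
MODEL_ID = "Vikhrmodels/VikhrT5-3b"


def build_prompt(question: str) -> str:
    """Hypothetical prompt format (not documented by the model authors)."""
    return f"Вопрос: {question.strip()}\nОтвет:"


def generate_answer(question: str, max_new_tokens: int = 64) -> str:
    """Load the model and generate an answer to a Russian question.

    transformers is imported lazily so that build_prompt can be used
    without the library installed. Loading a 3B-parameter model needs
    several gigabytes of memory.
    """
    from transformers import AutoModelForSeq2SeqLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(MODEL_ID)
    # T5-family checkpoints are encoder-decoder (seq2seq) models.
    model = AutoModelForSeq2SeqLM.from_pretrained(MODEL_ID)

    inputs = tokenizer(build_prompt(question), return_tensors="pt")
    output_ids = model.generate(**inputs, max_new_tokens=max_new_tokens)
    return tokenizer.decode(output_ids[0], skip_special_tokens=True)
```

Calling generate_answer("Какая столица России?") would download the weights on first use. The generated text depends on the checkpoint and on the prompt format, so no expected output is shown here.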
Use Cases
Natural language processing
Russian knowledge QA
Answering Russian knowledge-based questions
Accuracy of 0.32 on the ru_mmlu dataset
Russian language understanding
Russian natural language understanding tasks
Accuracy of 0.4280 on the xnli_ru dataset
Russian reasoning
Russian contextual reasoning tasks
Accuracy of 0.71 on the xwinograd_ru dataset
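The accuracy figures above are fractions of correctly answered examples. As a rough illustration, assuming plain exact-match scoring (the evaluation harness actually used is not stated on this page), accuracy can be computed as follows. The accuracy function and the sample labels are illustrative only and are not taken from the benchmarks.

```python
from typing import Sequence


def accuracy(predictions: Sequence[str], references: Sequence[str]) -> float:
    """Fraction of predictions that exactly match their reference label."""
    if len(predictions) != len(references):
        raise ValueError("predictions and references must have equal length")
    if not references:
        raise ValueError("cannot compute accuracy on an empty dataset")
    correct = sum(p == r for p, r in zip(predictions, references))
    return correct / len(references)


# Illustrative labels only: three of the four predictions match.
score = accuracy(["A", "B", "C", "D"], ["A", "B", "D", "D"])
print(score)  # 0.75
```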