LaBSE-en-ru-myv-v1
A LaBSE-based sentence encoder, built as part of the first neural machine translation system for the Erzya language, that produces sentence embeddings for Russian and Erzya
Downloads 17
Release Time: 9/15/2022
Model Overview
This model is a cross-lingual sentence encoder derived from LaBSE and adapted specifically to the Erzya language. It can be used to produce sentence embeddings, to predict masked tokens, and as a base model for fine-tuning on downstream natural language understanding tasks.
Model Features
Cross-lingual embedding capability
Produces sentence embeddings for Russian and Erzya that can be compared across the two languages
Specialized token optimization
Removes irrelevant tokens from the vocabulary and adds Erzya-specific tokens so that Erzya text is processed more effectively
Multi-task fine-tuning
Fine-tuned jointly on three tasks: cross-lingual distillation, masked language modeling, and sentence pair classification
Model Capabilities
Generate sentence embeddings
Masked language prediction
Cross-lingual sentence similarity calculation
Fine-tuning for natural language understanding tasks
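The embedding and similarity capabilities listed above can be sketched roughly as follows. This is a minimal sketch, not the authors' reference code. It assumes the model is published on the Hugging Face Hub under the id `slone/LaBSE-en-ru-myv-v1` and that, like the original LaBSE, it exposes the sentence vector as the pooled output of the encoder. The function names `embed` and `similarity` are illustrative only.

```python
# Assumed Hugging Face Hub id for this model; adjust it if the actual id differs.
MODEL_NAME = "slone/LaBSE-en-ru-myv-v1"


def embed(texts, model_name=MODEL_NAME):
    """Return one L2-normalized embedding (a list of floats) per input sentence."""
    # Imported lazily so the pure-Python helper below works without these packages.
    import torch
    from transformers import AutoModel, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(model_name)
    model = AutoModel.from_pretrained(model_name)
    model.eval()

    batch = tokenizer(
        texts, padding=True, truncation=True, max_length=64, return_tensors="pt"
    )
    with torch.no_grad():
        output = model(**batch)

    # Assumption: as in LaBSE, the pooled [CLS] output is the sentence vector.
    vectors = torch.nn.functional.normalize(output.pooler_output, dim=-1)
    return vectors.tolist()


def similarity(u, v):
    """Cosine similarity of two already-normalized vectors (their dot product)."""
    if len(u) != len(v):
        raise ValueError("vectors must have the same length")
    return sum(a * b for a, b in zip(u, v))
```

Passing a Russian sentence and an Erzya sentence to `embed` in one call and then calling `similarity` on the two returned vectors gives a cross-lingual similarity score. Because `embed` normalizes its output, the dot product is enough and no further scaling is needed.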
Use Cases
Machine translation
Russian-Erzya translation
Used as the encoder component of a translation system
Semantic analysis
Cross-lingual document retrieval
Retrieve similar content in a mixed Russian and Erzya document library
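The retrieval use case can be illustrated with a small ranking routine over precomputed sentence embeddings. This is a sketch under stated assumptions: the vectors are taken to come from this encoder (or any other sentence encoder), and the names `cosine` and `retrieve` are hypothetical helpers, not part of any library. A real document library would more likely use a vector index than a linear scan.

```python
import math


def cosine(u, v):
    """Cosine similarity of two equal-length vectors; 0.0 if either is all zeros."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


def retrieve(query_vec, doc_vecs, top_k=3):
    """Return (document index, score) pairs for the top_k most similar documents."""
    scored = [(cosine(query_vec, vec), idx) for idx, vec in enumerate(doc_vecs)]
    # Highest similarity first; ties keep their original document order.
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return [(idx, score) for score, idx in scored[:top_k]]
```

Since Russian and Erzya sentences are embedded by the same model, the query and the documents can be in either language: the ranking depends only on the vectors.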