rus-lit Translation Model
This project provides a transformer-align model for Russian-to-Lithuanian translation. Input is pre-processed with normalization and SentencePiece (spm32k), and benchmark scores are reported on the Tatoeba test set.
Quick Start
This model translates Russian to Lithuanian. The original weights, test set translations, and test set scores can be downloaded from the links under Model Details below.
Features
- Translation Direction: Translates from Russian to Lithuanian.
- Model Type: Uses the `transformer-align` model.
- Pre-processing: Applies normalization and SentencePiece (spm32k, spm32k).
Installation
No specific installation steps are provided in the original README.
Usage Examples
No code examples are provided in the original README.
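As an unofficial starting point, the sketch below shows how a Marian-style checkpoint like this one might be loaded with the Hugging Face `transformers` library. It rests on assumptions not stated in the original README: that the model is published on the Hugging Face Hub under the ID `Helsinki-NLP/opus-mt-ru-lt` (derived here from the `short_pair` value `ru-lt`), and that `transformers`, `sentencepiece`, and `torch` are installed. The helper names `hub_model_id` and `translate` are hypothetical.

```python
def hub_model_id(src="ru", tgt="lt"):
    """Build the assumed Hub ID for a language pair (naming scheme is an assumption)."""
    return f"Helsinki-NLP/opus-mt-{src}-{tgt}"


def translate(sentences, model_id=None):
    """Translate a list of Russian sentences into Lithuanian.

    Downloads the checkpoint on first use, so it needs network access.
    """
    # Imported lazily so this module loads even without transformers installed.
    from transformers import MarianMTModel, MarianTokenizer

    model_id = model_id or hub_model_id()
    tokenizer = MarianTokenizer.from_pretrained(model_id)
    model = MarianMTModel.from_pretrained(model_id)

    # Tokenize all sentences as one padded batch of PyTorch tensors.
    batch = tokenizer(sentences, return_tensors="pt", padding=True)
    generated = model.generate(**batch)

    # Convert generated token IDs back to plain text.
    return tokenizer.batch_decode(generated, skip_special_tokens=True)
```

Calling `translate(["Привет, как дела?"])` should return a one-element list with the Lithuanian translation, provided the assumed model ID resolves. Loading the tokenizer and model once and reusing them would be preferable when translating many batches.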
Documentation
General Information
- Source Group: Russian
- Target Group: Lithuanian
- OPUS Readme: [rus-lit](https://github.com/Helsinki-NLP/Tatoeba-Challenge/tree/master/models/rus-lit/README.md)
Model Details
| Property | Details |
|---|---|
| Model Type | transformer-align |
| Source Language(s) | rus |
| Target Language(s) | lit |
| Pre-processing | normalization + SentencePiece (spm32k, spm32k) |
| Download Original Weights | [opus-2020-06-17.zip](https://object.pouta.csc.fi/Tatoeba-MT-models/rus-lit/opus-2020-06-17.zip) |
| Test Set Translations | [opus-2020-06-17.test.txt](https://object.pouta.csc.fi/Tatoeba-MT-models/rus-lit/opus-2020-06-17.test.txt) |
| Test Set Scores | [opus-2020-06-17.eval.txt](https://object.pouta.csc.fi/Tatoeba-MT-models/rus-lit/opus-2020-06-17.eval.txt) |
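The three download links above share one prefix and differ only in their suffix. A minimal sketch of building them and fetching one artifact with the Python standard library follows; the helper names `artifact_urls` and `download` are hypothetical, and the assumption that other releases follow the same URL layout is not confirmed by this README.

```python
from pathlib import Path
from urllib.request import urlretrieve

BASE_URL = "https://object.pouta.csc.fi/Tatoeba-MT-models"


def artifact_urls(pair="rus-lit", release="opus-2020-06-17"):
    """Return the URLs of the weights, test translations, and test scores."""
    prefix = f"{BASE_URL}/{pair}/{release}"
    return {
        "weights": f"{prefix}.zip",
        "test_translations": f"{prefix}.test.txt",
        "test_scores": f"{prefix}.eval.txt",
    }


def download(kind, dest_dir="."):
    """Download one artifact ('weights', 'test_translations', 'test_scores')."""
    url = artifact_urls()[kind]
    # Keep the remote file name, e.g. opus-2020-06-17.eval.txt.
    target = Path(dest_dir) / url.rsplit("/", 1)[-1]
    urlretrieve(url, str(target))
    return target
```

For example, `download("test_scores")` would save `opus-2020-06-17.eval.txt` to the current directory; the weights archive is much larger, so fetching the score file first is a cheap way to check connectivity.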
Benchmarks
| testset | BLEU | chr-F |
|---|---|---|
| Tatoeba-test.rus.lit | 43.5 | 0.675 |
System Info
| Property | Details |
|---|---|
| hf_name | rus-lit |
| source_languages | rus |
| target_languages | lit |
| opus_readme_url | https://github.com/Helsinki-NLP/Tatoeba-Challenge/tree/master/models/rus-lit/README.md |
| original_repo | Tatoeba-Challenge |
| tags | ['translation'] |
| languages | ['ru', 'lt'] |
| src_constituents | {'rus'} |
| tgt_constituents | {'lit'} |
| src_multilingual | False |
| tgt_multilingual | False |
| prepro | normalization + SentencePiece (spm32k, spm32k) |
| url_model | https://object.pouta.csc.fi/Tatoeba-MT-models/rus-lit/opus-2020-06-17.zip |
| url_test_set | https://object.pouta.csc.fi/Tatoeba-MT-models/rus-lit/opus-2020-06-17.test.txt |
| src_alpha3 | rus |
| tgt_alpha3 | lit |
| short_pair | ru-lt |
| chrF2_score | 0.675 |
| bleu | 43.5 |
| brevity_penalty | 0.937 |
| ref_len | 14406.0 |
| src_name | Russian |
| tgt_name | Lithuanian |
| train_date | 2020-06-17 |
| src_alpha2 | ru |
| tgt_alpha2 | lt |
| prefer_old | False |
| long_pair | rus-lit |
| helsinki_git_sha | 480fcbe0ee1bf4774bcbe6226ad9f58e63f6c535 |
| transformers_git_sha | 2207e5d8cb224e954a7cba69fa4ac2309e9ff30b |
| port_machine | brutasse |
| port_time | 2020-08-21-14:41 |
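The `brevity_penalty` and `ref_len` values can be related through the standard BLEU brevity penalty, which is 1 when the system output is at least as long as the reference and exp(1 - ref_len / hyp_len) otherwise. The sketch below inverts that formula to estimate the total output length implied by the reported figures. It assumes the scores were computed with the standard BLEU definition and that `ref_len` counts tokens as seen by the scorer; the function names are hypothetical, and the result is only approximate because the reported penalty is rounded to three decimals.

```python
import math


def brevity_penalty(hyp_len, ref_len):
    """Standard BLEU brevity penalty for total hypothesis/reference lengths."""
    if hyp_len <= 0:
        return 0.0
    if hyp_len >= ref_len:
        return 1.0
    return math.exp(1.0 - ref_len / hyp_len)


def implied_hyp_len(bp, ref_len):
    """Invert the penalty: the hypothesis length that yields a given bp < 1."""
    if not 0.0 < bp < 1.0:
        raise ValueError("bp must lie strictly between 0 and 1 to be invertible")
    # bp = exp(1 - r/c)  =>  c = r / (1 - ln(bp))
    return ref_len / (1.0 - math.log(bp))


# Values taken from the System Info table.
estimated_len = implied_hyp_len(0.937, 14406.0)
print(f"Estimated hypothesis length: {estimated_len:.0f}")
```

Under these assumptions the estimate comes out at roughly 13,500 tokens, which suggests the system output is about 6% shorter than the references.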
License
This project is licensed under the Apache-2.0 license.