đ Bulgarian-Ukrainian Translation Model
This project focuses on the translation between Bulgarian and Ukrainian, offering a high - quality translation solution.
đ Quick Start
This model is designed for Bulgarian - Ukrainian translation. You can download the original weights and test sets from the provided links to start using the model.
⨠Features
- Language Pair: Supports translation from Bulgarian to Ukrainian.
- Model Type: Uses the
transformer - align
model.
- Pre - processing: Applies normalization and SentencePiece (spm32k, spm32k) for text pre - processing.
đĻ Installation
There is no specific installation steps provided in the original document.
đģ Usage Examples
No code examples are provided in the original document.
đ Documentation
Model Information
Property |
Details |
Model Type |
transformer - align |
Source Language |
Bulgarian (bul) |
Target Language |
Ukrainian (ukr) |
Pre - processing |
normalization + SentencePiece (spm32k, spm32k) |
Download Original Weights |
[opus - 2020 - 06 - 17.zip](https://object.pouta.csc.fi/Tatoeba - MT - models/bul - ukr/opus - 2020 - 06 - 17.zip) |
Test Set Translations |
[opus - 2020 - 06 - 17.test.txt](https://object.pouta.csc.fi/Tatoeba - MT - models/bul - ukr/opus - 2020 - 06 - 17.test.txt) |
Test Set Scores |
[opus - 2020 - 06 - 17.eval.txt](https://object.pouta.csc.fi/Tatoeba - MT - models/bul - ukr/opus - 2020 - 06 - 17.eval.txt) |
OPUS Readme |
[bul - ukr](https://github.com/Helsinki - NLP/Tatoeba - Challenge/tree/master/models/bul - ukr/README.md) |
Benchmarks
testset |
BLEU |
chr - F |
Tatoeba - test.bul.ukr |
49.2 |
0.683 |
System Info
- hf_name: bul - ukr
- source_languages: bul
- target_languages: ukr
- opus_readme_url: https://github.com/Helsinki - NLP/Tatoeba - Challenge/tree/master/models/bul - ukr/README.md
- original_repo: Tatoeba - Challenge
- tags: ['translation']
- languages: ['bg', 'uk']
- src_constituents: {'bul', 'bul_Latn'}
- tgt_constituents: {'ukr'}
- src_multilingual: False
- tgt_multilingual: False
- prepro: normalization + SentencePiece (spm32k, spm32k)
- url_model: https://object.pouta.csc.fi/Tatoeba - MT - models/bul - ukr/opus - 2020 - 06 - 17.zip
- url_test_set: https://object.pouta.csc.fi/Tatoeba - MT - models/bul - ukr/opus - 2020 - 06 - 17.test.txt
- src_alpha3: bul
- tgt_alpha3: ukr
- short_pair: bg - uk
- chrF2_score: 0.6829999999999999
- bleu: 49.2
- brevity_penalty: 0.983
- ref_len: 4932.0
- src_name: Bulgarian
- tgt_name: Ukrainian
- train_date: 2020 - 06 - 17
- src_alpha2: bg
- tgt_alpha2: uk
- prefer_old: False
- long_pair: bul - ukr
- helsinki_git_sha: 480fcbe0ee1bf4774bcbe6226ad9f58e63f6c535
- transformers_git_sha: 2207e5d8cb224e954a7cba69fa4ac2309e9ff30b
- port_machine: brutasse
- port_time: 2020 - 08 - 21 - 14:41
đ License
This project is licensed under the Apache - 2.0 license.