đ cat-ukr
This project focuses on the translation between Catalan and Ukrainian, offering a reliable translation model and related evaluation data.
⨠Features
- Language Pair: It supports translation from Catalan (
cat
) to Ukrainian (ukr
).
- Model Type: The model used is
transformer-align
.
- Pre - processing: The pre - processing steps include normalization and SentencePiece (spm4k, spm4k).
đĻ Installation
No installation steps are provided in the original document.
đģ Usage Examples
No code examples are provided in the original document.
đ Documentation
General Information
- Source Group: Catalan
- Target Group: Ukrainian
- OPUS Readme: [cat - ukr](https://github.com/Helsinki - NLP/Tatoeba - Challenge/tree/master/models/cat - ukr/README.md)
Download Links
- Original Weights: [opus - 2020 - 06 - 16.zip](https://object.pouta.csc.fi/Tatoeba - MT - models/cat - ukr/opus - 2020 - 06 - 16.zip)
- Test Set Translations: [opus - 2020 - 06 - 16.test.txt](https://object.pouta.csc.fi/Tatoeba - MT - models/cat - ukr/opus - 2020 - 06 - 16.test.txt)
- Test Set Scores: [opus - 2020 - 06 - 16.eval.txt](https://object.pouta.csc.fi/Tatoeba - MT - models/cat - ukr/opus - 2020 - 06 - 16.eval.txt)
Benchmarks
Testset |
BLEU |
chr - F |
Tatoeba - test.cat.ukr |
28.6 |
0.503 |
System Info
Property |
Details |
hf_name |
cat - ukr |
source_languages |
cat |
target_languages |
ukr |
opus_readme_url |
[https://github.com/Helsinki - NLP/Tatoeba - Challenge/tree/master/models/cat - ukr/README.md](https://github.com/Helsinki - NLP/Tatoeba - Challenge/tree/master/models/cat - ukr/README.md) |
original_repo |
Tatoeba - Challenge |
tags |
['translation'] |
languages |
['ca', 'uk'] |
src_constituents |
{'cat'} |
tgt_constituents |
{'ukr'} |
src_multilingual |
False |
tgt_multilingual |
False |
prepro |
normalization + SentencePiece (spm4k, spm4k) |
url_model |
[https://object.pouta.csc.fi/Tatoeba - MT - models/cat - ukr/opus - 2020 - 06 - 16.zip](https://object.pouta.csc.fi/Tatoeba - MT - models/cat - ukr/opus - 2020 - 06 - 16.zip) |
url_test_set |
[https://object.pouta.csc.fi/Tatoeba - MT - models/cat - ukr/opus - 2020 - 06 - 16.test.txt](https://object.pouta.csc.fi/Tatoeba - MT - models/cat - ukr/opus - 2020 - 06 - 16.test.txt) |
src_alpha3 |
cat |
tgt_alpha3 |
ukr |
short_pair |
ca - uk |
chrF2_score |
0.503 |
bleu |
28.6 |
brevity_penalty |
0.9670000000000001 |
ref_len |
2438.0 |
src_name |
Catalan |
tgt_name |
Ukrainian |
train_date |
2020 - 06 - 16 |
src_alpha2 |
ca |
tgt_alpha2 |
uk |
prefer_old |
False |
long_pair |
cat - ukr |
helsinki_git_sha |
480fcbe0ee1bf4774bcbe6226ad9f58e63f6c535 |
transformers_git_sha |
2207e5d8cb224e954a7cba69fa4ac2309e9ff30b |
port_machine |
brutasse |
port_time |
2020 - 08 - 21 - 14:41 |
đ§ Technical Details
No technical details are provided in the original document.
đ License
This project is licensed under the [Apache - 2.0](https://www.apache.org/licenses/LICENSE - 2.0) license.