WMT19 - English - Russian open - source English - Russian translation model - Free realization of mutual translation between English and Russian

Wmt19 En Ru

Developed by facebook

Facebook's WMT19 English-Russian neural machine translation model based on FairSeq, utilizing Transformer architecture

Supports Multiple LanguagesOpen Source License:Apache-2.0 #English-Russian translation #WMT19 benchmark #High-precision translation

Downloads 2,546

Release Time : 3/2/2022

Model Overview

This model is a neural machine translation system specifically designed for bidirectional English-Russian translation, trained on WMT19 competition data and achieving high-quality translation results with advanced Transformer architecture

Model Features

High-quality translation

Trained on WMT19 competition data, achieving SOTA level in English-Russian translation tasks

Transformer architecture

Utilizes advanced Transformer neural network architecture for better long-range dependency modeling

Multilingual support

Supports bidirectional translation between English and Russian

Model Capabilities

English to Russian text translation

Russian to English text translation

Long text translation processing

Use Cases

Content translation

News translation

Translate English news articles to Russian, or vice versa

Achieved 33.47 BLEU score on WMT19 test set

Technical document translation

Translate technical documents and instructional materials

Cross-language communication

Social media content translation

Translate social media posts and comments

🚀 FSMT

FSMT is a ported version of the fairseq wmt19 transformer for en - ru translation, offering four models for different language pairs.

🚀 Quick Start

from transformers import FSMTForConditionalGeneration, FSMTTokenizer
mname = "facebook/wmt19-en-ru"
tokenizer = FSMTTokenizer.from_pretrained(mname)
model = FSMTForConditionalGeneration.from_pretrained(mname)

input = "Machine learning is great, isn't it?"
input_ids = tokenizer.encode(input, return_tensors="pt")
outputs = model.generate(input_ids)
decoded = tokenizer.decode(outputs[0], skip_special_tokens=True)
print(decoded) # Машинное обучение - это здорово, не так ли?

✨ Features

This is a ported version of fairseq wmt19 transformer for en - ru.
The abbreviation FSMT stands for FairSeqMachineTranslation.
All four models are available:
- [wmt19 - en - ru](https://huggingface.co/facebook/wmt19 - en - ru)
- [wmt19 - ru - en](https://huggingface.co/facebook/wmt19 - ru - en)
- [wmt19 - en - de](https://huggingface.co/facebook/wmt19 - en - de)
- [wmt19 - de - en](https://huggingface.co/facebook/wmt19 - de - en)

📚 Documentation

Model Description

This is a ported version of fairseq wmt19 transformer for en - ru. For more details, please see, Facebook FAIR's WMT19 News Translation Task Submission.

Intended Uses & Limitations

How to Use

from transformers import FSMTForConditionalGeneration, FSMTTokenizer
mname = "facebook/wmt19-en-ru"
tokenizer = FSMTTokenizer.from_pretrained(mname)
model = FSMTForConditionalGeneration.from_pretrained(mname)

input = "Machine learning is great, isn't it?"
input_ids = tokenizer.encode(input, return_tensors="pt")
outputs = model.generate(input_ids)
decoded = tokenizer.decode(outputs[0], skip_special_tokens=True)
print(decoded) # Машинное обучение - это здорово, не так ли?

Limitations and Bias

The original (and this ported model) doesn't seem to handle well inputs with repeated sub - phrases, content gets truncated

Training Data

Pretrained weights were left identical to the original model released by fairseq. For more details, please, see the paper.

Eval Results

Property	Details
Model Pair	en - ru
Fairseq Score	36.4
Transformers Score	33.47

The score is slightly below the score reported by fairseq, since transformers currently doesn't support:

model ensemble, therefore the best performing checkpoint was ported (model4.pt).
re - ranking

The score was calculated using this code:

git clone https://github.com/huggingface/transformers
cd transformers
export PAIR=en-ru
export DATA_DIR=data/$PAIR
export SAVE_DIR=data/$PAIR
export BS=8
export NUM_BEAMS=15
mkdir -p $DATA_DIR
sacrebleu -t wmt19 -l $PAIR --echo src > $DATA_DIR/val.source
sacrebleu -t wmt19 -l $PAIR --echo ref > $DATA_DIR/val.target
echo $PAIR
PYTHONPATH="src:examples/seq2seq" python examples/seq2seq/run_eval.py facebook/wmt19-$PAIR $DATA_DIR/val.source $SAVE_DIR/test_translations.txt --reference_path $DATA_DIR/val.target --score_path $SAVE_DIR/test_bleu.json --bs $BS --task translation --num_beams $NUM_BEAMS

💡 Usage Tip

fairseq reports using a beam of 50, so you should get a slightly higher score if re - run with --num_beams 50.

Data Sources

BibTeX entry and citation info

@inproceedings{...,
  year={2020},
  title={Facebook FAIR's WMT19 News Translation Task Submission},
  author={Ng, Nathan and Yee, Kyra and Baevski, Alexei and Ott, Myle and Auli, Michael and Edunov, Sergey},
  booktitle={Proc. of WMT},
}

TODO

port model ensemble (fairseq uses 4 model checkpoints)

📄 License

This project is under the apache - 2.0 license.

Featured Recommended AI Models

Empowering the Future, Your AI Solution Knowledge Base

English 简体中文繁體中文にほんご