This repository was archived by the owner on Dec 11, 2023. It is now read-only.
-
Notifications
You must be signed in to change notification settings - Fork 2k
Reproducing result: WMT German-English bleu score is less than the half of the expected score #341
Copy link
Copy link
Open
Description
Thanks for sharing this great work!
Although, I strictly tried to follow the instructions in the ReadMe, I am unable the reproduce the WMT German-English benchmark results on newstest2015.
Here are my details:
- python 3.6.2, Tensorflow 1.5.1
- I used the provided
nmt/scripts/wmt16_en_de.shto download and pre-process the data files. - I patched the
nmt/standard_hparams/wmt16.jsonby adding two lines"num_encoder_layers": 4, "num_decoder_layers": 4,in order to avoid the problem described in Error loading pretrained model #264, and Update WMT16 standard hparams #265. - I used the following pre-trained models:
I got the following inference results for newstest_2015:
- deen_model_1 -- real bleu: 11.7, expected bleu: 27.6 (command to run inference:
python -m nmt.nmt --src=de --tgt=en --ckpt=deen_model_1/translate.ckpt --hparams_path=nmt/standard_hparams/wmt16.json --out_dir=deen_model_1_output --vocab_prefix=wmt16/vocab.bpe.32000 --inference_input_file=wmt16/newstest2015.tok.bpe.32000.de --inference_output_file=deen_model_1_output/output_infer --inference_ref_file=wmt16/newstest2015.tok.bpe.32000.en) - deen_model_2 -- real bleu: 11.8, expected bleu: 28.9 (command to run inference:
python -m nmt.nmt --src=de --tgt=en --ckpt=deen_model_2/translate.ckpt --hparams_path=nmt/standard_hparams/wmt16.json --out_dir=deen_model_2_output --vocab_prefix=wmt16/vocab.bpe.32000 --inference_input_file=wmt16/newstest2015.tok.bpe.32000.de --inference_output_file=deen_model_2_output/output_infer --inference_ref_file=wmt16/newstest2015.tok.bpe.32000.en)
Could you please provide any hint or help what am I doing wrong?
Thank you!
Metadata
Metadata
Assignees
Labels
No labels