Weighted Transformer (large)

Reported on 2 benchmarks across 1 task · 1 paper · 2 SOTA

Note: results are matched by exact model name. Different papers may use the same name for different model variants.

Natural Language Processing2 results

Machine TranslationonWMT2014 English-German
BLEU score· 2017-11-06
28.9
best: 35.14 (Transformer Cycle (Rev))
SOTA
Weighted Transformer Network for Machine Translation arXiv:1711.02132
Machine TranslationonWMT2014 English-French
BLEU score· 2017-11-06
41.4
best: 46.4 (Transformer+BT (ADMIN init))
SOTA
Weighted Transformer Network for Machine Translation arXiv:1711.02132