Rico Sennrich, Barry Haddow, Alexandra Birch
We participated in the WMT 2016 shared news translation task by building neural translation systems for four language pairs, each trained in both directions: English<->Czech, English<->German, English<->Romanian and English<->Russian. Our systems are based on an attentional encoder-decoder, using BPE subword segmentation for open-vocabulary translation with a fixed vocabulary. We experimented with using automatic back-translations of the monolingual News corpus as additional training data, pervasive dropout, and target-bidirectional models. All reported methods give substantial improvements, and we see improvements of 4.3--11.2 BLEU over our baseline systems. In the human evaluation, our systems were the (tied) best constrained system for 7 out of 8 translation directions in which we participated.
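The BPE subword segmentation mentioned above learns a fixed-size vocabulary by iteratively merging the most frequent pair of adjacent symbols. A minimal sketch of that merge-learning loop is below (the toy vocabulary and merge count are illustrative only, not the authors' actual training setup):

```python
import re
import collections

def get_stats(vocab):
    """Count frequency of adjacent symbol pairs across the vocabulary."""
    pairs = collections.defaultdict(int)
    for word, freq in vocab.items():
        symbols = word.split()
        for i in range(len(symbols) - 1):
            pairs[(symbols[i], symbols[i + 1])] += freq
    return pairs

def merge_vocab(pair, vocab):
    """Merge every occurrence of the given symbol pair into one symbol."""
    bigram = re.escape(' '.join(pair))
    pattern = re.compile(r'(?<!\S)' + bigram + r'(?!\S)')
    return {pattern.sub(''.join(pair), word): freq
            for word, freq in vocab.items()}

# Toy vocabulary: words pre-split into characters, '</w>' marks word end.
vocab = {'l o w </w>': 5, 'l o w e r </w>': 2,
         'n e w e s t </w>': 6, 'w i d e s t </w>': 3}

# Learn 10 merge operations; in practice tens of thousands are learned.
for _ in range(10):
    pairs = get_stats(vocab)
    if not pairs:
        break
    best = max(pairs, key=pairs.get)
    vocab = merge_vocab(best, vocab)
```

After these merges, frequent words such as "newest" become single symbols, while rare words remain decomposable into smaller known units, which is what enables open-vocabulary translation with a fixed vocabulary.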
| Task | Dataset | Metric | Value | Model |
|---|---|---|---|---|
| Machine Translation | WMT2016 English-Czech | BLEU score | 25.8 | Attentional encoder-decoder + BPE |
| Machine Translation | WMT2016 Czech-English | BLEU score | 31.4 | Attentional encoder-decoder + BPE |
| Machine Translation | WMT2016 English-German | BLEU score | 34.2 | Attentional encoder-decoder + BPE |
| Machine Translation | WMT2016 German-English | BLEU score | 38.6 | Attentional encoder-decoder + BPE |
| Machine Translation | WMT2016 English-Romanian | BLEU score | 28.1 | BiGRU |
| Machine Translation | WMT2016 Romanian-English | BLEU score | 33.3 | Attentional encoder-decoder + BPE |
| Machine Translation | WMT2016 English-Russian | BLEU score | 26.0 | Attentional encoder-decoder + BPE |
| Machine Translation | WMT2016 Russian-English | BLEU score | 28.0 | Attentional encoder-decoder + BPE |