FlowSeq: Non-Autoregressive Conditional Sequence Generation with Generative Flow

Xuezhe Ma, Chunting Zhou, Xi-An Li, Graham Neubig, Eduard Hovy

2019-09-05 | IJCNLP 2019 11 | Machine Translation, NMT, Translation

Abstract

Most sequence-to-sequence (seq2seq) models are autoregressive; they generate each token by conditioning on previously generated tokens. In contrast, non-autoregressive seq2seq models generate all tokens in one pass, which leads to increased efficiency through parallel processing on hardware such as GPUs. However, directly modeling the joint distribution of all tokens simultaneously is challenging, and even with increasingly complex model structures, accuracy lags significantly behind autoregressive models. In this paper, we propose a simple, efficient, and effective model for non-autoregressive sequence generation using latent variable models. Specifically, we turn to generative flow, an elegant technique to model complex distributions using neural networks, and design several layers of flow tailored for modeling the conditional density of sequential latent variables. We evaluate this model on three neural machine translation (NMT) benchmark datasets, achieving comparable performance with state-of-the-art non-autoregressive NMT models and almost constant decoding time with respect to the sequence length.
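
To make the phrase "generative flow" more concrete, here is a minimal sketch of one invertible flow step, an affine coupling layer, written in plain Python. This is a generic building block and not necessarily one of the layers the paper designs; the function names and the toy conditioner functions (`toy_log_scale`, `toy_shift`) are hypothetical and stand in for learned neural networks. It assumes an even-length input vector.

```python
import math


def coupling_forward(z, log_scale_fn, shift_fn):
    """Affine coupling: keep the first half, transform the second half.

    Returns the transformed vector and log|det Jacobian| of the map.
    """
    half = len(z) // 2
    z_a, z_b = z[:half], z[half:]
    log_s = log_scale_fn(z_a)  # one log-scale per element of z_b
    t = shift_fn(z_a)          # one shift per element of z_b
    y_b = [b * math.exp(ls) + sh for b, ls, sh in zip(z_b, log_s, t)]
    # The Jacobian is triangular, so its log-determinant is a plain sum.
    return z_a + y_b, sum(log_s)


def coupling_inverse(y, log_scale_fn, shift_fn):
    """Exact inverse of coupling_forward."""
    half = len(y) // 2
    y_a, y_b = y[:half], y[half:]
    log_s = log_scale_fn(y_a)  # y_a equals z_a, so the same values are recovered
    t = shift_fn(y_a)
    z_b = [(b - sh) * math.exp(-ls) for b, ls, sh in zip(y_b, log_s, t)]
    return y_a + z_b


def standard_normal_logpdf(v):
    """Log density of a vector under an isotropic standard normal."""
    return sum(-0.5 * x * x - 0.5 * math.log(2.0 * math.pi) for x in v)


# Hypothetical toy conditioners; a real model would use neural networks here.
def toy_log_scale(z_a):
    return [math.tanh(sum(z_a))] * len(z_a)


def toy_shift(z_a):
    return [0.5 * v for v in z_a]


z = [0.3, -1.2, 0.7, 2.0]
y, log_det = coupling_forward(z, toy_log_scale, toy_shift)
z_rec = coupling_inverse(y, toy_log_scale, toy_shift)

# Change of variables: log p(y) = log p_z(f^{-1}(y)) - log|det df/dz|
log_p_y = standard_normal_logpdf(z_rec) - log_det
```

Because the layer is exactly invertible and its log-determinant is cheap to compute, several such layers can be stacked to turn a simple base density into a more complex one while keeping the likelihood tractable.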

Results

| Task | Dataset | Metric | Value | Model |
| --- | --- | --- | --- | --- |
| Machine Translation | IWSLT2015 German-English | BLEU score | 24.75 | FlowSeq-base |
| Machine Translation | WMT2016 Romanian-English | BLEU score | 32.91 | FlowSeq-large (NPD n = 30) |
| Machine Translation | WMT2016 Romanian-English | BLEU score | 32.46 | FlowSeq-large (NPD n = 15) |
| Machine Translation | WMT2016 Romanian-English | BLEU score | 32.03 | FlowSeq-large (IWD n = 15) |
| Machine Translation | WMT2016 Romanian-English | BLEU score | 30.69 | FlowSeq-large |
| Machine Translation | WMT2016 Romanian-English | BLEU score | 30.16 | FlowSeq-base |
| Machine Translation | WMT2014 German-English | BLEU score | 28.29 | FlowSeq-large (NPD n = 30) |
| Machine Translation | WMT2014 German-English | BLEU score | 27.71 | FlowSeq-large (NPD n = 15) |
| Machine Translation | WMT2014 German-English | BLEU score | 27.16 | FlowSeq-large (IWD n = 15) |
| Machine Translation | WMT2014 German-English | BLEU score | 25.4 | FlowSeq-large |
| Machine Translation | WMT2014 German-English | BLEU score | 23.36 | FlowSeq-base |
| Machine Translation | WMT2014 English-German | BLEU score | 23.64 | FlowSeq-large (NPD n = 30) |
| Machine Translation | WMT2014 English-German | BLEU score | 23.14 | FlowSeq-large (NPD n = 15) |
| Machine Translation | WMT2014 English-German | BLEU score | 22.94 | FlowSeq-large (IWD n = 15) |
| Machine Translation | WMT2014 English-German | BLEU score | 20.85 | FlowSeq-large |
| Machine Translation | WMT2014 English-German | BLEU score | 18.55 | FlowSeq-base |
| Machine Translation | WMT2016 English-Romanian | BLEU score | 32.35 | FlowSeq-large (NPD n = 30) |
| Machine Translation | WMT2016 English-Romanian | BLEU score | 31.97 | FlowSeq-large (NPD n = 15) |
| Machine Translation | WMT2016 English-Romanian | BLEU score | 31.08 | FlowSeq-large (IWD n = 15) |
| Machine Translation | WMT2016 English-Romanian | BLEU score | 29.86 | FlowSeq-large |
| Machine Translation | WMT2016 English-Romanian | BLEU score | 29.26 | FlowSeq-base |