TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Convolutional Sequence to Sequence Learning

Convolutional Sequence to Sequence Learning

Jonas Gehring, Michael Auli, David Grangier, Denis Yarats, Yann N. Dauphin

2017-05-08ICML 2017 8Machine TranslationImage ClassificationBangla Spelling Error CorrectionTranslation
PaperPDFCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCode(official)CodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCode

Abstract

The prevalent approach to sequence to sequence learning maps an input sequence to a variable length output sequence via recurrent neural networks. We introduce an architecture based entirely on convolutional neural networks. Compared to recurrent models, computations over all elements can be fully parallelized during training and optimization is easier since the number of non-linearities is fixed and independent of the input length. Our use of gated linear units eases gradient propagation and we equip each decoder layer with a separate attention module. We outperform the accuracy of the deep LSTM setup of Wu et al. (2016) on both WMT'14 English-German and WMT'14 English-French translation at an order of magnitude faster speed, both on GPU and CPU.

Results

TaskDatasetMetricValueModel
Machine TranslationIWSLT2015 German-EnglishBLEU score32.31ConvS2S
Machine TranslationIWSLT2015 English-GermanBLEU score26.73ConvS2S
Machine TranslationWMT2014 English-GermanBLEU score26.4ConvS2S (ensemble)
Machine TranslationWMT2014 English-GermanBLEU score25.16ConvS2S
Machine TranslationWMT2016 English-RomanianBLEU score29.9ConvS2S BPE40k
Machine TranslationWMT2014 English-FrenchBLEU score41.3ConvS2S (ensemble)
Machine TranslationWMT2014 English-FrenchBLEU score40.46ConvS2S
Image ClassificationMNISTAccuracy98.59CNN Model by Som
Image ClassificationMNISTPercentage error1.41CNN Model by Som

Related Papers

Automatic Classification and Segmentation of Tunnel Cracks Based on Deep Learning and Visual Explanations2025-07-18Adversarial attacks to image classification systems using evolutionary algorithms2025-07-17Efficient Adaptation of Pre-trained Vision Transformer underpinned by Approximately Orthogonal Fine-Tuning Strategy2025-07-17Federated Learning for Commercial Image Sources2025-07-17MUPAX: Multidimensional Problem Agnostic eXplainable AI2025-07-17A Translation of Probabilistic Event Calculus into Markov Decision Processes2025-07-17Hashed Watermark as a Filter: Defeating Forging and Overwriting Attacks in Weight-based Neural Network Watermarking2025-07-15Function-to-Style Guidance of LLMs for Code Translation2025-07-15