TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Models/MASS (6-layer Transformer)

MASS (6-layer Transformer)

Reported on 6 benchmarks across 1 task · 1 paper · 6 SOTA

Note: results are matched by exact model name. Different papers may use the same name for different model variants.

Natural Language Processing6 results

  • Machine TranslationonWMT2014 English-French
    BLEU· 2019-05-07
    37.5
    best: 38.27 (BERT-fused NMT)
    SOTA
    MASS: Masked Sequence to Sequence Pre-training for Language GenerationarXiv:1905.02450
  • Machine TranslationonWMT2014 French-English
    BLEU· 2019-05-07
    34.9
    best: 39.2 (GPT-3 175B (Few-Shot))
    SOTA
    MASS: Masked Sequence to Sequence Pre-training for Language GenerationarXiv:1905.02450
  • Machine TranslationonWMT2016 English-German
    BLEU· 2019-05-07
    28.3
    best: 29.7 (GPT-3 175B (Few-Shot))
    SOTA
    MASS: Masked Sequence to Sequence Pre-training for Language GenerationarXiv:1905.02450
  • Machine TranslationonWMT2016 Romanian-English
    BLEU· 2019-05-07
    33.1
    best: 39.5 (GPT-3 175B (Few-Shot))
    SOTA
    MASS: Masked Sequence to Sequence Pre-training for Language GenerationarXiv:1905.02450
  • Machine TranslationonWMT2016 German-English
    BLEU· 2019-05-07
    35.2
    best: 40.6 (GPT-3 175B (Few-Shot))
    SOTA
    MASS: Masked Sequence to Sequence Pre-training for Language GenerationarXiv:1905.02450
  • Machine TranslationonWMT2016 English-Romanian
    BLEU· 2019-05-07
    35.2
    SOTA
    MASS: Masked Sequence to Sequence Pre-training for Language GenerationarXiv:1905.02450