Adaptively Sparse Transformer (1.5-entmax)
Reported on 3 benchmarks across 1 task · 1 paper
Note: results are matched by exact model name. Different papers may use the same name for different model variants.
Natural Language Processing · 3 results
- BLEU score · 2019-08-30 · 33.1 · best: 40.3 (fast-noisy-channel-modeling)
- BLEU score · 2019-08-30 · 29.83 · best: 29.9 (Adaptively Sparse Transformer (alpha-entmax))
- BLEU score · 2019-08-30 · 25.89 · best: 35.14 (Transformer Cycle (Rev))