Past Decode Reg. + AWD-LSTM-MoS + dyn. eval.
Reported on 5 benchmarks across 1 task · 1 paper · 2 SOTA
Note: results are matched by exact model name. Different papers may use the same name for different model variants.
Medical5 results
- Test perplexity· 2018-08-14SOTA40.3best: 8.21 (SparseGPT (175B, 50% Sparsity))
- Validation perplexity· 2018-08-14SOTA42best: 15.69 (GPT-2 (fine-tuned))
- Test perplexity· 2018-08-1447.3best: 20.5 (GPT-3 (Zero-Shot))
- Validation perplexity· 2018-08-1448best: 36.1 (BERT-Large-CAS)
- Bit per Character (BPC)· 2018-08-141.169best: 1.38 (Bipartite Flow)