Grave et al. (2016) - LSTM + continuous cache pointer
Reported on 1 benchmark across 1 task · 1 paper · 1 SOTA
Note: results are matched by exact model name. Different papers may use the same name for different model variants.
Medical1 result
- Test perplexity· 2016-12-13SOTA68.9best: 8.21 (SparseGPT (175B, 50% Sparsity))