TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

SotA/Medical/Language Modelling/Hutter Prize

Language Modelling on Hutter Prize

Metric: Bit per Character (BPC) (higher is better)

LeaderboardDataset
Loading chart...

Results

Submit a result
#Model↕Bit per Character (BPC)▼Extra DataPaperDate↕Code
1RHN - depth 5 [zilly2016recurrent]1.31NoRecurrent Highway Networks2016-07-12Code
2FS-LSTM-41.277NoFast-Slow Recurrent Neural Networks2017-05-24Code
3Large RHN1.27NoRecurrent Highway Networks2016-07-12Code
4Large FS-LSTM-41.245NoFast-Slow Recurrent Neural Networks2017-05-24Code
5Large mLSTM +emb +WN +VD1.24NoMultiplicative LSTM for sequence modelling2016-09-26Code
63-layer AWD-LSTM1.232NoAn Analysis of Neural Language Modeling at Multi...2018-03-22Code
7Mogrifier LSTM1.122NoMogrifier LSTM2019-09-04Code
812-layer Character Transformer Model1.11NoCharacter-Level Language Modeling with Deeper Se...2018-08-09Code
9mLSTM + dynamic eval1.08NoDynamic Evaluation of Neural Sequence Models2017-09-21Code
1064-layer Character Transformer Model1.06NoCharacter-Level Language Modeling with Deeper Se...2018-08-09Code
1112-layer Transformer-XL1.06YesTransformer-XL: Attentive Language Models Beyond...2019-01-09Code
1218-layer Transformer-XL1.03YesTransformer-XL: Attentive Language Models Beyond...2019-01-09Code
13Longformer Small1NoLongformer: The Long-Document Transformer2020-04-10Code
1424-layer Transformer-XL0.99NoTransformer-XL: Attentive Language Models Beyond...2019-01-09Code
15Longformer Large0.99NoLongformer: The Long-Document Transformer2020-04-10Code
16Mogrifier LSTM + dynamic eval0.988NoMogrifier LSTM2019-09-04Code
17Compressive Transformer0.97NoCompressive Transformers for Long-Range Sequence...2019-11-13Code
18Transformer-XL + RMS dynamic eval0.94NoDynamic Evaluation of Transformer Language Models2019-04-17Code