Language Modelling on enwiki8

Metric: Bits per Character (BPC) (lower is better)
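
BPC is the average number of bits the model needs to encode each character, i.e. the mean negative log2-probability it assigns to the test text. Training code often reports cross-entropy in nats, so a conversion is needed before comparing against this leaderboard. The following is a minimal sketch of that conversion; the function name and the example numbers are hypothetical and are not taken from any listed paper.

```python
import math


def bits_per_character(total_nll_nats: float, num_characters: int) -> float:
    """Convert a summed negative log-likelihood in nats to bits per character.

    Assumes the model predicts one character (byte) per step, so the number
    of predictions equals the number of characters.
    """
    if num_characters <= 0:
        raise ValueError("num_characters must be positive")
    # Dividing by ln(2) converts nats to bits; dividing by the count averages.
    return total_nll_nats / (num_characters * math.log(2))


# Hypothetical input, for illustration only: a summed loss of 1000 * ln(2)
# nats over 1000 characters corresponds to one bit per character.
example_bpc = bits_per_character(total_nll_nats=1000 * math.log(2), num_characters=1000)
print(f"{example_bpc:.2f} BPC")
```

If the loss is already a per-character mean in nats, dividing it by `math.log(2)` alone gives the same result.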

# | Model | Bits per Character (BPC) | Extra Data | Paper | Date | Code
1 | PAR Transformer 24B | 1.11 | No | Pay Attention when Required | 2020-09-09 | Code