Language Modelling on The Pile

Metric: Test perplexity (lower is better)
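
For reference, perplexity is the exponential of the average negative log-likelihood per token on the test set, which is why lower values indicate a better model. The sketch below is a minimal illustration, not the leaderboard's evaluation code: the function name `perplexity` and the input format (a list of per-token natural-log probabilities) are assumptions, and reported numbers also depend on the tokenization each entry uses.

```python
import math


def perplexity(token_log_probs):
    """Perplexity from per-token natural-log probabilities (hypothetical helper)."""
    if not token_log_probs:
        raise ValueError("need at least one token log-probability")
    # Average negative log-likelihood per token, in nats.
    avg_nll = -sum(token_log_probs) / len(token_log_probs)
    # Perplexity is the exponential of that average.
    return math.exp(avg_nll)


# Illustrative input: a model that assigns probability 0.25 to every token
# is as uncertain as a uniform choice among 4 options.
uniform_guess = [math.log(0.25)] * 4
print(perplexity(uniform_guess))
```

A model that assigns higher probability to the observed tokens yields a smaller average negative log-likelihood and therefore a lower perplexity.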
