TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

SotA/Medical/Language Modelling/C4

Language Modelling on C4

Metric: Perplexity (lower is better)

LeaderboardDataset
Loading chart...

Results

Submit a result
#Model↕Perplexity▲Extra DataPaperDate↕Code
1Primer12.35NoPrimer: Searching for Efficient Transformers for...2021-09-17Code
2Zeropoint LLM.int8 13B (vector-wise + decomp)12.45NoLLM.int8(): 8-bit Matrix Multiplication for Tran...2022-08-15Code
3T5++12.69NoPrimer: Searching for Efficient Transformers for...2021-09-17Code
4Original T513.25NoPrimer: Searching for Efficient Transformers for...2021-09-17Code
5LLM.float32 6.7B13.3NoLLM.int8(): 8-bit Matrix Multiplication for Tran...2022-08-15Code
6LLM.float32 2.7B14.43NoLLM.int8(): 8-bit Matrix Multiplication for Tran...2022-08-15Code
7N-Grammer 343M14.79NoN-Grammer: Augmenting Transformers with latent n...2022-07-13Code
8N-Grammer 288M15.01NoN-Grammer: Augmenting Transformers with latent n...2022-07-13Code
9LLM.float32 1.3B15.91NoLLM.int8(): 8-bit Matrix Multiplication for Tran...2022-08-15Code