Text Generation on OpenWebText

Metric: eval_loss (lower is better)

LeaderboardDataset
Loading chart...
#Modeleval_lossExtra DataPaperDateCode
1GPT2-Hermite2.91NoPolynomial, trigonometric, and tropical activati...2025-02-03Code
2GPT2-81M-LOOP3.11NoLoop Neural Networks for Parameter Sharing2024-09-21-
3GPT2-124M3.12No--Code