Metric: eval_loss (lower is better)
| # | Model↕ | eval_loss▲ | Extra Data | Paper | Date↕ | Code |
|---|---|---|---|---|---|---|
| 1 | GPT2-Hermite | 2.91 | No | Polynomial, trigonometric, and tropical activati... | 2025-02-03 | Code |
| 2 | GPT2-Tropical | 2.92 | No | Polynomial, trigonometric, and tropical activati... | 2025-02-03 | Code |
| 3 | GPT2-Fourier | 2.93 | No | Polynomial, trigonometric, and tropical activati... | 2025-02-03 | Code |
| 4 | GPT2-GELU | 2.95 | No | Polynomial, trigonometric, and tropical activati... | 2025-02-03 | Code |