GPT-2 Large 774M (test-time training on nearest neighbors)
Reported on 1 benchmark across 1 task · 1 paper
Note: results are matched by exact model name. Different papers may use the same name for different model variants.
Medical · 1 result
- Bits per byte · 2023-05-29 · 0.85 · best: 1.2253 (GPT-2 Small 124M (pre-trained))