Difference-in-means
Reported on 1 benchmark across 1 task · 1 paper
Note: results are matched by exact model name. Different papers may use the same name for different model variants.
Miscellaneous1 result
- Log odds-ratio (pythia-6.9b)· 2024-02-192.91best: 9.95 (DAS)