metabench - Paper Data

TablesCreative Commons Attribution

Item-wise accuracies in six benchmarks from Open LLM Leaderboard 1 scraped from huggingface.co and used for metabench analyses and construction. Datasets with RMSE's for random benchmark subsets are used as reference in the paper and are included here.

Please find the data uploaded on zenodo by clicking on "Homepage".