Papers With Code 2 | ML Benchmarks, SotA Results & Code

A collection of datasets and benchmarks for large-scale Performance Modeling with LLMs.

This collection includes these dataset:

github-200K: a first version dataset that contains ~1300 kernel's execution time with input size - global size correlation and imbalanced data.
github-350k: a second version dataset continued from the first that fix the imbalanced data problem.
github-400k: a third version dataset continued from the second that relax the input - global size correlation.
github-600K: a fourth version dataset that contains ~6000 kernel's execution time with input size - global size correlation and have balanced data.
benchmark-[]: benchmarks for LLMs performance on Perfomance Modeling task.

opencl-llmperf