Reported on 189 benchmarks across 22 tasks · 15 papers · 35 SOTA
Note: results are matched by exact model name. Different papers may use the same name for different model variants.