Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Mathematical Reasoning on Lila (IID)

Metric: Accuracy (higher is better)

Results

| # | Model | Accuracy | Extra Data | Paper | Date | Code |
|---|-------|----------|------------|-------|------|------|
| 1 | Codex (Few-Shot, 175B) | 0.604 | No | Lila: A Unified Benchmark for Mathematical Reaso... | 2022-10-31 | Code |
| 2 | Bhāskara-P (Fine-tuned, 2.7B) | 0.48 | No | Lila: A Unified Benchmark for Mathematical Reaso... | 2022-10-31 | Code |
| 3 | Neo-P (Fine-tuned, 2.7B) | 0.394 | No | Lila: A Unified Benchmark for Mathematical Reaso... | 2022-10-31 | Code |
| 4 | GPT-3 (Few-Shot, 175B) | 0.384 | No | Lila: A Unified Benchmark for Mathematical Reaso... | 2022-10-31 | Code |
| 5 | Bhāskara-A (Fine-tuned, 2.7B) | 0.252 | No | Lila: A Unified Benchmark for Mathematical Reaso... | 2022-10-31 | Code |
| 6 | Neo-A (Fine-tuned, 2.7B) | 0.204 | No | Lila: A Unified Benchmark for Mathematical Reaso... | 2022-10-31 | Code |