TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

SotA/Natural Language Processing/Question Answering/MMLU (College Biology)

Question Answering on MMLU (College Biology)

Metric: Accuracy (higher is better)

LeaderboardDataset
Loading chart...

Results

Submit a result
#Model↕Accuracy▼Extra DataPaperDate↕Code
1Med-PaLM 2 (ER)95.8NoTowards Expert-Level Medical Question Answering ...2023-05-16Code
2Med-PaLM 2 (CoT + SC)95.1NoTowards Expert-Level Medical Question Answering ...2023-05-16Code
3Med-PaLM 2 (5-shot)94.4NoTowards Expert-Level Medical Question Answering ...2023-05-16Code
4Chinchilla (few-shot, k=5)79.9NoGalactica: A Large Language Model for Science2022-11-16Code
5Gopher (few-shot, k=5)70.8NoGalactica: A Large Language Model for Science2022-11-16Code
6GAL 120B (zero-shot)68.8NoGalactica: A Large Language Model for Science2022-11-16Code
7OPT (few-shot, k=5)30.6NoGalactica: A Large Language Model for Science2022-11-16Code
8BLOOM (few-shot, k=5)28.5NoGalactica: A Large Language Model for Science2022-11-16Code