Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Question Answering on MMLU (Professional medicine)

Metric: Accuracy (higher is better)
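
As a rough illustration of the metric, accuracy on a multiple-choice benchmark is the share of questions where the predicted option matches the answer key. The sketch below assumes the leaderboard reports accuracy as a percentage; the function name `accuracy_percent` and the sample answers are hypothetical and are not taken from MMLU or from any listed paper.

```python
def accuracy_percent(predictions, answers):
    """Return the share of correctly answered questions as a percentage."""
    if len(predictions) != len(answers):
        raise ValueError("predictions and answers must have the same length")
    if not answers:
        raise ValueError("at least one question is required")
    # Count positions where the predicted option equals the answer key.
    correct = sum(p == a for p, a in zip(predictions, answers))
    return 100.0 * correct / len(answers)


# Hypothetical answer key and model outputs (illustrative only).
answers = ["A", "C", "B", "D"]
predictions = ["A", "C", "D", "D"]
score = accuracy_percent(predictions, answers)
print(score)  # 75.0
```

Prompting setups such as few-shot or chain-of-thought change how the predictions are produced, not how they are scored under this definition.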

Results

#  Model                  Accuracy  Extra Data  Paper                                                Date        Code
1  Med-PaLM 2 (5-shot)    95.2      No          Towards Expert-Level Medical Question Answering ...  2023-05-16  Code
2  Med-PaLM 2 (CoT + SC)  93.4      No          Towards Expert-Level Medical Question Answering ...  2023-05-16  Code
3  Med-PaLM 2 (ER)        92.3      No          Towards Expert-Level Medical Question Answering ...  2023-05-16  Code
4  BioMedGPT-LM-7B        51.1      No          BioMedGPT: Open Multimodal Generative Pre-traine...  2023-08-18  Code
5  Llama2-7B              43.38     No          Llama 2: Open Foundation and Fine-Tuned Chat Mod...  2023-07-18  Code
6  Llama2-7B-chat         40.07     No          Llama 2: Open Foundation and Fine-Tuned Chat Mod...  2023-07-18  Code