TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

SotA/Natural Language Processing/Question Answering/MedMCQA

Question Answering on MedMCQA

Metric: Dev Set (Acc-%) (higher is better)

LeaderboardDataset
Loading chart...

Results

Submit a result
#Model↕Dev Set (Acc-%)▼Extra DataPaperDate↕Code
1Meditron-70B (CoT + SC)66NoMEDITRON-70B: Scaling Medical Pretraining for La...2023-11-27Code
2Codex 5-shot CoT0.597NoCan large language models reason about medical q...2022-07-17Code
3VOD (BioLinkBERT)0.583NoVariational Open-Domain Question Answering2022-09-23Code
4Flan-PaLM (540B, SC)0.576NoLarge Language Models Encode Clinical Knowledge2022-12-26Code
5Flan-PaLM (540B, Few-shot)0.565NoLarge Language Models Encode Clinical Knowledge2022-12-26Code
6PaLM (540B, Few-shot)0.545NoLarge Language Models Encode Clinical Knowledge2022-12-26Code
7Flan-PaLM (540B, CoT)0.536NoLarge Language Models Encode Clinical Knowledge2022-12-26Code
8GAL 120B (zero-shot)0.529NoGalactica: A Large Language Model for Science2022-11-16Code
9Flan-PaLM (62B, Few-shot)0.462NoLarge Language Models Encode Clinical Knowledge2022-12-26Code
10PaLM (62B, Few-shot)0.434NoLarge Language Models Encode Clinical Knowledge2022-12-26Code
11PubmedBERT(Gu et al., 2022)0.4NoMedMCQA : A Large-scale Multi-Subject Multi-Choi...2022-03-27Code
12SciBERT (Beltagy et al., 2019)0.39NoMedMCQA : A Large-scale Multi-Subject Multi-Choi...2022-03-27Code
13BioBERT (Lee et al.,2020)0.38NoMedMCQA : A Large-scale Multi-Subject Multi-Choi...2022-03-27Code
14BERT (Devlin et al., 2019)-Base0.35NoMedMCQA : A Large-scale Multi-Subject Multi-Choi...2022-03-27Code
15Flan-PaLM (8B, Few-shot)0.345NoLarge Language Models Encode Clinical Knowledge2022-12-26Code
16BLOOM (few-shot, k=5)0.325NoGalactica: A Large Language Model for Science2022-11-16Code
17OPT (few-shot, k=5)0.296NoGalactica: A Large Language Model for Science2022-11-16Code
18PaLM (8B, Few-shot)0.267NoLarge Language Models Encode Clinical Knowledge2022-12-26Code