TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

SotA/Natural Language Processing/Question Answering/MedMCQA

Question Answering on MedMCQA

Metric: Test Set (Acc-%) (higher is better)

LeaderboardDataset
Loading chart...

Results

Submit a result
#Model↕Test Set (Acc-%)▼Extra DataPaperDate↕Code
1Med-PaLM 2 (ER)0.723NoTowards Expert-Level Medical Question Answering ...2023-05-16Code
2Med-PaLM 2 (CoT+SC)0.715NoTowards Expert-Level Medical Question Answering ...2023-05-16Code
3Med-PaLM 2 (5-shot)0.713NoTowards Expert-Level Medical Question Answering ...2023-05-16Code
4VOD (BioLinkBERT)0.629NoVariational Open-Domain Question Answering2022-09-23Code
5Codex 5-shot CoT0.627NoCan large language models reason about medical q...2022-07-17Code
6BioMedGPT-10B0.514NoBioMedGPT: Open Multimodal Generative Pre-traine...2023-08-18Code
7PubmedBERT(Gu et al., 2022)0.41NoMedMCQA : A Large-scale Multi-Subject Multi-Choi...2022-03-27Code
8SciBERT (Beltagy et al., 2019)0.39NoMedMCQA : A Large-scale Multi-Subject Multi-Choi...2022-03-27Code
9BioBERT (Lee et al.,2020)0.37NoMedMCQA : A Large-scale Multi-Subject Multi-Choi...2022-03-27Code
10BERT (Devlin et al., 2019)-Base0.33NoMedMCQA : A Large-scale Multi-Subject Multi-Choi...2022-03-27Code