Question Answering on MedMCQA
Metric: Dev Set (Acc-%) (higher is better)
Results
#   Model                            Dev Set (Acc-%)  Extra Data  Paper                                                Date
--  -------------------------------  ---------------  ----------  ---------------------------------------------------  ----------
1   Meditron-70B (CoT + SC)          66               No          MEDITRON-70B: Scaling Medical Pretraining for La...  2023-11-27
2   Codex 5-shot CoT                 59.7             No          Can large language models reason about medical q...  2022-07-17
3   VOD (BioLinkBERT)                58.3             No          Variational Open-Domain Question Answering           2022-09-23
4   Flan-PaLM (540B, SC)             57.6             No          Large Language Models Encode Clinical Knowledge      2022-12-26
5   Flan-PaLM (540B, Few-shot)       56.5             No          Large Language Models Encode Clinical Knowledge      2022-12-26
6   PaLM (540B, Few-shot)            54.5             No          Large Language Models Encode Clinical Knowledge      2022-12-26
7   Flan-PaLM (540B, CoT)            53.6             No          Large Language Models Encode Clinical Knowledge      2022-12-26
8   GAL 120B (zero-shot)             52.9             No          Galactica: A Large Language Model for Science        2022-11-16
9   Flan-PaLM (62B, Few-shot)        46.2             No          Large Language Models Encode Clinical Knowledge      2022-12-26
10  PaLM (62B, Few-shot)             43.4             No          Large Language Models Encode Clinical Knowledge      2022-12-26
11  PubMedBERT (Gu et al., 2022)     40               No          MedMCQA : A Large-scale Multi-Subject Multi-Choi...  2022-03-27
12  SciBERT (Beltagy et al., 2019)   39               No          MedMCQA : A Large-scale Multi-Subject Multi-Choi...  2022-03-27
13  BioBERT (Lee et al., 2020)       38               No          MedMCQA : A Large-scale Multi-Subject Multi-Choi...  2022-03-27
14  BERT (Devlin et al., 2019)-Base  35               No          MedMCQA : A Large-scale Multi-Subject Multi-Choi...  2022-03-27
15  Flan-PaLM (8B, Few-shot)         34.5             No          Large Language Models Encode Clinical Knowledge      2022-12-26
16  BLOOM (few-shot, k=5)            32.5             No          Galactica: A Large Language Model for Science        2022-11-16
17  OPT (few-shot, k=5)              29.6             No          Galactica: A Large Language Model for Science        2022-11-16
18  PaLM (8B, Few-shot)              26.7             No          Large Language Models Encode Clinical Knowledge      2022-12-26
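
The leaderboard metric, dev-set accuracy in percent, is the share of questions for which the predicted option equals the gold option. A minimal sketch of that calculation follows, assuming each question has a single gold option label. The function name `accuracy_percent` and the toy labels are hypothetical and do not come from any listed paper or from the dataset's tooling.

```python
from typing import Sequence


def accuracy_percent(predictions: Sequence[str], gold: Sequence[str]) -> float:
    """Return the share of exactly matching answers, in percent."""
    if len(predictions) != len(gold):
        raise ValueError("predictions and gold must have the same length")
    if not gold:
        raise ValueError("cannot score an empty set of questions")
    # Count questions where the predicted option equals the gold option.
    correct = sum(p == g for p, g in zip(predictions, gold))
    return 100.0 * correct / len(gold)


# Hypothetical toy data: option labels for five questions (not real MedMCQA items).
gold = ["A", "C", "B", "D", "A"]
predictions = ["A", "C", "D", "D", "B"]
print(accuracy_percent(predictions, gold))  # 60.0
```

A score reported as a fraction (for example 0.597) corresponds to 59.7 on this percent scale.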