Question Answering on DROP Test
Metric: F1 (higher is better)
Results
| # | Model | F1 | Extra Data | Paper | Date | Code |
|---|-------|----|------------|-------|------|------|
| 1 | QDGAT (ensemble) | 88.38 | No | Question Directed Graph Attention Network for Nu... | 2020-09-16 | - |
| 2 | POET | 87.6 | Yes | Reasoning Like Program Executors | 2022-01-27 | Code |
| 3 | PaLM 2 (few-shot) | 85 | No | PaLM 2 Technical Report | 2023-05-17 | Code |
| 4 | BERT+Calculator (ensemble) | 81.78 | No | Giving BERT a Calculator: Finding Operations and... | 2019-08-31 | - |
| 5 | NeRd | 81.71 | No | - | - | - |
| 6 | GPT-4 (few-shot, k=3) | 80.9 | No | GPT-4 Technical Report | 2023-03-15 | Code |
| 7 | TASE-BERT | 80.7 | No | A Simple and Effective Model for Answering Multi... | 2019-09-29 | Code |
| 8 | MTMSN Large | 79.88 | No | A Multi-Type Multi-Span Network for Reading Comp... | 2019-08-15 | Code |
| 9 | GenBERT (+ND+TD) | 72.4 | No | Injecting Numerical Reasoning Skills into Langua... | 2020-04-09 | Code |
| 10 | NumNet | 67.97 | No | NumNet: Machine Reading Comprehension with Numer... | 2019-10-15 | Code |
| 11 | GPT 3.5 (few-shot, k=3) | 64.1 | No | GPT-4 Technical Report | 2023-03-15 | Code |
| 12 | Orca 2-7B | 60.26 | No | Orca 2: Teaching Small Language Models How to Re... | 2023-11-18 | - |
| 13 | Orca 2-13B | 57.97 | No | Orca 2: Teaching Small Language Models How to Re... | 2023-11-18 | - |
| 14 | NAQA Net | 47.01 | No | DROP: A Reading Comprehension Benchmark Requirin... | 2019-03-01 | Code |
| 15 | GPT-3 175B (few-shot, k=32) | 36.5 | No | Language Models are Few-Shot Learners | 2020-05-28 | Code |
| 16 | BERT | 32.7 | No | DROP: A Reading Comprehension Benchmark Requirin... | 2019-03-01 | Code |