Question Answering on CoQA

Metric: Overall (higher is better)

Results

#	Model	Overall	SOTA	Extra Data	Paper	Date	Code
1	GPT-3 175B (few-shot, k=32)	85	SOTA	No	Language Models are Few-Shot Learners	2020-05-28	Code
2	BERT Large Augmented (single model)	81.1	SOTA	No	BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding	2018-10-11	Code
3	SDNet (ensemble)	79.3	-	No	SDNet: Contextualized Attention-based Deep Network for Conversational Question Answering	2018-12-10	Code
4	BERT-base finetune (single model)	78.1	-	No	BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding	2018-10-11	Code
5	SDNet (single model)	76.6	-	No	SDNet: Contextualized Attention-based Deep Network for Conversational Question Answering	2018-12-10	Code
6	FlowQA (single model)	75	SOTA	No	FlowQA: Grasping Flow in History for Conversational Machine Comprehension	2018-10-06	Code
7	BiDAF++ (single model)	67.8	SOTA	No	A Qualitative Comparison of CoQA, SQuAD 2.0 and QuAC	2018-09-27	Code
8	DrQA + seq2seq with copy attention (single model)	65.1	SOTA	No	CoQA: A Conversational Question Answering Challenge	2018-08-21	Code
9	Vanilla DrQA (single model)	52.6	-	No	CoQA: A Conversational Question Answering Challenge	2018-08-21	Code