Tasks
SotA
Datasets
Papers
Methods
Submit
About
SotA
/
Speech
/
Dialogue
/
Visual Dialog v1.0 test-std
Dialogue on Visual Dialog v1.0 test-std
Metric: R@10 (higher is better)
Leaderboard
Dataset
Loading chart...
Results
Submit a result
Export CSV
Sort:
R@10 (best first)
R@10 (worst first)
Date (newest first)
Date (oldest first)
Model name (A→Z)
#
Model
↕
R@10
▼
Extra Data
Paper
Date
↕
Code
1
Ensemble FGA + BERT
95.08
No
-
-
-
2
MRR ensemble (Naive)
94.45
No
-
-
-
3
5xFGA (F-RCNNx101)
94.05
No
Factor Graph Attention
2019-04-11
Code
4
single model
93.7
No
-
-
-
5
w/ VQA + CC, single model
93.25
No
-
-
-
6
test1
93.25
No
-
-
-
7
sh101
93.25
No
-
-
-
8
single-model
93.15
No
-
-
-
9
CAF
93.1
No
-
-
-
10
Transformer+2cons
92.5
No
-
-
-
11
SCL_48
92.27
No
-
-
-
12
single-model
92
No
-
-
-
13
Bert2constraints
91.97
No
-
-
-
14
clean_wac_4freeze
91.67
No
-
-
-
15
Two-Step(refactor)
90.83
No
-
-
-
16
MVAN
90.65
No
Multi-View Attention Network for Visual Dialog
2020-04-29
Code
17
1
90.6
No
-
-
-
18
jiuyigedian
90.38
No
-
-
-
19
disc
90.18
No
-
-
-
20
CAG
90.15
No
Iterative Context-Aware Graph Inference for Visu...
2020-04-05
Code
21
gr
90.05
No
-
-
-
22
zuizhong
90.03
No
-
-
-
23
CARE(Single Model)
89.95
No
-
-
-
24
211
89.83
No
-
-
-
25
RVA
89.83
No
Recursive Visual Attention in Visual Dialog
2018-12-06
Code
26
eightepoch
89.72
No
-
-
-
27
DualVD
89.7
No
DualVD: An Adaptive Dual Encoding Model for Deep...
2019-11-17
Code
28
zxcdd
89.65
No
-
-
-
29
2 Step: Factor Graph Attention + VD-Bert
89.6
No
Ensemble of MRR and NDCG models for Visual Dialog
2021-04-15
Code
30
Bert(two-stream)
89.6
No
-
-
-
31
ERIC666
89.6
No
-
-
-
32
kbgn_disc_5
89.48
No
-
-
-
33
HACAN
89.45
No
Making History Matter: History-Advantage Sequenc...
2019-02-25
-
34
DAN
89.35
No
Dual Attention Networks for Visual Reference Res...
2019-02-25
Code
35
lijunlin_9
89.25
No
-
-
-
36
lijunlin_7
89.15
No
-
-
-
37
CorefNMN (ResNet-152)
88.8
No
Visual Coreference Resolution in Visual Dialog u...
2018-09-06
Code
38
wqedasd(single model)
88.6
No
-
-
-
39
lkh(single-model)
88.35
No
-
-
-
40
adasd
87.9
No
-
-
-
41
GNN
87.83
No
Reasoning Visual Dialogs with Structural and Par...
2019-04-11
Code
42
DLC-4
87.42
No
-
-
-
43
NMN
86.88
No
Learning to Reason: End-to-End Module Networks f...
2017-04-18
Code
44
jkl
86.48
No
-
-
-
45
sdfsdaf
86.42
No
-
-
-
46
ensemble, finetune
84.52
No
-
-
-
47
P1P2+Distill+Ensemble
83.78
No
-
-
-
48
bert-double-stream-finetuning
83.33
No
-
-
-
49
MN-QIH-D
83.3
No
Visual Dialog
2016-11-26
Code
50
paratraining1epoch
83.1
No
-
-
-
51
MN-QIH-D
82.83
No
Visual Dialog
2016-11-26
Code
52
VD-PCR
82.75
No
-
-
-
53
Single
82.45
No
-
-
-
54
1
82.4
No
-
-
-
55
shanshandu
82.38
No
-
-
-
56
Ensemble + Fine-tuning
82.17
No
-
-
-
57
1
81.9
No
-
-
-
58
1
81.73
No
-
-
-
59
1
81.7
No
-
-
-
60
7
81.62
No
-
-
-
61
HRE-QIH-D
81.5
No
Visual Dialog
2016-11-26
Code
62
Ensemble + Finetune
80.65
No
Efficient Attention Mechanism for Visual Dialog ...
2019-11-26
Code
63
CE-finetuned, single model
80
No
-
-
-
64
Disc, Dense, 4 Ensemble.
79.77
No
-
-
-
65
gat_disc_relto_4
79.72
No
-
-
-
66
10
78.25
No
-
-
-
67
2
78.12
No
-
-
-
68
Ensemble
77.98
No
-
-
-
69
5-2
77.75
No
-
-
-
70
5_4
77.53
No
-
-
-
71
20
77.33
No
-
-
-
72
2
76.55
No
-
-
-
73
simple_test
74.67
No
-
-
-
74
gat_disc_3
74.15
No
-
-
-
75
5TS
70.12
No
-
-
-
76
mvan_len40_test
65.8
No
-
-
-
77
trainval_ch_9
61.7
No
-
-
-
78
czczx
53.05
No
-
-
-
79
qqhe
12.22
No
-
-
-
#1
Ensemble FGA + BERT
95.08
R@10
No paper
#2
MRR ensemble (Naive)
94.45
R@10
No paper
#3
5xFGA (F-RCNNx101)
SOTA
94.05
R@10
· 2019-04-11
Factor Graph Attention
Code
#4
single model
93.7
R@10
No paper
#5
w/ VQA + CC, single model
93.25
R@10
No paper
#6
test1
93.25
R@10
No paper
#7
sh101
93.25
R@10
No paper
#8
single-model
93.15
R@10
No paper
#9
CAF
93.1
R@10
No paper
#10
Transformer+2cons
92.5
R@10
No paper
#11
SCL_48
92.27
R@10
No paper
#12
single-model
92
R@10
No paper
#13
Bert2constraints
91.97
R@10
No paper
#14
clean_wac_4freeze
91.67
R@10
No paper
#15
Two-Step(refactor)
90.83
R@10
No paper
#16
MVAN
90.65
R@10
· 2020-04-29
Multi-View Attention Network for Visual Dialog
Code
#17
1
90.6
R@10
No paper
#18
jiuyigedian
90.38
R@10
No paper
#19
disc
90.18
R@10
No paper
#20
CAG
90.15
R@10
· 2020-04-05
Iterative Context-Aware Graph Inference for Visual Dialog
Code
#21
gr
90.05
R@10
No paper
#22
zuizhong
90.03
R@10
No paper
#23
CARE(Single Model)
89.95
R@10
No paper
#24
211
89.83
R@10
No paper
#25
RVA
SOTA
89.83
R@10
· 2018-12-06
Recursive Visual Attention in Visual Dialog
Code
#26
eightepoch
89.72
R@10
No paper
#27
DualVD
89.7
R@10
· 2019-11-17
DualVD: An Adaptive Dual Encoding Model for Deep Visual Understanding in Visual Dialogue
Code
#28
zxcdd
89.65
R@10
No paper
#29
2 Step: Factor Graph Attention + VD-Bert
89.6
R@10
· 2021-04-15
Ensemble of MRR and NDCG models for Visual Dialog
Code
#30
Bert(two-stream)
89.6
R@10
No paper
#31
ERIC666
89.6
R@10
No paper
#32
kbgn_disc_5
89.48
R@10
No paper
#33
HACAN
89.45
R@10
· 2019-02-25
Making History Matter: History-Advantage Sequence Training for Visual Dialog
#34
DAN
89.35
R@10
· 2019-02-25
Dual Attention Networks for Visual Reference Resolution in Visual Dialog
Code
#35
lijunlin_9
89.25
R@10
No paper
#36
lijunlin_7
89.15
R@10
No paper
#37
CorefNMN (ResNet-152)
SOTA
88.8
R@10
· 2018-09-06
Visual Coreference Resolution in Visual Dialog using Neural Module Networks
Code
#38
wqedasd(single model)
88.6
R@10
No paper
#39
lkh(single-model)
88.35
R@10
No paper
#40
adasd
87.9
R@10
No paper
#41
GNN
87.83
R@10
· 2019-04-11
Reasoning Visual Dialogs with Structural and Partial Observations
Code
#42
DLC-4
87.42
R@10
No paper
#43
NMN
SOTA
86.88
R@10
· 2017-04-18
Learning to Reason: End-to-End Module Networks for Visual Question Answering
Code
#44
jkl
86.48
R@10
No paper
#45
sdfsdaf
86.42
R@10
No paper
#46
ensemble, finetune
84.52
R@10
No paper
#47
P1P2+Distill+Ensemble
83.78
R@10
No paper
#48
bert-double-stream-finetuning
83.33
R@10
No paper
#49
MN-QIH-D
SOTA
83.3
R@10
· 2016-11-26
Visual Dialog
Code
#50
paratraining1epoch
83.1
R@10
No paper
#51
MN-QIH-D
82.83
R@10
· 2016-11-26
Visual Dialog
Code
#52
VD-PCR
82.75
R@10
No paper
#53
Single
82.45
R@10
No paper
#54
1
82.4
R@10
No paper
#55
shanshandu
82.38
R@10
No paper
#56
Ensemble + Fine-tuning
82.17
R@10
No paper
#57
1
81.9
R@10
No paper
#58
1
81.73
R@10
No paper
#59
1
81.7
R@10
No paper
#60
7
81.62
R@10
No paper
#61
HRE-QIH-D
81.5
R@10
· 2016-11-26
Visual Dialog
Code
#62
Ensemble + Finetune
80.65
R@10
· 2019-11-26
Efficient Attention Mechanism for Visual Dialog that can Handle All the Interactions between Multiple Inputs
Code
#63
CE-finetuned, single model
80
R@10
No paper
#64
Disc, Dense, 4 Ensemble.
79.77
R@10
No paper
#65
gat_disc_relto_4
79.72
R@10
No paper
#66
10
78.25
R@10
No paper
#67
2
78.12
R@10
No paper
#68
Ensemble
77.98
R@10
No paper
#69
5-2
77.75
R@10
No paper
#70
5_4
77.53
R@10
No paper
#71
20
77.33
R@10
No paper
#72
2
76.55
R@10
No paper
#73
simple_test
74.67
R@10
No paper
#74
gat_disc_3
74.15
R@10
No paper
#75
5TS
70.12
R@10
No paper
#76
mvan_len40_test
65.8
R@10
No paper
#77
trainval_ch_9
61.7
R@10
No paper
#78
czczx
53.05
R@10
No paper
#79
qqhe
12.22
R@10
No paper