Visual Dialog on Visual Dialog v1.0 test-std
Metric: R@5 (higher is better)
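R@5 is commonly read as recall at 5: the percentage of questions for which the ground-truth answer appears among the model's top 5 ranked candidate answers. The sketch below assumes that reading. The function name `recall_at_k` and the input format (a list of 1-based ranks) are illustrative assumptions, not the benchmark's official evaluation code.

```python
from typing import Sequence


def recall_at_k(gt_ranks: Sequence[int], k: int = 5) -> float:
    """Return the percentage of questions whose ground-truth answer ranks in the top k.

    gt_ranks holds, for each question, the 1-based rank the model assigned to
    the ground-truth answer among its candidates (hypothetical input format).
    """
    if not gt_ranks:
        raise ValueError("gt_ranks must not be empty")
    # A question counts as a hit when the ground-truth rank is k or better.
    hits = sum(1 for rank in gt_ranks if rank <= k)
    return 100.0 * hits / len(gt_ranks)


# Made-up ranks for eight questions; five of them fall within the top 5.
example_ranks = [1, 3, 7, 5, 12, 2, 60, 4]
score = recall_at_k(example_ranks, k=5)
print(f"R@5 = {score:.2f}")
```

Scores are expressed on a 0 to 100 scale here so they are directly comparable with the values in the results table.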
Results
| # | Model | R@5 | Extra Data | Paper | Date | Code | Tag |
|---|-------|-----|------------|-------|------|------|-----|
| 1 | Ensemble FGA + BERT | 88.42 | No | - | - | - | - |
| 2 | MRR ensemble (Naive) | 87.55 | No | - | - | - | - |
| 3 | 5xFGA (F-RCNNx101) | 86.73 | No | Factor Graph Attention | 2019-04-11 | Code | SOTA |
| 4 | single model | 85.05 | No | - | - | - | - |
| 5 | sh101 | 85.02 | No | - | - | - | - |
| 6 | CAF | 84.95 | No | - | - | - | - |
| 7 | w/ VQA + CC, single model | 84.67 | No | - | - | - | - |
| 8 | test1 | 84.67 | No | - | - | - | - |
| 9 | Transformer+2cons | 84.12 | No | - | - | - | - |
| 10 | SCL_48 | 84.1 | No | - | - | - | - |
| 11 | Two-Step(refactor) | 83.85 | No | - | - | - | - |
| 12 | single-model | 83.15 | No | - | - | - | - |
| 13 | Bert2constraints | 82.97 | No | - | - | - | - |
| 14 | clean_wac_4freeze | 82.23 | No | - | - | - | - |
| 15 | 2 Step: Factor Graph Attention + VD-Bert | 81.55 | No | Ensemble of MRR and NDCG models for Visual Dialog | 2021-04-15 | Code | - |
| 16 | single-model | 81.55 | No | - | - | - | - |
| 17 | zuizhong | 81.25 | No | - | - | - | - |
| 18 | MVAN | 81.12 | No | Multi-View Attention Network for Visual Dialog | 2020-04-29 | Code | - |
| 19 | jiuyigedian | 81 | No | - | - | - | - |
| 20 | ERIC666 | 81 | No | - | - | - | - |
| 21 | 1 | 80.92 | No | - | - | - | - |
| 22 | gr | 80.92 | No | - | - | - | - |
| 23 | disc | 80.83 | No | - | - | - | - |
| 24 | zxcdd | 80.8 | No | - | - | - | - |
| 25 | 211 | 80.77 | No | - | - | - | - |
| 26 | HACAN | 80.63 | No | Making History Matter: History-Advantage Sequence Training for Visual Dialog | 2019-02-25 | - | SOTA |
| 27 | CAG | 80.63 | No | Iterative Context-Aware Graph Inference for Visual Dialog | 2020-04-05 | Code | - |
| 28 | lijunlin_9 | 80.45 | No | - | - | - | - |
| 29 | eightepoch | 80.45 | No | - | - | - | - |
| 30 | Synergistic | 80.43 | No | Image-Question-Answer Synergistic Network for Visual Dialog | 2019-02-26 | - | - |
| 31 | RVA | 80.4 | No | Recursive Visual Attention in Visual Dialog | 2018-12-06 | Code | SOTA |
| 32 | CARE(Single Model) | 80.35 | No | - | - | - | - |
| 33 | DualVD | 80.23 | No | DualVD: An Adaptive Dual Encoding Model for Deep Visual Understanding in Visual Dialogue | 2019-11-17 | Code | - |
| 34 | kbgn_disc_5 | 80.1 | No | - | - | - | - |
| 35 | DAN | 79.75 | No | Dual Attention Networks for Visual Reference Resolution in Visual Dialog | 2019-02-25 | Code | - |
| 36 | Bert(two-stream) | 79.53 | No | - | - | - | - |
| 37 | lijunlin_7 | 79.47 | No | - | - | - | - |
| 38 | DLC-4 | 78.22 | No | - | - | - | - |
| 39 | lkh(single-model) | 78.1 | No | - | - | - | - |
| 40 | CorefNMN (ResNet-152) | 78.1 | No | Visual Coreference Resolution in Visual Dialog using Neural Module Networks | 2018-09-06 | Code | SOTA |
| 41 | wqedasd(single model) | 78 | No | - | - | - | - |
| 42 | GNN | 77.98 | No | Reasoning Visual Dialogs with Structural and Partial Observations | 2019-04-11 | Code | - |
| 43 | adasd | 77.53 | No | - | - | - | - |
| 44 | NMN | 76.88 | No | Learning to Reason: End-to-End Module Networks for Visual Question Answering | 2017-04-18 | Code | SOTA |
| 45 | jkl | 76.78 | No | - | - | - | - |
| 46 | sdfsdaf | 76.15 | No | - | - | - | - |
| 47 | paratraining1epoch | 73.45 | No | - | - | - | - |
| 48 | MN-QIH-D | 72.45 | No | Visual Dialog | 2016-11-26 | Code | SOTA |
| 49 | MN-QIH-D | 72.3 | No | Visual Dialog | 2016-11-26 | Code | - |
| 50 | bert-double-stream-finetuning | 70.75 | No | - | - | - | - |
| 51 | HRE-QIH-D | 70.45 | No | Visual Dialog | 2016-11-26 | Code | - |
| 52 | ensemble, finetune | 70.23 | No | - | - | - | - |
| 53 | gat_disc_relto_4 | 70.17 | No | - | - | - | - |
| 54 | shanshandu | 70.15 | No | - | - | - | - |
| 55 | 1 | 69.95 | No | - | - | - | - |
| 56 | 1 | 69.65 | No | - | - | - | - |
| 57 | 1 | 68.92 | No | - | - | - | - |
| 58 | P1P2+Distill+Ensemble | 68.9 | No | - | - | - | - |
| 59 | 1 | 68.67 | No | - | - | - | - |
| 60 | 7 | 68.45 | No | - | - | - | - |
| 61 | VD-PCR | 68.4 | No | - | - | - | - |
| 62 | Ensemble + Fine-tuning | 68.12 | No | - | - | - | - |
| 63 | Disc, Dense, 4 Ensemble. | 67.65 | No | - | - | - | - |
| 64 | Ensemble + Finetune | 66.6 | No | Efficient Attention Mechanism for Visual Dialog that can Handle All the Interactions between Multiple Inputs | 2019-11-26 | Code | - |
| 65 | gat_disc_3 | 65.85 | No | - | - | - | - |
| 66 | Single | 65.7 | No | - | - | - | - |
| 67 | 10 | 64.15 | No | - | - | - | - |
| 68 | CE-finetuned, single model | 64.12 | No | - | - | - | - |
| 69 | 20 | 63.28 | No | - | - | - | - |
| 70 | 5_4 | 62.98 | No | - | - | - | - |
| 71 | 5-2 | 62.88 | No | - | - | - | - |
| 72 | Ensemble | 62.82 | No | - | - | - | - |
| 73 | 2 | 62.42 | No | - | - | - | - |
| 74 | 2 | 60.38 | No | - | - | - | - |
| 75 | simple_test | 60.12 | No | - | - | - | - |
| 76 | mvan_len40_test | 56.47 | No | - | - | - | - |
| 77 | trainval_ch_9 | 54.97 | No | - | - | - | - |
| 78 | 5TS | 53.62 | No | - | - | - | - |
| 79 | czczx | 43.58 | No | - | - | - | - |
| 80 | qqhe | 7.22 | No | - | - | - | - |