Single

Reported on 19 benchmarks across 3 tasks

Note: results are matched by exact model name. Different papers may use the same name for different model variants.

Natural Language Processing7 results

DialogueonVisual Dialog v1.0 test-std
MRR (x 100)
45.75
best: 71.24 (MRR ensemble (Naive))
DialogueonVisual Dialog v1.0 test-std
Mean
6.54
best: 49.61 (qqhe)
DialogueonVisual Dialog v1.0 test-std
NDCG (x 100)
78.7
DialogueonVisual Dialog v1.0 test-std
R@1
29.5
best: 58.3 (2 Step: Factor Graph Attention + VD-Bert)
DialogueonVisual Dialog v1.0 test-std
R@10
82.45
best: 95.08 (Ensemble FGA + BERT)
DialogueonVisual Dialog v1.0 test-std
R@5
65.7
best: 88.42 (Ensemble FGA + BERT)

Visual DialogonVisual Dialog v1.0 test-std
MRR (x 100)
45.75
best: 71.24 (MRR ensemble (Naive))
Visual DialogonVisual Dialog v1.0 test-std
Mean
6.54
best: 49.61 (qqhe)
Visual DialogonVisual Dialog v1.0 test-std
NDCG (x 100)
78.7
Visual DialogonVisual Dialog v1.0 test-std
R@1
29.5
best: 58.3 (2 Step: Factor Graph Attention + VD-Bert)
Visual DialogonVisual Dialog v1.0 test-std
R@10
82.45
best: 95.08 (Ensemble FGA + BERT)
Visual DialogonVisual Dialog v1.0 test-std
R@5
65.7
best: 88.42 (Ensemble FGA + BERT)