Ensemble

Reported on 16 benchmarks across 4 tasks · 2 papers · 4 SOTA

Note: results are matched by exact model name. Different papers may use the same name for different model variants.

Audio6 results

DialogueonVisual Dialog v1.0 test-std
MRR (x 100)
51.17
best: 71.24 (MRR ensemble (Naive))
DialogueonVisual Dialog v1.0 test-std
Mean
6.69
best: 49.61 (qqhe)
DialogueonVisual Dialog v1.0 test-std
NDCG (x 100)
75.35
best: 78.7 (Single)
DialogueonVisual Dialog v1.0 test-std
R@1
38.9
best: 58.3 (2 Step: Factor Graph Attention + VD-Bert)
DialogueonVisual Dialog v1.0 test-std
R@10
77.98
best: 95.08 (Ensemble FGA + BERT)
DialogueonVisual Dialog v1.0 test-std
R@5
62.82
best: 88.42 (Ensemble FGA + BERT)

Visual DialogonVisual Dialog v1.0 test-std
MRR (x 100)
51.17
best: 71.24 (MRR ensemble (Naive))
Visual DialogonVisual Dialog v1.0 test-std
Mean
6.69
best: 49.61 (qqhe)
Visual DialogonVisual Dialog v1.0 test-std
NDCG (x 100)
75.35
best: 78.7 (Single)
Visual DialogonVisual Dialog v1.0 test-std
R@1
38.9
best: 58.3 (2 Step: Factor Graph Attention + VD-Bert)
Visual DialogonVisual Dialog v1.0 test-std
R@10
77.98
best: 95.08 (Ensemble FGA + BERT)
Visual DialogonVisual Dialog v1.0 test-std
R@5
62.82
best: 88.42 (Ensemble FGA + BERT)