TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Models/Single

Single

Reported on 19 benchmarks across 3 tasks

Note: results are matched by exact model name. Different papers may use the same name for different model variants.

Natural Language Processing7 results

  • Visual Question Answering (VQA)onGQA Test2019
    Accuracy
    63.2
    best: 89.3 (human)
  • Visual Question Answering (VQA)onGQA Test2019
    Binary
    77.91
    best: 91.2 (human)
  • Visual Question Answering (VQA)onGQA Test2019
    Consistency
    89.84
    best: 98.4 (human)
  • Visual Question Answering (VQA)onGQA Test2019
    Distribution
    5.25
    best: 93.08 (GlobalPrior)
  • Visual Question Answering (VQA)onGQA Test2019
    Open
    50.22
    best: 87.4 (human)
  • Visual Question Answering (VQA)onGQA Test2019
    Plausibility
    85.15
    best: 97.2 (human)
  • Visual Question Answering (VQA)onGQA Test2019
    Validity
    96.47
    best: 98.9 (human)

Speech6 results

  • DialogueonVisual Dialog v1.0 test-std
    MRR (x 100)
    45.75
    best: 71.24 (MRR ensemble (Naive))
  • DialogueonVisual Dialog v1.0 test-std
    Mean
    6.54
    best: 49.61 (qqhe)
  • DialogueonVisual Dialog v1.0 test-std
    NDCG (x 100)
    78.7
  • DialogueonVisual Dialog v1.0 test-std
    R@1
    29.5
    best: 58.3 (2 Step: Factor Graph Attention + VD-Bert)
  • DialogueonVisual Dialog v1.0 test-std
    R@10
    82.45
    best: 95.08 (Ensemble FGA + BERT)
  • DialogueonVisual Dialog v1.0 test-std
    R@5
    65.7
    best: 88.42 (Ensemble FGA + BERT)

Computer Vision6 results

  • Visual DialogonVisual Dialog v1.0 test-std
    MRR (x 100)
    45.75
    best: 71.24 (MRR ensemble (Naive))
  • Visual DialogonVisual Dialog v1.0 test-std
    Mean
    6.54
    best: 49.61 (qqhe)
  • Visual DialogonVisual Dialog v1.0 test-std
    NDCG (x 100)
    78.7
  • Visual DialogonVisual Dialog v1.0 test-std
    R@1
    29.5
    best: 58.3 (2 Step: Factor Graph Attention + VD-Bert)
  • Visual DialogonVisual Dialog v1.0 test-std
    R@10
    82.45
    best: 95.08 (Ensemble FGA + BERT)
  • Visual DialogonVisual Dialog v1.0 test-std
    R@5
    65.7
    best: 88.42 (Ensemble FGA + BERT)