Metric: Accuracy (higher is better)
| # | Model | Accuracy | Extra Data | Paper | Date | Code |
|---|---|---|---|---|---|---|
| 1 | SAAA (ResNet) | 64.5 | No | Show, Ask, Attend, and Answer: A Strong Baseline... | 2017-04-11 | Code |
| 2 | DAN (ResNet) | 64.3 | No | Dual Attention Networks for Multimodal Reasoning... | 2016-11-02 | Code |
| 3 | MCB (ResNet) | 64.2 | No | Multimodal Compact Bilinear Pooling for Visual Q... | 2016-06-06 | Code |
| 4 | RAU (ResNet) | 63.3 | No | Training Recurrent Answering Units with Joint Lo... | 2016-06-12 | - |
| 5 | HieCoAtt (ResNet) | 61.8 | No | Hierarchical Question-Image Co-Attention for Vis... | 2016-05-31 | Code |
| 6 | DMN+ | 60.3 | No | Dynamic Memory Networks for Visual and Textual Q... | 2016-03-04 | Code |
| 7 | NMN+LSTM+FT | 58.6 | No | Neural Module Networks | 2015-11-09 | Code |