TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

SotA/Natural Language Processing/Visual Question Answering (VQA)/COCO Visual Question Answering (VQA) real images 1.0 multiple choice

Visual Question Answering (VQA) on COCO Visual Question Answering (VQA) real images 1.0 multiple choice

Metric: Percentage correct (higher is better)

LeaderboardDataset
Loading chart...

Results

Submit a result
#Model↕Percentage correct▼Extra DataPaperDate↕Code
1MCB 7 att.70.1NoMultimodal Compact Bilinear Pooling for Visual Q...2016-06-06Code
2Dual-MFA70.04NoCo-attending Free-form Regions and Detections wi...2017-11-18Code
3RelAtt69.6NoR-VQA: Learning Visual Relation Facts with Seman...2018-05-24Code
43-Modalities: Unary + Pairwise + Ternary (ResNet)69.3NoHigh-Order Attention Models for Visual Question ...2017-11-12Code
5joint-loss67.3NoTraining Recurrent Answering Units with Joint Lo...2016-06-12-
6MRN66.3NoMultimodal Residual Learning for Visual QA2016-06-05Code
7HQI+ResNet66.1NoHierarchical Question-Image Co-Attention for Vis...2016-05-31Code
8FDA64.2NoA Focused Dynamic Attention Model for Visual Que...2016-04-06-
9LSTM Q+I63.1NoVQA: Visual Question Answering2015-05-03Code
10iBOWIMG baseline62NoSimple Baseline for Visual Question Answering2015-12-07Code