TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

SotA/Natural Language Processing/Visual Question Answering (VQA)/VQA v1 test-dev

Visual Question Answering (VQA) on VQA v1 test-dev

Metric: Accuracy (higher is better)

LeaderboardDataset
Loading chart...

Results

Submit a result
#Model↕Accuracy▼Extra DataPaperDate↕Code
1SAAA (ResNet)64.5NoShow, Ask, Attend, and Answer: A Strong Baseline...2017-04-11Code
2DAN (ResNet)64.3NoDual Attention Networks for Multimodal Reasoning...2016-11-02Code
3MCB (ResNet)64.2NoMultimodal Compact Bilinear Pooling for Visual Q...2016-06-06Code
4RAU (ResNet)63.3NoTraining Recurrent Answering Units with Joint Lo...2016-06-12-
5HieCoAtt (ResNet)61.8NoHierarchical Question-Image Co-Attention for Vis...2016-05-31Code
6DMN+60.3NoDynamic Memory Networks for Visual and Textual Q...2016-03-04Code
7NMN+LSTM+FT58.6NoNeural Module Networks2015-11-09Code