TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

SotA/Natural Language Processing/Visual Question Answering (VQA)/COCO Visual Question Answering (VQA) real images 1.0 open ended

Visual Question Answering (VQA) on COCO Visual Question Answering (VQA) real images 1.0 open ended

Metric: Percentage correct (higher is better)

LeaderboardDataset
Loading chart...

Results

Submit a result
#Model↕Percentage correct▼Extra DataPaperDate↕Code
1MCB 7 att.66.5NoMultimodal Compact Bilinear Pooling for Visual Q...2016-06-06Code
2Dual-MFA66.09NoCo-attending Free-form Regions and Detections wi...2017-11-18Code
3QGHC+Att+Concat65.9NoQuestion-Guided Hybrid Convolution for Visual Qu...2018-08-08-
4RelAtt65.69NoR-VQA: Learning Visual Relation Facts with Seman...2018-05-24Code
5joint-loss63.2NoTraining Recurrent Answering Units with Joint Lo...2016-06-12-
6HQI+ResNet62.1NoHierarchical Question-Image Co-Attention for Vis...2016-05-31Code
7MRN + global features61.8NoMultimodal Residual Learning for Visual QA2016-06-05Code
8DMN+ [xiong2016dynamic]60.4NoDynamic Memory Networks for Visual and Textual Q...2016-03-04Code
9CNN-RNN59.5NoImage Captioning and Visual Question Answering B...2016-03-09-
10FDA59.5NoA Focused Dynamic Attention Model for Visual Que...2016-04-06-
11SAN58.9NoStacked Attention Networks for Image Question An...2015-11-07Code
12LSTM Q+I58.2NoVQA: Visual Question Answering2015-05-03Code
13SMem-VQA58.2NoAsk, Attend and Answer: Exploring Question-Guide...2015-11-17Code
14iBOWIMG baseline55.9NoSimple Baseline for Visual Question Answering2015-12-07Code