Metric: Binary (higher is better)
| # | Model↕ | Binary▼ | Extra Data | Paper | Date↕ | Code |
|---|---|---|---|---|---|---|
| 1 | human | 91.2 | No | - | - | - |
| 2 | DREAM+Unicoder-VL (MSRA) | 84.46 | No | - | - | - |
| 3 | VinVL-DPT | 82.63 | No | - | - | - |
| 4 | Single Model | 82.63 | No | VinVL: Revisiting Visual Representations in Visi... | 2021-01-02 | Code |
| 5 | VinVL+L | 82.59 | No | - | - | Code |
| 6 | TRRNet (Ensemble) | 82.12 | No | - | - | - |
| 7 | Coarse-to-Fine Reasoning, Single Model | 81.16 | No | - | - | - |
| 8 | MDETR | 80.91 | No | - | - | - |
| 9 | Wayne | 80.84 | No | - | - | - |
| 10 | MIL-nbgao | 80.8 | No | - | - | - |
| 11 | NSM ensemble (updated) | 80.45 | No | - | - | - |
| 12 | 1-gqa | 80.28 | No | - | - | - |
| 13 | LXR955, Ensemble | 79.79 | No | LXMERT: Learning Cross-Modality Encoder Represen... | 2019-08-20 | Code |
| 14 | Kakao Brain | 79.68 | No | - | - | - |
| 15 | Ensemble10 | 79.12 | No | - | - | - |
| 16 | Musan | 79.09 | No | - | - | - |
| 17 | NSM single (updated) | 78.94 | No | - | - | - |
| 18 | Meta Module, Single | 78.9 | No | - | - | - |
| 19 | GRN | 78.69 | No | Bilinear Graph Networks for Visual Question Answ... | 2019-07-23 | - |
| 20 | LININ | 78.44 | No | - | - | - |
| 21 | ckpt 19 exp 90 | 78.41 | No | - | - | - |
| 22 | UCM | 78.4 | No | - | - | - |
| 23 | lxmert-adv-txt | 78.07 | No | - | - | - |
| 24 | IQA (single) | 78.07 | No | - | - | - |
| 25 | mlmbert | 78.02 | No | - | - | - |
| 26 | fbe20v3.json | 78.02 | No | - | - | - |
| 27 | PVR | 78.02 | No | - | - | - |
| 28 | lxmert-adv-txt | 77.99 | No | - | - | - |
| 29 | DL16 | 77.98 | No | - | - | - |
| 30 | DAM | 77.97 | No | - | - | - |
| 31 | Single | 77.91 | No | - | - | - |
| 32 | MSM@MSRA | 77.84 | No | - | - | - |
| 33 | 45 | 77.83 | No | - | - | - |
| 34 | rishabh_test | 77.53 | No | - | - | - |
| 35 | 270 | 77.5 | No | - | - | - |
| 36 | xpj | 77.41 | No | - | - | - |
| 37 | Partial-MSP | 77.39 | No | - | - | - |
| 38 | fisher | 77.32 | No | - | - | - |
| 39 | UNITER + MAC + Graph Networks | 77.31 | No | - | - | - |
| 40 | Future_Test_team | 77.19 | No | - | - | - |
| 41 | LXR955, Single Model | 77.16 | No | LXMERT: Learning Cross-Modality Encoder Represen... | 2019-08-20 | Code |
| 42 | tmp | 77.15 | No | - | - | - |
| 43 | IIE_Morningstar | 77.13 | No | - | - | - |
| 44 | vv69 | 77.12 | No | - | - | - |
| 45 | mcmi | 77.11 | No | - | - | - |
| 46 | bert_v1 | 77.09 | No | - | - | - |
| 47 | full_nsp_ft_results_submit_predict.json | 76.99 | No | - | - | - |
| 48 | TESTOVQA007 | 76.97 | No | - | - | - |
| 49 | prompt IMT-16 | 76.87 | No | - | - | - |
| 50 | test gqa | 76.84 | No | - | - | - |
| 51 | Inspur | 76.84 | No | - | - | - |
| 52 | gaochongyang9 | 76.79 | No | - | - | - |
| 53 | SSRP | 76.77 | No | - | - | - |
| 54 | BgTest | 76.74 | No | - | - | - |
| 55 | LXMERT-S | 76.69 | No | - | - | - |
| 56 | happyTeam | 76.6 | No | - | - | - |
| 57 | ours-4-gqa_el_tag_v4__pretrain_rel_tag_dist_tc_v7_checkpoint-47-157510-best-4.json | 76.4 | No | - | - | - |
| 58 | stu09e | 76.39 | No | - | - | - |
| 59 | full_nsp_mlm_ft_joint_results_submit_predict.json | 76.37 | No | - | - | - |
| 60 | gbert1 | 76.08 | No | - | - | - |
| 61 | QGCRGN | 76.07 | No | - | - | - |
| 62 | BAN | 76 | No | - | - | - |
| 63 | UCAS-SARI | 75.91 | No | - | - | - |
| 64 | REX | 75.78 | No | - | - | - |
| 65 | VqaStar-UCAS-SARI | 75.37 | No | - | - | - |
| 66 | MLVQA (single) | 75.22 | No | - | - | - |
| 67 | glimple_all | 75.07 | No | - | - | - |
| 68 | rsa-14word | 75.07 | No | - | - | - |
| 69 | DeeTee | 75.07 | No | - | - | - |
| 70 | wcf-fight | 75.01 | No | - | - | - |
| 71 | GM6_9_2_train | 74.97 | No | - | - | - |
| 72 | Feb_ft2_mergeadd_weightalllstm_picklocw_box5_prep | 74.84 | No | - | - | - |
| 73 | RSN (Single Model) | 74.78 | No | - | - | - |
| 74 | total14 | 74.62 | No | - | - | - |
| 75 | graphRepresentation, Single | 74.54 | No | - | - | - |
| 76 | result_run_2647872_epoch11 | 74.46 | No | - | - | - |
| 77 | ST_VQA | 73.9 | No | - | - | - |
| 78 | LCGN | 73.77 | No | - | - | - |
| 79 | MMT-VQA | 73.73 | No | - | - | - |
| 80 | Testify | 73.65 | No | - | - | - |
| 81 | GIN | 73.56 | No | - | - | - |
| 82 | Improved SNMN | 73.4 | No | - | - | - |
| 83 | F205 | 73 | No | - | - | - |
| 84 | Deepblue_Semantics | 72.88 | No | - | - | - |
| 85 | nogg | 72.87 | No | - | - | - |
| 86 | LW | 72.86 | No | - | - | - |
| 87 | IWantADonut | 72.84 | No | - | - | - |
| 88 | LOGNet+VLR | 72.65 | No | - | - | - |
| 89 | abc_test | 72.65 | No | - | - | - |
| 90 | 5TMT-qe+o | 72.52 | No | - | - | - |
| 91 | HDU_ZWF | 72.42 | No | - | - | - |
| 92 | RSN (Single Model)_v6 | 72.39 | No | - | - | - |
| 93 | KU | 72.09 | No | - | - | - |
| 94 | RD | 71.81 | No | - | - | - |
| 95 | Eden_test | 71.7 | No | - | - | - |
| 96 | MAC | 71.23 | No | - | - | - |
| 97 | Sorbonne | 70.41 | No | - | - | - |
| 98 | test | 70.15 | No | - | - | - |
| 99 | Space Cat | 69.36 | No | - | - | - |
| 100 | vips | 69.3 | No | - | - | - |
| 101 | MJ | 69.15 | No | - | - | - |
| 102 | UJCNN | 68.46 | No | - | - | - |
| 103 | ZhaoLab | 68.44 | No | - | - | - |
| 104 | Mithrandir | 67.99 | No | - | - | - |
| 105 | happy | 67.82 | No | - | - | - |
| 106 | RAM_BUGGY | 67.59 | No | - | - | - |
| 107 | mac_qin | 67.35 | No | - | - | - |
| 108 | BottomUp | 66.64 | No | Bottom-Up and Top-Down Attention for Image Capti... | 2017-07-25 | Code |
| 109 | sparsemax15 | 66.57 | No | - | - | - |
| 110 | LAS | 66.28 | No | - | - | - |
| 111 | RES | 65.02 | No | - | - | - |
| 112 | 113 | 64.74 | No | - | - | - |
| 113 | mfb+bert | 63.85 | No | - | - | - |
| 114 | LSTM+CNN | 63.26 | No | - | - | - |
| 115 | LSTM | 61.9 | No | - | - | - |
| 116 | CHAIR | 61.21 | No | - | - | - |
| 117 | Academia Sinica | 61.18 | No | - | - | - |
| 118 | bear | 59.24 | No | - | - | - |
| 119 | test | 58.76 | No | - | - | - |
| 120 | Ediburgh-Mila-UCLA | 57.57 | No | - | - | - |
| 121 | Fj | 56.61 | No | - | - | - |
| 122 | Mycsulb | 55.24 | No | - | - | - |
| 123 | MReaL | 55.12 | No | - | - | - |
| 124 | LocalPrior | 47.9 | No | - | - | - |
| 125 | muc_ai | 45.69 | No | - | - | - |
| 126 | GlobalPrior | 42.94 | No | - | - | - |
| 127 | CNN | 36.05 | No | - | - | - |