Metric: 1/4 (higher is better)
| # | Model↕ | 1/4▼ | Extra Data | Paper | Date↕ | Code |
|---|---|---|---|---|---|---|
| 1 | CFMMC-Align | 50.2 | No | - | - | - |
| 2 | Tem-adapter | 46 | No | Tem-adapter: Adapting Image-Text Pretraining for... | 2023-08-16 | Code |
| 3 | Eclipse | 37.05 | No | SUTD-TrafficQA: A Question Answering Benchmark a... | 2021-03-29 | Code |
| 4 | HCRN | 36.49 | No | Hierarchical Conditional Relation Networks for V... | 2020-02-25 | Code |
| 5 | TVQA | 35.16 | No | TVQA: Localized, Compositional Video Question An... | 2018-09-05 | Code |
| 6 | VIS+LST | 29.91 | No | Exploring Models and Data for Image Question Ans... | 2015-05-08 | Code |