Visual Question Answering (VQA) on VCR (Q-AR) dev

Metric: Accuracy (higher is better)

LeaderboardDataset
Loading chart...
#ModelAccuracyExtra DataPaperDateCode
1VL-BERTLARGE58.9NoVL-BERT: Pre-training of Generic Visual-Linguist...2019-08-22Code
2VL-BERTBASE55.2NoVL-BERT: Pre-training of Generic Visual-Linguist...2019-08-22Code
3VisualBERT52.2NoVisualBERT: A Simple and Performant Baseline for...2019-08-09Code