Visual Question Answering (VQA) on VCR (Q-A) dev

Metric: Accuracy (higher is better)

LeaderboardDataset
Loading chart...
#ModelAccuracyExtra DataPaperDateCode
1VL-BERTLARGE75.5NoVL-BERT: Pre-training of Generic Visual-Linguist...2019-08-22Code
2VL-BERTBASE73.8NoVL-BERT: Pre-training of Generic Visual-Linguist...2019-08-22Code
3VisualBERT70.8NoVisualBERT: A Simple and Performant Baseline for...2019-08-09Code