Visual Question Answering (VQA) on GQA

Metric: Accuracy (higher is better)

LeaderboardDataset
Loading chart...
#ModelAccuracyExtra DataPaperDateCode
1PEVL+77NoPEVL: Position-enhanced Pre-training and Prompt ...2022-05-23Code
2RelViT65.54NoRelViT: Concept-guided Vision Transformer for Vi...2022-04-24Code
3LocVLM-L50.2NoLearning to Localize Objects Improves Spatial Re...2024-04-11Code