GD-VCR
Images, Texts | Unknown | Introduced 2021-09-14
Geo-Diverse Visual Commonsense Reasoning (GD-VCR) is a dataset for testing how well vision-and-language models understand cultural and geo-location-specific commonsense.
Image source: https://arxiv.org/pdf/2109.06860v1.pdf