GD-VCR

ImagesTextsUnknownIntroduced 2021-09-14

Geo-Diverse Visual Commonsense Reasoning (GD-VCR) is a new dataset to test vision-and-language models' ability to understand cultural and geo-location-specific commonsense.

Image source: https://arxiv.org/pdf/2109.06860v1.pdf