GVLQA
Graph Vision-Language Question-Answering
Graphs | Images | MIT | Introduced 2024-02-03
GVLQA is the first vision-language QA dataset for general graph reasoning. It contains a base set, GVLQA-BASE, and four image-augmented subsets, GVLQA-AUGLY, GVLQA-AUGNO, GVLQA-AUGNS, and GVLQA-AUGET, whose samples correspond to those in the base set. It covers 7 graph reasoning tasks: cycle detection, connectivity, topological ordering, shortest path, maximum flow, bipartite matching number, and Hamiltonian path. The dataset can be used to:
- evaluate the graph reasoning capabilities of VLMs or LLMs;
- help models acquire fundamental graph comprehension and reasoning abilities as a pretraining dataset.
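
To make the task list concrete, here is a minimal Python sketch of how reference answers for three of the seven tasks (connectivity, cycle detection, topological ordering) could be computed on a toy graph. This is not the dataset's own generation or evaluation code: the function names and the graph representation (a node count plus a list of edge pairs) are assumptions made for illustration only.

```python
from collections import deque


def is_connected(n, edges, s, t):
    """Connectivity: is there a path between s and t in an undirected graph?"""
    adj = {v: [] for v in range(n)}
    for u, v in edges:
        adj[u].append(v)
        adj[v].append(u)
    seen, queue = {s}, deque([s])
    while queue:  # breadth-first search from s
        u = queue.popleft()
        if u == t:
            return True
        for v in adj[u]:
            if v not in seen:
                seen.add(v)
                queue.append(v)
    return False


def has_cycle(n, edges):
    """Cycle detection in a simple undirected graph using union-find.

    Assumes no duplicate edges; a repeated edge would be reported as a cycle.
    """
    parent = list(range(n))

    def find(x):
        while parent[x] != x:
            parent[x] = parent[parent[x]]  # path halving
            x = parent[x]
        return x

    for u, v in edges:
        ru, rv = find(u), find(v)
        if ru == rv:  # u and v are already connected, so this edge closes a cycle
            return True
        parent[ru] = rv
    return False


def topological_order(n, edges):
    """Topological ordering of a directed graph (Kahn's algorithm).

    Returns one valid ordering, or None if the graph contains a directed cycle.
    """
    adj = {v: [] for v in range(n)}
    indegree = [0] * n
    for u, v in edges:
        adj[u].append(v)
        indegree[v] += 1
    queue = deque(v for v in range(n) if indegree[v] == 0)
    order = []
    while queue:
        u = queue.popleft()
        order.append(u)
        for v in adj[u]:
            indegree[v] -= 1
            if indegree[v] == 0:
                queue.append(v)
    return order if len(order) == n else None
```

A model's answer to a question such as "Is there a path between node 0 and node 2?" could then be compared against `is_connected(n, edges, 0, 2)`. For topological ordering, several answers may be valid, so a checker would need to verify the ordering rather than compare it to a single reference.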