G-VUE

General-purpose Visual Understanding Evaluation

ImagesTextsIntroduced 2022-11-28

General-purpose Visual Understanding Evaluation (G-VUE) is a comprehensive benchmark covering the full spectrum of visual cognitive abilities with four functional domains -- Perceive, Ground, Reason, and Act. The four domains are embodied in 11 carefully curated tasks, from 3D reconstruction to visual reasoning and manipulation.

Source: Perceive, Ground, Reason, and Act: A Benchmark for General-purpose Visual Representation

Image Source: https://github.com/wllmzhu/G-VUE