ORGaze

Videos

A new video dataset for OR, with 30, 000 objects over 5, 000 stereo video sequences annotated for their descriptions and gaze.

Source: Object Referring in Videos with Language and Human Gaze