CapGaze

Consists of eye movements and verbal descriptions recorded synchronously over images.

Source: Human Attention in Image Captioning: Dataset and Analysis