SBU Captions Dataset

A collection that allows researchers to approach the extremely challenging problem of description generation using relatively simple non-parametric methods and produces surprisingly effective results.

Source: Im2Text: Describing Images Using 1 Million Captioned Photographs