SBU Captions Dataset
A collection that allows researchers to approach the extremely challenging problem of description generation using relatively simple non-parametric methods and produces surprisingly effective results.
Source: Im2Text: Describing Images Using 1 Million Captioned Photographs