COCO-Text

ImagesTextsCreative Commons Attribution 4.0 LicenseIntroduced 2016-01-01

The COCO-Text dataset is a dataset for text detection and recognition. It is based on the MS COCO dataset, which contains images of complex everyday scenes. The COCO-Text dataset contains non-text images, legible text images and illegible text images. In total there are 22184 training images and 7026 validation images with at least one instance of legible text.

Source: Improving Text Proposals for Scene Images with Fully Convolutional Networks Image Source: https://vision.cornell.edu/se3/coco-text-2/