The SketchyCOCO dataset consists of two parts:
Object-level data
Object-level data contains triplets of {foreground sketch, foreground image, foreground edge map} examples covering 14 classes and pairs of {background sketch, background image} examples covering 3 classes.
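The object-level examples can be pictured as typed records. The following is a minimal sketch, not the repository's loader: the class names, field names, and the `count_by_class` helper are hypothetical, and only the grouping itself (triplets for foregrounds, pairs for backgrounds) comes from the description above.

```python
from dataclasses import dataclass
from pathlib import Path


@dataclass(frozen=True)
class ForegroundTriplet:
    """One foreground example; field names are illustrative assumptions."""
    sketch: Path      # foreground sketch
    image: Path       # foreground image
    edge_map: Path    # foreground edge map
    class_name: str   # one of the 14 foreground classes


@dataclass(frozen=True)
class BackgroundPair:
    """One background example; field names are illustrative assumptions."""
    sketch: Path      # background sketch
    image: Path       # background image
    class_name: str   # one of the 3 background classes


def count_by_class(examples):
    """Count examples per class label (hypothetical helper)."""
    counts = {}
    for example in examples:
        counts[example.class_name] = counts.get(example.class_name, 0) + 1
    return counts
```

A helper like `count_by_class` could be used to check that a local copy covers the expected number of classes, assuming the class label of each example can be recovered from however the files are organized.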
Scene-level data
Scene-level data contains pairs of {foreground image & background sketch, scene image} examples, pairs of {scene sketch, scene image} examples, and the segmentation ground truth for the scene sketches. To increase the number of validation images in SketchyCOCO, some validation scene images are taken from the training images of the COCO-Stuff dataset.
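The scene-level examples can be sketched the same way. This is an illustration under stated assumptions: the class names, field names, the `split` labels, and the `group_by_split` helper are hypothetical, and attaching the segmentation ground truth to each scene-sketch pair is one possible arrangement rather than the dataset's documented layout.

```python
from collections import defaultdict
from dataclasses import dataclass
from pathlib import Path
from typing import Optional


@dataclass(frozen=True)
class CompositePair:
    """{foreground image & background sketch, scene image}; names are assumed."""
    composite_input: Path   # foreground image combined with background sketch
    scene_image: Path
    split: str              # assumed labels, e.g. "train" or "val"


@dataclass(frozen=True)
class SceneSketchPair:
    """{scene sketch, scene image} plus segmentation ground truth; names are assumed."""
    scene_sketch: Path
    scene_image: Path
    segmentation: Optional[Path]  # ground truth for the scene sketch, if available
    split: str


def group_by_split(examples):
    """Group examples by their split label (hypothetical helper)."""
    groups = defaultdict(list)
    for example in examples:
        groups[example.split].append(example)
    return dict(groups)
```

Keeping the split on each record makes it straightforward to treat validation images separately, which matters here because some of them originate from COCO-Stuff training images.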
Source: https://github.com/sysu-imsl/SketchyCOCO
Image source: https://arxiv.org/pdf/2003.02683v5.pdf