The task is to retrieve images from a database given a multi-modal (image-text) query. Specifically, the query text describes a modification to the query image, and the goal is to retrieve database images that show the desired modification.
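To make the task setup concrete, here is a minimal sketch in Python. It assumes images and text are already mapped into a shared embedding space, and it composes the query by simply adding the two embeddings. That additive composition, the function names (`compose_query`, `retrieve`), and the toy 3-dimensional vectors are all illustrative assumptions, not the method of any particular benchmark entry; practical systems typically learn the composition function.

```python
import math

def normalize(v):
    """Scale a vector to unit length (returned unchanged if it is all zeros)."""
    norm = math.sqrt(sum(x * x for x in v))
    return [x / norm for x in v] if norm > 0 else list(v)

def compose_query(image_emb, text_emb):
    """Hypothetical composition: add the image and text embeddings.
    Assumption for illustration only; real models learn this step."""
    return normalize([a + b for a, b in zip(image_emb, text_emb)])

def retrieve(query_emb, database, k=2):
    """Rank database images by cosine similarity to the composed query."""
    q = normalize(query_emb)
    scored = []
    for name, emb in database.items():
        e = normalize(emb)
        scored.append((sum(a * b for a, b in zip(q, e)), name))
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return [name for _, name in scored[:k]]

# Toy embeddings (made up): axis 0 ~ "red", axis 1 ~ "blue", axis 2 ~ "shirt".
database = {
    "red_dress": [1.0, 0.0, 0.0],
    "blue_dress": [0.0, 1.0, 0.0],
    "blue_shirt": [0.0, 0.6, 0.8],
}

query_image = [1.0, 0.0, 0.0]    # embedding of a red dress image
modification = [-1.0, 1.0, 0.0]  # embedding of the text "make it blue"

query = compose_query(query_image, modification)
results = retrieve(query, database, k=2)
print(results)
```

In this toy setting the composed query moves away from the original image and toward images that carry the requested change, which is the behavior the task is meant to evaluate.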
Benchmarks
Image Retrieval with Multi-Modal Query on COCO 2014