Zhao Zhang, Wenda Jin, Jun Xu, Ming-Ming Cheng
Co-saliency detection (Co-SOD) aims to segment the common salient foreground in a group of relevant images. In this paper, inspired by human behavior, we propose a gradient-induced co-saliency detection (GICD) method. We first abstract a consensus representation for the grouped images in the embedding space; then, by comparing each single image with the consensus representation, we use the feedback gradient information to direct more attention to the discriminative co-salient features. In addition, because Co-SOD training data is scarce, we design a jigsaw training strategy with which Co-SOD networks can be trained on general saliency datasets without extra pixel-level annotations. To evaluate how well Co-SOD methods discover the co-salient object among multiple foregrounds, we construct a challenging CoCA dataset, in which each image contains at least one extraneous foreground along with the co-salient object. Experiments demonstrate that our GICD achieves state-of-the-art performance. Our code and dataset are available at https://mmcheng.net/gicd/.
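The consensus-then-gradient idea described in the abstract can be illustrated with a small toy sketch. This is not the GICD network: it assumes, purely for illustration, that an image embedding is the global average pool of a feature map, that the consensus is the L2-normalized mean embedding of the group, and that the similarity is a plain dot product, so the gradient can be written analytically instead of being obtained by backpropagation. All function names here are hypothetical.

```python
import math


def global_avg_pool(feature_map):
    """Average each channel; feature_map is a list of C flat channels."""
    return [sum(channel) / len(channel) for channel in feature_map]


def consensus_embedding(group):
    """L2-normalized mean of the per-image embeddings in a group (assumed form)."""
    embeddings = [global_avg_pool(fm) for fm in group]
    dims = len(embeddings[0])
    mean = [sum(e[c] for e in embeddings) / len(embeddings) for c in range(dims)]
    norm = math.sqrt(sum(v * v for v in mean))
    return [v / norm for v in mean] if norm > 0 else mean


def similarity_gradient(feature_map, consensus):
    """Gradient of s = <GAP(F), consensus> with respect to every activation of F.

    The consensus is treated as a constant, so ds/dF[c][i] = consensus[c] / (H*W).
    A real network would obtain this gradient by automatic differentiation.
    """
    return [[consensus[c] / len(channel)] * len(channel)
            for c, channel in enumerate(feature_map)]


def gradient_induced_features(feature_map, consensus):
    """Reweight channels by the ReLU of their spatially averaged gradient."""
    grads = similarity_gradient(feature_map, consensus)
    weights = [max(0.0, sum(g) / len(g)) for g in grads]
    induced = [[w * a for a in channel] for w, channel in zip(weights, feature_map)]
    return weights, induced


# Toy group: channel 0 responds to the shared object in every image,
# channel 1 responds to a distractor that appears only in some images.
group = [
    [[4.0, 4.0], [2.0, 0.0]],
    [[4.0, 4.0], [0.0, 0.0]],
    [[4.0, 4.0], [0.0, 2.0]],
]
consensus = consensus_embedding(group)
weights, induced = gradient_induced_features(group[0], consensus)
print("consensus:", consensus)
print("channel weights:", weights)
print("induced features:", induced)
```

In this toy setting the channel that fires consistently across the group receives the larger weight, while the distractor channel is scaled down. That is the qualitative behavior the abstract attributes to the gradient feedback.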
| Task | Dataset | Metric | Value | Model |
|---|---|---|---|---|
| Saliency Detection | CoSOD3k | MAE | 0.079 | GICD |
| Saliency Detection | CoSOD3k | S-measure | 0.797 | GICD |
| Saliency Detection | CoSOD3k | max E-measure | 0.848 | GICD |
| Saliency Detection | CoSOD3k | max F-measure | 0.77 | GICD |
| Saliency Detection | CoSOD3k | mean E-measure | 0.845 | GICD |
| Saliency Detection | CoSOD3k | mean F-measure | 0.763 | GICD |
| Saliency Detection | CoCA | MAE | 0.126 | GICD |
| Saliency Detection | CoCA | mean F-measure | 0.504 | GICD |
| Saliency Detection | CoCA | S-measure | 0.658 | GICD |
| Saliency Detection | CoCA | max E-measure | 0.715 | GICD |
| Saliency Detection | CoCA | max F-measure | 0.513 | GICD |
| Saliency Detection | CoCA | mean E-measure | 0.701 | GICD |
| Saliency Detection | CoSal2015 | MAE | 0.071 | GICD |
| Saliency Detection | CoSal2015 | S-measure | 0.844 | GICD |
| Saliency Detection | CoSal2015 | max E-measure | 0.887 | GICD |
| Saliency Detection | CoSal2015 | max F-measure | 0.844 | GICD |
| Saliency Detection | CoSal2015 | mean E-measure | 0.883 | GICD |
| Saliency Detection | CoSal2015 | mean F-measure | 0.835 | GICD |