TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/UC-Net: Uncertainty Inspired RGB-D Saliency Detection via ...

UC-Net: Uncertainty Inspired RGB-D Saliency Detection via Conditional Variational Autoencoders

Jing Zhang, Deng-Ping Fan, Yuchao Dai, Saeed Anwar, Fatemeh Sadat Saleh, Tong Zhang, Nick Barnes

2020-04-13CVPR 2020 6Thermal Image SegmentationRGB-D Salient Object DetectionSaliency Detection
PaperPDFCode

Abstract

In this paper, we propose the first framework (UCNet) to employ uncertainty for RGB-D saliency detection by learning from the data labeling process. Existing RGB-D saliency detection methods treat the saliency detection task as a point estimation problem, and produce a single saliency map following a deterministic learning pipeline. Inspired by the saliency data labeling process, we propose probabilistic RGB-D saliency detection network via conditional variational autoencoders to model human annotation uncertainty and generate multiple saliency maps for each input image by sampling in the latent space. With the proposed saliency consensus process, we are able to generate an accurate saliency map based on these multiple predictions. Quantitative and qualitative evaluations on six challenging benchmark datasets against 18 competing algorithms demonstrate the effectiveness of our approach in learning the distribution of saliency maps, leading to a new state-of-the-art in RGB-D saliency detection.

Results

TaskDatasetMetricValueModel
Semantic SegmentationRGB-T-Glass-SegmentationMAE0.071UCNet
Object DetectionNJU2KAverage MAE0.043UC-Net
Object DetectionNJU2KS-Measure89.7UC-Net
Object DetectionSTEREAverage MAE0.039UC-Net
Object DetectionSTERES-Measure90.3UC-Net
Object DetectionLFSDAverage MAE0.066UC-Net
Object DetectionLFSDS-Measure86.4UC-Net
Object DetectionSIPAverage MAE0.051UC-Net
Object DetectionSIPS-Measure87.5UC-Net
Object DetectionNLPRAverage MAE0.025UC-Net
Object DetectionNLPRS-Measure92UC-Net
Object DetectionDESAverage MAE0.019UC-Net
Object DetectionDESS-Measure93.4UC-Net
3DNJU2KAverage MAE0.043UC-Net
3DNJU2KS-Measure89.7UC-Net
3DSTEREAverage MAE0.039UC-Net
3DSTERES-Measure90.3UC-Net
3DLFSDAverage MAE0.066UC-Net
3DLFSDS-Measure86.4UC-Net
3DSIPAverage MAE0.051UC-Net
3DSIPS-Measure87.5UC-Net
3DNLPRAverage MAE0.025UC-Net
3DNLPRS-Measure92UC-Net
3DDESAverage MAE0.019UC-Net
3DDESS-Measure93.4UC-Net
2D ClassificationNJU2KAverage MAE0.043UC-Net
2D ClassificationNJU2KS-Measure89.7UC-Net
2D ClassificationSTEREAverage MAE0.039UC-Net
2D ClassificationSTERES-Measure90.3UC-Net
2D ClassificationLFSDAverage MAE0.066UC-Net
2D ClassificationLFSDS-Measure86.4UC-Net
2D ClassificationSIPAverage MAE0.051UC-Net
2D ClassificationSIPS-Measure87.5UC-Net
2D ClassificationNLPRAverage MAE0.025UC-Net
2D ClassificationNLPRS-Measure92UC-Net
2D ClassificationDESAverage MAE0.019UC-Net
2D ClassificationDESS-Measure93.4UC-Net
Scene SegmentationRGB-T-Glass-SegmentationMAE0.071UCNet
2D Object DetectionNJU2KAverage MAE0.043UC-Net
2D Object DetectionNJU2KS-Measure89.7UC-Net
2D Object DetectionSTEREAverage MAE0.039UC-Net
2D Object DetectionSTERES-Measure90.3UC-Net
2D Object DetectionLFSDAverage MAE0.066UC-Net
2D Object DetectionLFSDS-Measure86.4UC-Net
2D Object DetectionSIPAverage MAE0.051UC-Net
2D Object DetectionSIPS-Measure87.5UC-Net
2D Object DetectionNLPRAverage MAE0.025UC-Net
2D Object DetectionNLPRS-Measure92UC-Net
2D Object DetectionDESAverage MAE0.019UC-Net
2D Object DetectionDESS-Measure93.4UC-Net
2D Object DetectionRGB-T-Glass-SegmentationMAE0.071UCNet
10-shot image generationRGB-T-Glass-SegmentationMAE0.071UCNet
16kNJU2KAverage MAE0.043UC-Net
16kNJU2KS-Measure89.7UC-Net
16kSTEREAverage MAE0.039UC-Net
16kSTERES-Measure90.3UC-Net
16kLFSDAverage MAE0.066UC-Net
16kLFSDS-Measure86.4UC-Net
16kSIPAverage MAE0.051UC-Net
16kSIPS-Measure87.5UC-Net
16kNLPRAverage MAE0.025UC-Net
16kNLPRS-Measure92UC-Net
16kDESAverage MAE0.019UC-Net
16kDESS-Measure93.4UC-Net

Related Papers

Feature Hallucination for Self-supervised Action Recognition2025-06-25Boosting Cross-spectral Unsupervised Domain Adaptation for Thermal Semantic Segmentation2025-05-11Lightweight RGB-D Salient Object Detection from a Speed-Accuracy Tradeoff Perspective2025-05-07Low-Rate Semantic Communication with Codebook-based Conditional Generative Models2025-04-07Collaborative Temporal Consistency Learning for Point-supervised Natural Language Video Localization2025-03-22A Deep Learning Framework for Visual Attention Prediction and Analysis of News Interfaces2025-03-21Unveiling the Potential of Segment Anything Model 2 for RGB-Thermal Semantic Segmentation with Language Guidance2025-03-04Dual Mutual Learning Network with Global-local Awareness for RGB-D Salient Object Detection2025-01-03