TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Bifurcated backbone strategy for RGB-D salient object dete...

Bifurcated backbone strategy for RGB-D salient object detection

Yingjie Zhai, Deng-Ping Fan, Jufeng Yang, Ali Borji, Ling Shao, Junwei Han, Liang Wang

2020-07-06Salient Object DetectionRGB-D Salient Object Detectionobject-detectionObject DetectionRGB Salient Object Detection
PaperPDFCode(official)Code

Abstract

Multi-level feature fusion is a fundamental topic in computer vision. It has been exploited to detect, segment and classify objects at various scales. When multi-level features meet multi-modal cues, the optimal feature aggregation and multi-modal learning strategy become a hot potato. In this paper, we leverage the inherent multi-modal and multi-level nature of RGB-D salient object detection to devise a novel cascaded refinement network. In particular, first, we propose to regroup the multi-level features into teacher and student features using a bifurcated backbone strategy (BBS). Second, we introduce a depth-enhanced module (DEM) to excavate informative depth cues from the channel and spatial views. Then, RGB and depth modalities are fused in a complementary way. Our architecture, named Bifurcated Backbone Strategy Network (BBS-Net), is simple, efficient, and backbone-independent. Extensive experiments show that BBS-Net significantly outperforms eighteen SOTA models on eight challenging datasets under five evaluation measures, demonstrating the superiority of our approach ($\sim 4 \%$ improvement in S-measure $vs.$ the top-ranked model: DMRA-iccv2019). In addition, we provide a comprehensive analysis on the generalization ability of different RGB-D datasets and provide a powerful training set for future research.

Results

TaskDatasetMetricValueModel
Object DetectionSTEREAverage MAE0.041BBS-Net
Object DetectionSTERES-Measure90.8BBS-Net
Object DetectionSTEREmax E-Measure94.2BBS-Net
Object DetectionSTEREmax F-Measure90.3BBS-Net
Object DetectionLFSDAverage MAE0.072BBS-Net
Object DetectionLFSDS-Measure86.4BBS-Net
Object DetectionLFSDmax E-Measure90.1BBS-Net
Object DetectionLFSDmax F-Measure85.8BBS-Net
Object DetectionSIPAverage MAE0.055BBS-Net
Object DetectionSIPS-Measure87.9BBS-Net
Object DetectionSIPmax E-Measure92.2BBS-Net
Object DetectionSIPmax F-Measure88.3BBS-Net
Object DetectionRGBD135Average MAE0.044BBS-Net
Object DetectionRGBD135S-Measure88.2BBS-Net
Object DetectionRGBD135max E-Measure91.9BBS-Net
Object DetectionRGBD135max F-Measure85.9BBS-Net
Object DetectionNLPRAverage MAE0.023BBS-Net
Object DetectionNLPRS-Measure93BBS-Net
Object DetectionNLPRmax E-Measure96.1BBS-Net
Object DetectionNLPRmax F-Measure91.8BBS-Net
Object DetectionDESAverage MAE0.021BBS-Net
Object DetectionDESS-Measure93.3BBS-Net
Object DetectionDESmax E-Measure96.6BBS-Net
Object DetectionDESmax F-Measure92.7BBS-Net
3DSTEREAverage MAE0.041BBS-Net
3DSTERES-Measure90.8BBS-Net
3DSTEREmax E-Measure94.2BBS-Net
3DSTEREmax F-Measure90.3BBS-Net
3DLFSDAverage MAE0.072BBS-Net
3DLFSDS-Measure86.4BBS-Net
3DLFSDmax E-Measure90.1BBS-Net
3DLFSDmax F-Measure85.8BBS-Net
3DSIPAverage MAE0.055BBS-Net
3DSIPS-Measure87.9BBS-Net
3DSIPmax E-Measure92.2BBS-Net
3DSIPmax F-Measure88.3BBS-Net
3DRGBD135Average MAE0.044BBS-Net
3DRGBD135S-Measure88.2BBS-Net
3DRGBD135max E-Measure91.9BBS-Net
3DRGBD135max F-Measure85.9BBS-Net
3DNLPRAverage MAE0.023BBS-Net
3DNLPRS-Measure93BBS-Net
3DNLPRmax E-Measure96.1BBS-Net
3DNLPRmax F-Measure91.8BBS-Net
3DDESAverage MAE0.021BBS-Net
3DDESS-Measure93.3BBS-Net
3DDESmax E-Measure96.6BBS-Net
3DDESmax F-Measure92.7BBS-Net
2D ClassificationSTEREAverage MAE0.041BBS-Net
2D ClassificationSTERES-Measure90.8BBS-Net
2D ClassificationSTEREmax E-Measure94.2BBS-Net
2D ClassificationSTEREmax F-Measure90.3BBS-Net
2D ClassificationLFSDAverage MAE0.072BBS-Net
2D ClassificationLFSDS-Measure86.4BBS-Net
2D ClassificationLFSDmax E-Measure90.1BBS-Net
2D ClassificationLFSDmax F-Measure85.8BBS-Net
2D ClassificationSIPAverage MAE0.055BBS-Net
2D ClassificationSIPS-Measure87.9BBS-Net
2D ClassificationSIPmax E-Measure92.2BBS-Net
2D ClassificationSIPmax F-Measure88.3BBS-Net
2D ClassificationRGBD135Average MAE0.044BBS-Net
2D ClassificationRGBD135S-Measure88.2BBS-Net
2D ClassificationRGBD135max E-Measure91.9BBS-Net
2D ClassificationRGBD135max F-Measure85.9BBS-Net
2D ClassificationNLPRAverage MAE0.023BBS-Net
2D ClassificationNLPRS-Measure93BBS-Net
2D ClassificationNLPRmax E-Measure96.1BBS-Net
2D ClassificationNLPRmax F-Measure91.8BBS-Net
2D ClassificationDESAverage MAE0.021BBS-Net
2D ClassificationDESS-Measure93.3BBS-Net
2D ClassificationDESmax E-Measure96.6BBS-Net
2D ClassificationDESmax F-Measure92.7BBS-Net
2D Object DetectionSTEREAverage MAE0.041BBS-Net
2D Object DetectionSTERES-Measure90.8BBS-Net
2D Object DetectionSTEREmax E-Measure94.2BBS-Net
2D Object DetectionSTEREmax F-Measure90.3BBS-Net
2D Object DetectionLFSDAverage MAE0.072BBS-Net
2D Object DetectionLFSDS-Measure86.4BBS-Net
2D Object DetectionLFSDmax E-Measure90.1BBS-Net
2D Object DetectionLFSDmax F-Measure85.8BBS-Net
2D Object DetectionSIPAverage MAE0.055BBS-Net
2D Object DetectionSIPS-Measure87.9BBS-Net
2D Object DetectionSIPmax E-Measure92.2BBS-Net
2D Object DetectionSIPmax F-Measure88.3BBS-Net
2D Object DetectionRGBD135Average MAE0.044BBS-Net
2D Object DetectionRGBD135S-Measure88.2BBS-Net
2D Object DetectionRGBD135max E-Measure91.9BBS-Net
2D Object DetectionRGBD135max F-Measure85.9BBS-Net
2D Object DetectionNLPRAverage MAE0.023BBS-Net
2D Object DetectionNLPRS-Measure93BBS-Net
2D Object DetectionNLPRmax E-Measure96.1BBS-Net
2D Object DetectionNLPRmax F-Measure91.8BBS-Net
2D Object DetectionDESAverage MAE0.021BBS-Net
2D Object DetectionDESS-Measure93.3BBS-Net
2D Object DetectionDESmax E-Measure96.6BBS-Net
2D Object DetectionDESmax F-Measure92.7BBS-Net
16kSTEREAverage MAE0.041BBS-Net
16kSTERES-Measure90.8BBS-Net
16kSTEREmax E-Measure94.2BBS-Net
16kSTEREmax F-Measure90.3BBS-Net
16kLFSDAverage MAE0.072BBS-Net
16kLFSDS-Measure86.4BBS-Net
16kLFSDmax E-Measure90.1BBS-Net
16kLFSDmax F-Measure85.8BBS-Net
16kSIPAverage MAE0.055BBS-Net
16kSIPS-Measure87.9BBS-Net
16kSIPmax E-Measure92.2BBS-Net
16kSIPmax F-Measure88.3BBS-Net
16kRGBD135Average MAE0.044BBS-Net
16kRGBD135S-Measure88.2BBS-Net
16kRGBD135max E-Measure91.9BBS-Net
16kRGBD135max F-Measure85.9BBS-Net
16kNLPRAverage MAE0.023BBS-Net
16kNLPRS-Measure93BBS-Net
16kNLPRmax E-Measure96.1BBS-Net
16kNLPRmax F-Measure91.8BBS-Net
16kDESAverage MAE0.021BBS-Net
16kDESS-Measure93.3BBS-Net
16kDESmax E-Measure96.6BBS-Net
16kDESmax F-Measure92.7BBS-Net

Related Papers

A Real-Time System for Egocentric Hand-Object Interaction Detection in Industrial Domains2025-07-17RS-TinyNet: Stage-wise Feature Fusion Network for Detecting Tiny Objects in Remote Sensing Images2025-07-17Decoupled PROB: Decoupled Query Initialization Tasks and Objectness-Class Learning for Open World Object Detection2025-07-17Dual LiDAR-Based Traffic Movement Count Estimation at a Signalized Intersection: Deployment, Data Collection, and Preliminary Analysis2025-07-17Vision-based Perception for Autonomous Vehicles in Obstacle Avoidance Scenarios2025-07-16Tomato Multi-Angle Multi-Pose Dataset for Fine-Grained Phenotyping2025-07-15ECORE: Energy-Conscious Optimized Routing for Deep Learning Models at the Edge2025-07-08Beyond One Shot, Beyond One Perspective: Cross-View and Long-Horizon Distillation for Better LiDAR Representations2025-07-07