TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/CoLA: Conditional Dropout and Language-driven Robust Dual-...

CoLA: Conditional Dropout and Language-driven Robust Dual-modal Salient Object Detection

Shuang Hao, Chunlin Zhong, He Tang

2024-07-09Salient Object DetectionRGB-D Salient Object Detectionobject-detectionObject DetectionRGB Salient Object DetectionLanguage Modelling
PaperPDFCode(official)

Abstract

The depth/thermal information is beneficial for detecting salient object with conventional RGB images. However, in dual-modal salient object detection (SOD) model, the robustness against noisy inputs and modality missing is crucial but rarely studied. To tackle this problem, we introduce \textbf{Co}nditional Dropout and \textbf{LA}nguage-driven(\textbf{CoLA}) framework comprising two core components. 1) Language-driven Quality Assessment (LQA): Leveraging a pretrained vision-language model with a prompt learner, the LQA recalibrates image contributions without requiring additional quality annotations. This approach effectively mitigates the impact of noisy inputs. 2) Conditional Dropout (CD): A learning method to strengthen the model's adaptability in scenarios with missing modalities, while preserving its performance under complete modalities. The CD serves as a plug-in training scheme that treats modality-missing as conditions, strengthening the overall robustness of various dual-modal SOD models. Extensive experiments demonstrate that the proposed method outperforms state-of-the-art dual-modal SOD models, under both modality-complete and modality-missing conditions. We will release source code upon acceptance.

Results

TaskDatasetMetricValueModel
Object DetectionNJU2KAverage MAE0.029CoLANet
Object DetectionNJU2KS-Measure93.4CoLANet
Object DetectionNJU2Kmax E-Measure94.7CoLANet
Object DetectionNJU2Kmax F-Measure91.3CoLANet
Object DetectionSTEREAverage MAE0.039CoLANet
Object DetectionSTERES-Measure90.8CoLANet
Object DetectionSTEREmax E-Measure94.1CoLANet
Object DetectionSTEREmax F-Measure88.9CoLANet
Object DetectionSIPAverage MAE0.042CoLANet
Object DetectionSIPS-Measure89.5CoLANet
Object DetectionSIPmax E-Measure93.5CoLANet
Object DetectionSIPmax F-Measure89.4CoLANet
Object DetectionNLPRAverage MAE0.021CoLANet
Object DetectionNLPRS-Measure93.5CoLANet
Object DetectionNLPRmax E-Measure95.7CoLANet
Object DetectionNLPRmax F-Measure90.9CoLANet
Object DetectionDESAverage MAE0.018CoLANet
Object DetectionDESS-Measure93.5CoLANet
Object DetectionDESmax E-Measure96.3CoLANet
Object DetectionDESmax F-Measure92.5CoLANet
3DNJU2KAverage MAE0.029CoLANet
3DNJU2KS-Measure93.4CoLANet
3DNJU2Kmax E-Measure94.7CoLANet
3DNJU2Kmax F-Measure91.3CoLANet
3DSTEREAverage MAE0.039CoLANet
3DSTERES-Measure90.8CoLANet
3DSTEREmax E-Measure94.1CoLANet
3DSTEREmax F-Measure88.9CoLANet
3DSIPAverage MAE0.042CoLANet
3DSIPS-Measure89.5CoLANet
3DSIPmax E-Measure93.5CoLANet
3DSIPmax F-Measure89.4CoLANet
3DNLPRAverage MAE0.021CoLANet
3DNLPRS-Measure93.5CoLANet
3DNLPRmax E-Measure95.7CoLANet
3DNLPRmax F-Measure90.9CoLANet
3DDESAverage MAE0.018CoLANet
3DDESS-Measure93.5CoLANet
3DDESmax E-Measure96.3CoLANet
3DDESmax F-Measure92.5CoLANet
2D ClassificationNJU2KAverage MAE0.029CoLANet
2D ClassificationNJU2KS-Measure93.4CoLANet
2D ClassificationNJU2Kmax E-Measure94.7CoLANet
2D ClassificationNJU2Kmax F-Measure91.3CoLANet
2D ClassificationSTEREAverage MAE0.039CoLANet
2D ClassificationSTERES-Measure90.8CoLANet
2D ClassificationSTEREmax E-Measure94.1CoLANet
2D ClassificationSTEREmax F-Measure88.9CoLANet
2D ClassificationSIPAverage MAE0.042CoLANet
2D ClassificationSIPS-Measure89.5CoLANet
2D ClassificationSIPmax E-Measure93.5CoLANet
2D ClassificationSIPmax F-Measure89.4CoLANet
2D ClassificationNLPRAverage MAE0.021CoLANet
2D ClassificationNLPRS-Measure93.5CoLANet
2D ClassificationNLPRmax E-Measure95.7CoLANet
2D ClassificationNLPRmax F-Measure90.9CoLANet
2D ClassificationDESAverage MAE0.018CoLANet
2D ClassificationDESS-Measure93.5CoLANet
2D ClassificationDESmax E-Measure96.3CoLANet
2D ClassificationDESmax F-Measure92.5CoLANet
2D Object DetectionNJU2KAverage MAE0.029CoLANet
2D Object DetectionNJU2KS-Measure93.4CoLANet
2D Object DetectionNJU2Kmax E-Measure94.7CoLANet
2D Object DetectionNJU2Kmax F-Measure91.3CoLANet
2D Object DetectionSTEREAverage MAE0.039CoLANet
2D Object DetectionSTERES-Measure90.8CoLANet
2D Object DetectionSTEREmax E-Measure94.1CoLANet
2D Object DetectionSTEREmax F-Measure88.9CoLANet
2D Object DetectionSIPAverage MAE0.042CoLANet
2D Object DetectionSIPS-Measure89.5CoLANet
2D Object DetectionSIPmax E-Measure93.5CoLANet
2D Object DetectionSIPmax F-Measure89.4CoLANet
2D Object DetectionNLPRAverage MAE0.021CoLANet
2D Object DetectionNLPRS-Measure93.5CoLANet
2D Object DetectionNLPRmax E-Measure95.7CoLANet
2D Object DetectionNLPRmax F-Measure90.9CoLANet
2D Object DetectionDESAverage MAE0.018CoLANet
2D Object DetectionDESS-Measure93.5CoLANet
2D Object DetectionDESmax E-Measure96.3CoLANet
2D Object DetectionDESmax F-Measure92.5CoLANet
16kNJU2KAverage MAE0.029CoLANet
16kNJU2KS-Measure93.4CoLANet
16kNJU2Kmax E-Measure94.7CoLANet
16kNJU2Kmax F-Measure91.3CoLANet
16kSTEREAverage MAE0.039CoLANet
16kSTERES-Measure90.8CoLANet
16kSTEREmax E-Measure94.1CoLANet
16kSTEREmax F-Measure88.9CoLANet
16kSIPAverage MAE0.042CoLANet
16kSIPS-Measure89.5CoLANet
16kSIPmax E-Measure93.5CoLANet
16kSIPmax F-Measure89.4CoLANet
16kNLPRAverage MAE0.021CoLANet
16kNLPRS-Measure93.5CoLANet
16kNLPRmax E-Measure95.7CoLANet
16kNLPRmax F-Measure90.9CoLANet
16kDESAverage MAE0.018CoLANet
16kDESS-Measure93.5CoLANet
16kDESmax E-Measure96.3CoLANet
16kDESmax F-Measure92.5CoLANet

Related Papers

Visual-Language Model Knowledge Distillation Method for Image Quality Assessment2025-07-21A Real-Time System for Egocentric Hand-Object Interaction Detection in Industrial Domains2025-07-17RS-TinyNet: Stage-wise Feature Fusion Network for Detecting Tiny Objects in Remote Sensing Images2025-07-17Decoupled PROB: Decoupled Query Initialization Tasks and Objectness-Class Learning for Open World Object Detection2025-07-17Dual LiDAR-Based Traffic Movement Count Estimation at a Signalized Intersection: Deployment, Data Collection, and Preliminary Analysis2025-07-17Making Language Model a Hierarchical Classifier and Generator2025-07-17VisionThink: Smart and Efficient Vision Language Model via Reinforcement Learning2025-07-17The Generative Energy Arena (GEA): Incorporating Energy Awareness in Large Language Model (LLM) Human Evaluations2025-07-17