TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Robust Scene Change Detection Using Visual Foundation Mode...

Robust Scene Change Detection Using Visual Foundation Models and Cross-Attention Mechanisms

Chun-Jung Lin, Sourav Garg, Tat-Jun Chin, Feras Dayoub

2024-09-25Change DetectionScene Change Detection
PaperPDFCode(official)

Abstract

We present a novel method for scene change detection that leverages the robust feature extraction capabilities of a visual foundational model, DINOv2, and integrates full-image cross-attention to address key challenges such as varying lighting, seasonal variations, and viewpoint differences. In order to effectively learn correspondences and mis-correspondences between an image pair for the change detection task, we propose to a) ``freeze'' the backbone in order to retain the generality of dense foundation features, and b) employ ``full-image'' cross-attention to better tackle the viewpoint variations between the image pair. We evaluate our approach on two benchmark datasets, VL-CMU-CD and PSCD, along with their viewpoint-varied versions. Our experiments demonstrate significant improvements in F1-score, particularly in scenarios involving geometric changes between image pairs. The results indicate our method's superior generalization capabilities over existing state-of-the-art approaches, showing robustness against photometric and geometric variations as well as better overall generalization when fine-tuned to adapt to new environments. Detailed ablation studies further validate the contributions of each component in our architecture. Source code will be made publicly available upon acceptance.

Results

TaskDatasetMetricValueModel
Scene Change DetectionUnaligned-VL-CMU-CD (neighbor distance 2)F1-score0.784Robust-Scene-Change-Detection (Diff-View Augmentation)
Scene Change DetectionUnaligned-VL-CMU-CD (neighbor distance 2)F1-score0.739Robust-Scene-Change-Detection

Related Papers

Precision Spatio-Temporal Feature Fusion for Robust Remote Sensing Change Detection2025-07-15Be the Change You Want to See: Revisiting Remote Sensing Change Detection Practices2025-07-04Be the Change You Want to See: Revisiting Remote Sensing Change Detection Practices2025-07-04Pushing Trade-Off Boundaries: Compact yet Effective Remote Sensing Change Detection2025-06-26CL-Splats: Continual Learning of Gaussian Splatting with Local Optimization2025-06-26HydroChronos: Forecasting Decades of Surface Water Change2025-06-17Active InSAR monitoring of building damage in Gaza during the Israel-Hamas War2025-06-17Revisiting Clustering of Neural Bandits: Selective Reinitialization for Mitigating Loss of Plasticity2025-06-14