TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Amplitude-Phase Recombination: Rethinking Robustness of Co...

Amplitude-Phase Recombination: Rethinking Robustness of Convolutional Neural Networks in Frequency Domain

Guangyao Chen, Peixi Peng, Li Ma, Jia Li, Lin Du, Yonghong Tian

2021-08-19ICCV 2021 10Data AugmentationDomain GeneralizationOut-of-Distribution DetectionAdversarial Attack
PaperPDFCode(official)

Abstract

Recently, the generalization behavior of Convolutional Neural Networks (CNN) is gradually transparent through explanation techniques with the frequency components decomposition. However, the importance of the phase spectrum of the image for a robust vision system is still ignored. In this paper, we notice that the CNN tends to converge at the local optimum which is closely related to the high-frequency components of the training images, while the amplitude spectrum is easily disturbed such as noises or common corruptions. In contrast, more empirical studies found that humans rely on more phase components to achieve robust recognition. This observation leads to more explanations of the CNN's generalization behaviors in both robustness to common perturbations and out-of-distribution detection, and motivates a new perspective on data augmentation designed by re-combing the phase spectrum of the current image and the amplitude spectrum of the distracter image. That is, the generated samples force the CNN to pay more attention to the structured information from phase components and keep robust to the variation of the amplitude. Experiments on several image datasets indicate that the proposed method achieves state-of-the-art performances on multiple generalizations and calibration tasks, including adaptability for common corruptions and surface variations, out-of-distribution detection, and adversarial attack.

Results

TaskDatasetMetricValueModel
Domain AdaptationImageNet-Cmean Corruption Error (mCE)57.5APR-SP + DeepAugment (ResNet-50)
Domain AdaptationImageNet-Cmean Corruption Error (mCE)65APR-SP (ResNet-50)
Out-of-Distribution DetectionCIFAR-10AUROC98.1ResNet18 + APR-P
Domain GeneralizationImageNet-Cmean Corruption Error (mCE)57.5APR-SP + DeepAugment (ResNet-50)
Domain GeneralizationImageNet-Cmean Corruption Error (mCE)65APR-SP (ResNet-50)

Related Papers

Overview of the TalentCLEF 2025: Skill and Job Title Intelligence for Human Capital Management2025-07-17Pixel Perfect MegaMed: A Megapixel-Scale Vision-Language Foundation Model for Generating High Resolution Medical Images2025-07-17Simulate, Refocus and Ensemble: An Attention-Refocusing Scheme for Domain Generalization2025-07-17GLAD: Generalizable Tuning for Vision-Language Models2025-07-17MoTM: Towards a Foundation Model for Time Series Imputation based on Continuous Modeling2025-07-17Similarity-Guided Diffusion for Contrastive Sequential Recommendation2025-07-16InstructFLIP: Exploring Unified Vision-Language Model for Face Anti-spoofing2025-07-16Data Augmentation in Time Series Forecasting through Inverted Framework2025-07-15