TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Making Convolutional Networks Shift-Invariant Again

Making Convolutional Networks Shift-Invariant Again

Richard Zhang

2019-04-25Image ClassificationDomain GeneralizationImage GenerationConditional Image Generation
PaperPDFCodeCodeCodeCodeCode(official)CodeCode

Abstract

Modern convolutional networks are not shift-invariant, as small input shifts or translations can cause drastic changes in the output. Commonly used downsampling methods, such as max-pooling, strided-convolution, and average-pooling, ignore the sampling theorem. The well-known signal processing fix is anti-aliasing by low-pass filtering before downsampling. However, simply inserting this module into deep networks degrades performance; as a result, it is seldomly used today. We show that when integrated correctly, it is compatible with existing architectural components, such as max-pooling and strided-convolution. We observe \textit{increased accuracy} in ImageNet classification, across several commonly-used architectures, such as ResNet, DenseNet, and MobileNet, indicating effective regularization. Furthermore, we observe \textit{better generalization}, in terms of stability and robustness to input corruptions. Our results demonstrate that this classical signal processing technique has been undeservingly overlooked in modern deep networks. Code and anti-aliased versions of popular networks are available at https://richzhang.github.io/antialiased-cnns/ .

Results

TaskDatasetMetricValueModel
Domain AdaptationVizWiz-ClassificationAccuracy - All Images41.7ResNet-101 (lpf3)
Domain AdaptationVizWiz-ClassificationAccuracy - Clean Images46.1ResNet-101 (lpf3)
Domain AdaptationVizWiz-ClassificationAccuracy - Corrupted Images35.7ResNet-101 (lpf3)
Domain AdaptationVizWiz-ClassificationAccuracy - All Images41.5ResNet-50 (lpf5)
Domain AdaptationVizWiz-ClassificationAccuracy - Clean Images45.3ResNet-50 (lpf5)
Domain AdaptationVizWiz-ClassificationAccuracy - Corrupted Images35.2ResNet-50 (lpf5)
Domain AdaptationVizWiz-ClassificationAccuracy - All Images41.1ResNet-101 (lpf2)
Domain AdaptationVizWiz-ClassificationAccuracy - Clean Images45.2ResNet-101 (lpf2)
Domain AdaptationVizWiz-ClassificationAccuracy - Corrupted Images35.1ResNet-101 (lpf2)
Domain AdaptationVizWiz-ClassificationAccuracy - All Images41ResNet-101 (lpf5)
Domain AdaptationVizWiz-ClassificationAccuracy - Clean Images45.8ResNet-101 (lpf5)
Domain AdaptationVizWiz-ClassificationAccuracy - Corrupted Images34.8ResNet-101 (lpf5)
Domain AdaptationVizWiz-ClassificationAccuracy - All Images40.3ResNet-50 (lpf2)
Domain AdaptationVizWiz-ClassificationAccuracy - Clean Images45.1ResNet-50 (lpf2)
Domain AdaptationVizWiz-ClassificationAccuracy - Corrupted Images33.4ResNet-50 (lpf2)
Domain AdaptationVizWiz-ClassificationAccuracy - All Images40ResNet-50 (lpf3)
Domain AdaptationVizWiz-ClassificationAccuracy - Clean Images44.7ResNet-50 (lpf3)
Domain AdaptationVizWiz-ClassificationAccuracy - Corrupted Images34.3ResNet-50 (lpf3)
Domain AdaptationVizWiz-ClassificationAccuracy - All Images38.7DenseNet121 (lpf5)
Domain AdaptationVizWiz-ClassificationAccuracy - Clean Images42.7DenseNet121 (lpf5)
Domain AdaptationVizWiz-ClassificationAccuracy - Corrupted Images32DenseNet121 (lpf5)
Domain AdaptationVizWiz-ClassificationAccuracy - All Images38.3ResNet-34 (lpf2)
Domain AdaptationVizWiz-ClassificationAccuracy - Clean Images42.8ResNet-34 (lpf2)
Domain AdaptationVizWiz-ClassificationAccuracy - Corrupted Images32.4ResNet-34 (lpf2)
Domain AdaptationVizWiz-ClassificationAccuracy - All Images38.3DenseNet-121 (lpf3)
Domain AdaptationVizWiz-ClassificationAccuracy - Clean Images42.8DenseNet-121 (lpf3)
Domain AdaptationVizWiz-ClassificationAccuracy - Corrupted Images32.3DenseNet-121 (lpf3)
Domain AdaptationVizWiz-ClassificationAccuracy - All Images38.3ResNet-34 (lpf3)
Domain AdaptationVizWiz-ClassificationAccuracy - Clean Images42.9ResNet-34 (lpf3)
Domain AdaptationVizWiz-ClassificationAccuracy - Corrupted Images31.9ResNet-34 (lpf3)
Domain AdaptationVizWiz-ClassificationAccuracy - All Images38.3DenseNet-121 (lpf2)
Domain AdaptationVizWiz-ClassificationAccuracy - Clean Images43.1DenseNet-121 (lpf2)
Domain AdaptationVizWiz-ClassificationAccuracy - Corrupted Images31.7DenseNet-121 (lpf2)
Domain AdaptationVizWiz-ClassificationAccuracy - All Images37.2VGG-16 BN (lpf2)
Domain AdaptationVizWiz-ClassificationAccuracy - Clean Images41.8VGG-16 BN (lpf2)
Domain AdaptationVizWiz-ClassificationAccuracy - Corrupted Images31.3VGG-16 BN (lpf2)
Domain AdaptationVizWiz-ClassificationAccuracy - All Images37.2ResNet-34 (lpf5)
Domain AdaptationVizWiz-ClassificationAccuracy - Clean Images42.5ResNet-34 (lpf5)
Domain AdaptationVizWiz-ClassificationAccuracy - Corrupted Images29.9ResNet-34 (lpf5)
Domain AdaptationVizWiz-ClassificationAccuracy - All Images37VGG-16 BN (lpf5)
Domain AdaptationVizWiz-ClassificationAccuracy - Clean Images41.7VGG-16 BN (lpf5)
Domain AdaptationVizWiz-ClassificationAccuracy - Corrupted Images30.8VGG-16 BN (lpf5)
Domain AdaptationVizWiz-ClassificationAccuracy - All Images36.9VGG-16 BN (lpf3)
Domain AdaptationVizWiz-ClassificationAccuracy - Clean Images42.1VGG-16 BN (lpf3)
Domain AdaptationVizWiz-ClassificationAccuracy - Corrupted Images30.6VGG-16 BN (lpf3)
Domain AdaptationVizWiz-ClassificationAccuracy - All Images36MobileNetV2 (lpf3)
Domain AdaptationVizWiz-ClassificationAccuracy - Clean Images40.3MobileNetV2 (lpf3)
Domain AdaptationVizWiz-ClassificationAccuracy - Corrupted Images30.4MobileNetV2 (lpf3)
Domain AdaptationVizWiz-ClassificationAccuracy - All Images35.8MobileNetV2 (lpf5)
Domain AdaptationVizWiz-ClassificationAccuracy - Clean Images40.1MobileNetV2 (lpf5)
Domain AdaptationVizWiz-ClassificationAccuracy - Corrupted Images29.1MobileNetV2 (lpf5)
Domain AdaptationVizWiz-ClassificationAccuracy - All Images35.6ResNet-18 (lpf3)
Domain AdaptationVizWiz-ClassificationAccuracy - Clean Images39.5ResNet-18 (lpf3)
Domain AdaptationVizWiz-ClassificationAccuracy - Corrupted Images28.5ResNet-18 (lpf3)
Domain AdaptationVizWiz-ClassificationAccuracy - All Images35.5MobileNetV2 (lpf2)
Domain AdaptationVizWiz-ClassificationAccuracy - Clean Images39.2MobileNetV2 (lpf2)
Domain AdaptationVizWiz-ClassificationAccuracy - Corrupted Images30.3MobileNetV2 (lpf2)
Domain AdaptationVizWiz-ClassificationAccuracy - All Images35.5ResNet-18 (lpf2)
Domain AdaptationVizWiz-ClassificationAccuracy - Clean Images40.1ResNet-18 (lpf2)
Domain AdaptationVizWiz-ClassificationAccuracy - Corrupted Images28.7ResNet-18 (lpf2)
Domain AdaptationVizWiz-ClassificationAccuracy - All Images35.1VGG-16 (lpf3)
Domain AdaptationVizWiz-ClassificationAccuracy - Clean Images40VGG-16 (lpf3)
Domain AdaptationVizWiz-ClassificationAccuracy - Corrupted Images28.2VGG-16 (lpf3)
Domain AdaptationVizWiz-ClassificationAccuracy - All Images34.7ResNet-18 (lpf5)
Domain AdaptationVizWiz-ClassificationAccuracy - Clean Images38.9ResNet-18 (lpf5)
Domain AdaptationVizWiz-ClassificationAccuracy - Corrupted Images27.7ResNet-18 (lpf5)
Domain AdaptationVizWiz-ClassificationAccuracy - All Images34.5VGG-16 (lpf5)
Domain AdaptationVizWiz-ClassificationAccuracy - Clean Images39.4VGG-16 (lpf5)
Domain AdaptationVizWiz-ClassificationAccuracy - Corrupted Images27.8VGG-16 (lpf5)
Domain AdaptationVizWiz-ClassificationAccuracy - All Images33.5VGG-16 (lpf2)
Domain AdaptationVizWiz-ClassificationAccuracy - Clean Images38.5VGG-16 (lpf2)
Domain AdaptationVizWiz-ClassificationAccuracy - Corrupted Images26.7VGG-16 (lpf2)
Domain AdaptationVizWiz-ClassificationAccuracy - All Images23.1AlexNet (lpf3)
Domain AdaptationVizWiz-ClassificationAccuracy - Clean Images26.8AlexNet (lpf3)
Domain AdaptationVizWiz-ClassificationAccuracy - Corrupted Images17.5AlexNet (lpf3)
Domain AdaptationVizWiz-ClassificationAccuracy - All Images22.8AlexNet (lpf2)
Domain AdaptationVizWiz-ClassificationAccuracy - Clean Images26.8AlexNet (lpf2)
Domain AdaptationVizWiz-ClassificationAccuracy - Corrupted Images18.2AlexNet (lpf2)
Domain AdaptationVizWiz-ClassificationAccuracy - All Images22.7AlexNet (lpf5)
Domain AdaptationVizWiz-ClassificationAccuracy - Clean Images26.8AlexNet (lpf5)
Domain AdaptationVizWiz-ClassificationAccuracy - Corrupted Images18.4AlexNet (lpf5)
Domain GeneralizationVizWiz-ClassificationAccuracy - All Images41.7ResNet-101 (lpf3)
Domain GeneralizationVizWiz-ClassificationAccuracy - Clean Images46.1ResNet-101 (lpf3)
Domain GeneralizationVizWiz-ClassificationAccuracy - Corrupted Images35.7ResNet-101 (lpf3)
Domain GeneralizationVizWiz-ClassificationAccuracy - All Images41.5ResNet-50 (lpf5)
Domain GeneralizationVizWiz-ClassificationAccuracy - Clean Images45.3ResNet-50 (lpf5)
Domain GeneralizationVizWiz-ClassificationAccuracy - Corrupted Images35.2ResNet-50 (lpf5)
Domain GeneralizationVizWiz-ClassificationAccuracy - All Images41.1ResNet-101 (lpf2)
Domain GeneralizationVizWiz-ClassificationAccuracy - Clean Images45.2ResNet-101 (lpf2)
Domain GeneralizationVizWiz-ClassificationAccuracy - Corrupted Images35.1ResNet-101 (lpf2)
Domain GeneralizationVizWiz-ClassificationAccuracy - All Images41ResNet-101 (lpf5)
Domain GeneralizationVizWiz-ClassificationAccuracy - Clean Images45.8ResNet-101 (lpf5)
Domain GeneralizationVizWiz-ClassificationAccuracy - Corrupted Images34.8ResNet-101 (lpf5)
Domain GeneralizationVizWiz-ClassificationAccuracy - All Images40.3ResNet-50 (lpf2)
Domain GeneralizationVizWiz-ClassificationAccuracy - Clean Images45.1ResNet-50 (lpf2)
Domain GeneralizationVizWiz-ClassificationAccuracy - Corrupted Images33.4ResNet-50 (lpf2)
Domain GeneralizationVizWiz-ClassificationAccuracy - All Images40ResNet-50 (lpf3)
Domain GeneralizationVizWiz-ClassificationAccuracy - Clean Images44.7ResNet-50 (lpf3)
Domain GeneralizationVizWiz-ClassificationAccuracy - Corrupted Images34.3ResNet-50 (lpf3)
Domain GeneralizationVizWiz-ClassificationAccuracy - All Images38.7DenseNet121 (lpf5)
Domain GeneralizationVizWiz-ClassificationAccuracy - Clean Images42.7DenseNet121 (lpf5)
Domain GeneralizationVizWiz-ClassificationAccuracy - Corrupted Images32DenseNet121 (lpf5)
Domain GeneralizationVizWiz-ClassificationAccuracy - All Images38.3ResNet-34 (lpf2)
Domain GeneralizationVizWiz-ClassificationAccuracy - Clean Images42.8ResNet-34 (lpf2)
Domain GeneralizationVizWiz-ClassificationAccuracy - Corrupted Images32.4ResNet-34 (lpf2)
Domain GeneralizationVizWiz-ClassificationAccuracy - All Images38.3DenseNet-121 (lpf3)
Domain GeneralizationVizWiz-ClassificationAccuracy - Clean Images42.8DenseNet-121 (lpf3)
Domain GeneralizationVizWiz-ClassificationAccuracy - Corrupted Images32.3DenseNet-121 (lpf3)
Domain GeneralizationVizWiz-ClassificationAccuracy - All Images38.3ResNet-34 (lpf3)
Domain GeneralizationVizWiz-ClassificationAccuracy - Clean Images42.9ResNet-34 (lpf3)
Domain GeneralizationVizWiz-ClassificationAccuracy - Corrupted Images31.9ResNet-34 (lpf3)
Domain GeneralizationVizWiz-ClassificationAccuracy - All Images38.3DenseNet-121 (lpf2)
Domain GeneralizationVizWiz-ClassificationAccuracy - Clean Images43.1DenseNet-121 (lpf2)
Domain GeneralizationVizWiz-ClassificationAccuracy - Corrupted Images31.7DenseNet-121 (lpf2)
Domain GeneralizationVizWiz-ClassificationAccuracy - All Images37.2VGG-16 BN (lpf2)
Domain GeneralizationVizWiz-ClassificationAccuracy - Clean Images41.8VGG-16 BN (lpf2)
Domain GeneralizationVizWiz-ClassificationAccuracy - Corrupted Images31.3VGG-16 BN (lpf2)
Domain GeneralizationVizWiz-ClassificationAccuracy - All Images37.2ResNet-34 (lpf5)
Domain GeneralizationVizWiz-ClassificationAccuracy - Clean Images42.5ResNet-34 (lpf5)
Domain GeneralizationVizWiz-ClassificationAccuracy - Corrupted Images29.9ResNet-34 (lpf5)
Domain GeneralizationVizWiz-ClassificationAccuracy - All Images37VGG-16 BN (lpf5)
Domain GeneralizationVizWiz-ClassificationAccuracy - Clean Images41.7VGG-16 BN (lpf5)
Domain GeneralizationVizWiz-ClassificationAccuracy - Corrupted Images30.8VGG-16 BN (lpf5)
Domain GeneralizationVizWiz-ClassificationAccuracy - All Images36.9VGG-16 BN (lpf3)
Domain GeneralizationVizWiz-ClassificationAccuracy - Clean Images42.1VGG-16 BN (lpf3)
Domain GeneralizationVizWiz-ClassificationAccuracy - Corrupted Images30.6VGG-16 BN (lpf3)
Domain GeneralizationVizWiz-ClassificationAccuracy - All Images36MobileNetV2 (lpf3)
Domain GeneralizationVizWiz-ClassificationAccuracy - Clean Images40.3MobileNetV2 (lpf3)
Domain GeneralizationVizWiz-ClassificationAccuracy - Corrupted Images30.4MobileNetV2 (lpf3)
Domain GeneralizationVizWiz-ClassificationAccuracy - All Images35.8MobileNetV2 (lpf5)
Domain GeneralizationVizWiz-ClassificationAccuracy - Clean Images40.1MobileNetV2 (lpf5)
Domain GeneralizationVizWiz-ClassificationAccuracy - Corrupted Images29.1MobileNetV2 (lpf5)
Domain GeneralizationVizWiz-ClassificationAccuracy - All Images35.6ResNet-18 (lpf3)
Domain GeneralizationVizWiz-ClassificationAccuracy - Clean Images39.5ResNet-18 (lpf3)
Domain GeneralizationVizWiz-ClassificationAccuracy - Corrupted Images28.5ResNet-18 (lpf3)
Domain GeneralizationVizWiz-ClassificationAccuracy - All Images35.5MobileNetV2 (lpf2)
Domain GeneralizationVizWiz-ClassificationAccuracy - Clean Images39.2MobileNetV2 (lpf2)
Domain GeneralizationVizWiz-ClassificationAccuracy - Corrupted Images30.3MobileNetV2 (lpf2)
Domain GeneralizationVizWiz-ClassificationAccuracy - All Images35.5ResNet-18 (lpf2)
Domain GeneralizationVizWiz-ClassificationAccuracy - Clean Images40.1ResNet-18 (lpf2)
Domain GeneralizationVizWiz-ClassificationAccuracy - Corrupted Images28.7ResNet-18 (lpf2)
Domain GeneralizationVizWiz-ClassificationAccuracy - All Images35.1VGG-16 (lpf3)
Domain GeneralizationVizWiz-ClassificationAccuracy - Clean Images40VGG-16 (lpf3)
Domain GeneralizationVizWiz-ClassificationAccuracy - Corrupted Images28.2VGG-16 (lpf3)
Domain GeneralizationVizWiz-ClassificationAccuracy - All Images34.7ResNet-18 (lpf5)
Domain GeneralizationVizWiz-ClassificationAccuracy - Clean Images38.9ResNet-18 (lpf5)
Domain GeneralizationVizWiz-ClassificationAccuracy - Corrupted Images27.7ResNet-18 (lpf5)
Domain GeneralizationVizWiz-ClassificationAccuracy - All Images34.5VGG-16 (lpf5)
Domain GeneralizationVizWiz-ClassificationAccuracy - Clean Images39.4VGG-16 (lpf5)
Domain GeneralizationVizWiz-ClassificationAccuracy - Corrupted Images27.8VGG-16 (lpf5)
Domain GeneralizationVizWiz-ClassificationAccuracy - All Images33.5VGG-16 (lpf2)
Domain GeneralizationVizWiz-ClassificationAccuracy - Clean Images38.5VGG-16 (lpf2)
Domain GeneralizationVizWiz-ClassificationAccuracy - Corrupted Images26.7VGG-16 (lpf2)
Domain GeneralizationVizWiz-ClassificationAccuracy - All Images23.1AlexNet (lpf3)
Domain GeneralizationVizWiz-ClassificationAccuracy - Clean Images26.8AlexNet (lpf3)
Domain GeneralizationVizWiz-ClassificationAccuracy - Corrupted Images17.5AlexNet (lpf3)
Domain GeneralizationVizWiz-ClassificationAccuracy - All Images22.8AlexNet (lpf2)
Domain GeneralizationVizWiz-ClassificationAccuracy - Clean Images26.8AlexNet (lpf2)
Domain GeneralizationVizWiz-ClassificationAccuracy - Corrupted Images18.2AlexNet (lpf2)
Domain GeneralizationVizWiz-ClassificationAccuracy - All Images22.7AlexNet (lpf5)
Domain GeneralizationVizWiz-ClassificationAccuracy - Clean Images26.8AlexNet (lpf5)
Domain GeneralizationVizWiz-ClassificationAccuracy - Corrupted Images18.4AlexNet (lpf5)

Related Papers

Automatic Classification and Segmentation of Tunnel Cracks Based on Deep Learning and Visual Explanations2025-07-18Adversarial attacks to image classification systems using evolutionary algorithms2025-07-17Efficient Adaptation of Pre-trained Vision Transformer underpinned by Approximately Orthogonal Fine-Tuning Strategy2025-07-17Federated Learning for Commercial Image Sources2025-07-17MUPAX: Multidimensional Problem Agnostic eXplainable AI2025-07-17Simulate, Refocus and Ensemble: An Attention-Refocusing Scheme for Domain Generalization2025-07-17GLAD: Generalizable Tuning for Vision-Language Models2025-07-17MoTM: Towards a Foundation Model for Time Series Imputation based on Continuous Modeling2025-07-17