Evgenia Rusak, Steffen Schneider, George Pachitariu, Luisa Eck, Peter Gehler, Oliver Bringmann, Wieland Brendel, Matthias Bethge
We demonstrate that self-learning techniques like entropy minimization and pseudo-labeling are simple and effective at improving performance of a deployed computer vision model under systematic domain shifts. We conduct a wide range of large-scale experiments and show consistent improvements irrespective of the model architecture, the pre-training technique or the type of distribution shift. At the same time, self-learning is simple to use in practice because it does not require knowledge or access to the original training data or scheme, is robust to hyperparameter choices, is straight-forward to implement and requires only a few adaptation epochs. This makes self-learning techniques highly attractive for any practitioner who applies machine learning algorithms in the real world. We present state-of-the-art adaptation results on CIFAR10-C (8.5% error), ImageNet-C (22.0% mCE), ImageNet-R (17.4% error) and ImageNet-A (14.8% error), theoretically study the dynamics of self-supervised adaptation methods and propose a new classification dataset (ImageNet-D) which is challenging even with adaptation.
| Task | Dataset | Metric | Value | Model |
|---|---|---|---|---|
| Domain Adaptation | ImageNet-A | Top 1 Error | 14.8 | EfficientNet-L2 NoisyStudent + RPL |
| Domain Adaptation | ImageNet-R | Top 1 Error | 17.4 | EfficientNet-L2 Noisy Student + RPL |
| Domain Adaptation | ImageNet-R | Top 1 Error | 19.7 | EfficientNet-L2 Noisy Student + ENT |
| Domain Adaptation | ImageNet-R | Top 1 Error | 54.1 | ResNet50 + RPL |
| Domain Adaptation | ImageNet-R | Top 1 Error | 56.1 | ResNet50 + ENT |
| Domain Adaptation | ImageNet-C | mean Corruption Error (mCE) | 22 | EfficientNet-L2+RPL |
| Domain Adaptation | ImageNet-C | mean Corruption Error (mCE) | 23 | EfficientNet-L2+ENT |
| Domain Adaptation | ImageNet-C | mean Corruption Error (mCE) | 34.8 | ResNeXt101 32x8d + DeepAug + Augmix + RPL |
| Domain Adaptation | ImageNet-C | mean Corruption Error (mCE) | 35.5 | ResNeXt101 32x8d + DeepAug + Augmix + ENT |
| Domain Adaptation | ImageNet-C | mean Corruption Error (mCE) | 40.8 | ResNeXt101 32x8d + IG-3.5B + ENT |
| Domain Adaptation | ImageNet-C | mean Corruption Error (mCE) | 40.9 | ResNeXt101 32x8d + IG-3.5B + RPL |
| Domain Adaptation | ImageNet-C | mean Corruption Error (mCE) | 43.2 | ResNeXt101 32x8d + RPL |
| Domain Adaptation | ImageNet-C | mean Corruption Error (mCE) | 44.3 | ResNeXt101 32x8d + ENT |
| Domain Adaptation | ImageNet-C | mean Corruption Error (mCE) | 50.5 | ResNet50 + RPL |
| Domain Adaptation | ImageNet-C | mean Corruption Error (mCE) | 51.6 | ResNet50 + ENT |
| Unsupervised Domain Adaptation | ImageNet-A | Top 1 Error | 14.8 | EfficientNet-L2 NoisyStudent + RPL |
| Unsupervised Domain Adaptation | ImageNet-R | Top 1 Error | 17.4 | EfficientNet-L2 Noisy Student + RPL |
| Unsupervised Domain Adaptation | ImageNet-R | Top 1 Error | 19.7 | EfficientNet-L2 Noisy Student + ENT |
| Unsupervised Domain Adaptation | ImageNet-R | Top 1 Error | 54.1 | ResNet50 + RPL |
| Unsupervised Domain Adaptation | ImageNet-R | Top 1 Error | 56.1 | ResNet50 + ENT |
| Unsupervised Domain Adaptation | ImageNet-C | mean Corruption Error (mCE) | 22 | EfficientNet-L2+RPL |
| Unsupervised Domain Adaptation | ImageNet-C | mean Corruption Error (mCE) | 23 | EfficientNet-L2+ENT |
| Unsupervised Domain Adaptation | ImageNet-C | mean Corruption Error (mCE) | 34.8 | ResNeXt101 32x8d + DeepAug + Augmix + RPL |
| Unsupervised Domain Adaptation | ImageNet-C | mean Corruption Error (mCE) | 35.5 | ResNeXt101 32x8d + DeepAug + Augmix + ENT |
| Unsupervised Domain Adaptation | ImageNet-C | mean Corruption Error (mCE) | 40.8 | ResNeXt101 32x8d + IG-3.5B + ENT |
| Unsupervised Domain Adaptation | ImageNet-C | mean Corruption Error (mCE) | 40.9 | ResNeXt101 32x8d + IG-3.5B + RPL |
| Unsupervised Domain Adaptation | ImageNet-C | mean Corruption Error (mCE) | 43.2 | ResNeXt101 32x8d + RPL |
| Unsupervised Domain Adaptation | ImageNet-C | mean Corruption Error (mCE) | 44.3 | ResNeXt101 32x8d + ENT |
| Unsupervised Domain Adaptation | ImageNet-C | mean Corruption Error (mCE) | 50.5 | ResNet50 + RPL |
| Unsupervised Domain Adaptation | ImageNet-C | mean Corruption Error (mCE) | 51.6 | ResNet50 + ENT |