Geoffrey French, Michal Mackiewicz, Mark Fisher
This paper explores the use of self-ensembling for visual domain adaptation problems. Our technique is derived from the mean teacher variant (Tarvainen et al., 2017) of temporal ensembling (Laine et al;, 2017), a technique that achieved state of the art results in the area of semi-supervised learning. We introduce a number of modifications to their approach for challenging domain adaptation scenarios and evaluate its effectiveness. Our approach achieves state of the art results in a variety of benchmarks, including our winning entry in the VISDA-2017 visual domain adaptation challenge. In small image benchmarks, our algorithm not only outperforms prior art, but can also achieve accuracy that is close to that of a classifier trained in a supervised fashion.
| Task | Dataset | Metric | Value | Model |
|---|---|---|---|---|
| Domain Adaptation | USPS-to-MNIST | Accuracy | 98.07 | Mean teacher |
| Domain Adaptation | SVHN-to-MNIST | Accuracy | 99.18 | Mean teacher |
| Domain Adaptation | Synth Signs-to-GTSRB | Accuracy | 98.66 | Mean teacher |
| Domain Adaptation | VisDA2017 | Accuracy | 85.4 | Mean teacher |
| Domain Adaptation | MNIST-to-USPS | Accuracy | 98.26 | Mean teacher |