Xinlei Chen, Haoqi Fan, Ross Girshick, Kaiming He
Contrastive unsupervised learning has recently shown encouraging progress, e.g., in Momentum Contrast (MoCo) and SimCLR. In this note, we verify the effectiveness of two of SimCLR's design improvements by implementing them in the MoCo framework. With simple modifications to MoCo---namely, using an MLP projection head and more data augmentation---we establish stronger baselines that outperform SimCLR and do not require large training batches. We hope this will make state-of-the-art unsupervised learning research more accessible. Code will be made public.
| Task | Dataset | Metric | Value | Model |
|---|---|---|---|---|
| Person Re-Identification | SYSU-30k | Rank-1 | 11.6 | MoCo v2 (self-supervised) |
| Person Re-Identification | SYSU-30k | Rank-1 | 11.6 | MoCo v2 |
| Image Classification | Places205 | Top 1 Accuracy | 52.9 | MoCo v2 |
| Contrastive Learning | imagenet-1k | ImageNet Top-1 Accuracy | 71.1 | ResNet50 |