Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.


Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.


Don't Wait, Just Weight: Improving Unsupervised Representations by Learning Goal-Driven Instance Weights

Linus Ericsson, Henry Gouk, Timothy M. Hospedales

Published: 2020-06-22
Tasks: Meta-Learning, Self-Supervised Learning
Paper · PDF

Abstract

In the absence of large labelled datasets, self-supervised learning techniques can boost performance by learning useful representations from unlabelled data, which is often more readily available. However, there is often a domain shift between the unlabelled collection and the downstream target problem data. We show that by learning Bayesian instance weights for the unlabelled data, we can improve the downstream classification accuracy by prioritising the most useful instances. Additionally, we show that the training time can be reduced by discarding unnecessary datapoints. Our method, BetaDataWeighter, is evaluated using the popular self-supervised rotation prediction task on STL-10 and Visual Decathlon. We compare to related instance weighting schemes, both hand-designed heuristics and meta-learning, as well as conventional self-supervised learning. BetaDataWeighter achieves both the highest average accuracy and rank across datasets, and on STL-10 it prunes up to 78% of unlabelled images without significant loss in accuracy, corresponding to over a 50% reduction in training time.
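The core idea above, weighting each unlabelled instance by a sample from a learned Beta distribution and pruning instances whose expected weight is low, can be sketched in a few lines. This is a minimal illustrative sketch, not the authors' implementation: the Beta parameters (`alpha`, `beta`), per-instance losses, and the pruning threshold are all hypothetical stand-ins (in the paper the Beta parameters are learned in a goal-driven, meta-learning fashion).

```python
import numpy as np

rng = np.random.default_rng(0)
n = 8  # number of unlabelled instances (toy size)

# Hypothetical per-instance Beta parameters; in the paper these are
# learned so that weights favour instances useful for the downstream task.
alpha = rng.uniform(0.5, 5.0, size=n)
beta = rng.uniform(0.5, 5.0, size=n)

# Per-instance self-supervised losses (e.g. rotation-prediction
# cross-entropy); random stand-ins here.
losses = rng.uniform(0.1, 2.0, size=n)

# Sample instance weights w_i ~ Beta(alpha_i, beta_i) and form a
# weighted training objective over the unlabelled batch.
weights = rng.beta(alpha, beta)
weighted_loss = np.sum(weights * losses) / np.sum(weights)

# Pruning: drop instances whose expected weight E[w_i] = a/(a+b)
# falls below a threshold, reducing training time.
expected_w = alpha / (alpha + beta)
keep = expected_w >= 0.3  # threshold is an arbitrary illustration
print(f"weighted loss: {weighted_loss:.3f}, kept {int(keep.sum())}/{n} instances")
```

Because the weighted loss is a convex combination of the per-instance losses, it always lies within their range; the pruning step is what yields the reported training-time savings.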

Results

Task                 | Dataset | Metric             | Value | Model
Image Classification | STL-10  | Percentage correct | 71.12 | BDW
Image Classification | STL-10  | Percentage correct | 69.15 | NN-Weighter
Image Classification | STL-10  | Percentage correct | 68.19 | RotNet
Image Classification | STL-10  | Percentage correct | 63.13 | L2RW

Related Papers

A Semi-Supervised Learning Method for the Identification of Bad Exposures in Large Imaging Surveys (2025-07-17)
Are encoders able to learn landmarkers for warm-starting of Hyperparameter Optimization? (2025-07-16)
Imbalanced Regression Pipeline Recommendation (2025-07-16)
CLID-MU: Cross-Layer Information Divergence Based Meta Update Strategy for Learning with Noisy Labels (2025-07-16)
Mixture of Experts in Large Language Models (2025-07-15)
Iceberg: Enhancing HLS Modeling with Synthetic Data (2025-07-14)
Self-supervised Learning on Camera Trap Footage Yields a Strong Universal Face Embedder (2025-07-14)
Meta-Reinforcement Learning for Fast and Data-Efficient Spectrum Allocation in Dynamic Wireless Networks (2025-07-13)