TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Active Learning on a Budget: Opposite Strategies Suit High...

Active Learning on a Budget: Opposite Strategies Suit High and Low Budgets

Guy Hacohen, Avihu Dekel, Daphna Weinshall

2022-02-06Active Learning
PaperPDFCode(official)

Abstract

Investigating active learning, we focus on the relation between the number of labeled examples (budget size), and suitable querying strategies. Our theoretical analysis shows a behavior reminiscent of phase transition: typical examples are best queried when the budget is low, while unrepresentative examples are best queried when the budget is large. Combined evidence shows that a similar phenomenon occurs in common classification models. Accordingly, we propose TypiClust -- a deep active learning strategy suited for low budgets. In a comparative empirical investigation of supervised learning, using a variety of architectures and image datasets, TypiClust outperforms all other active learning strategies in the low-budget regime. Using TypiClust in the semi-supervised framework, performance gets an even more significant boost. In particular, state-of-the-art semi-supervised methods trained on CIFAR-10 with 10 labeled examples selected by TypiClust, reach 93.2% accuracy -- an improvement of 39.4% over random selection. Code is available at https://github.com/avihu111/TypiClust.

Results

TaskDatasetMetricValueModel
Optical Character Recognition (OCR)CIFAR10 (10,000)Accuracy93.2TypiClust
Active LearningCIFAR10 (10,000)Accuracy93.2TypiClust

Related Papers

A Risk-Aware Adaptive Robust MPC with Learned Uncertainty Quantification2025-07-15CriticLean: Critic-Guided Reinforcement Learning for Mathematical Formalization2025-07-08MP-ALOE: An r2SCAN dataset for universal machine learning interatomic potentials2025-07-08Active Learning for Manifold Gaussian Process Regression2025-06-26Machine-Learning-Assisted Photonic Device Development: A Multiscale Approach from Theory to Characterization2025-06-24Active Learning-Guided Seq2Seq Variational Autoencoder for Multi-target Inhibitor Generation2025-06-18Bayesian Active Learning of (small) Quantile Sets through Expected Estimator Modification2025-06-16Coupled reaction and diffusion governing interface evolution in solid-state batteries2025-06-12