TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Entropy is not Enough for Test-Time Adaptation: From the P...

Entropy is not Enough for Test-Time Adaptation: From the Perspective of Disentangled Factors

Jonghyun Lee, Dahuin Jung, Saehyung Lee, Junsung Park, Juhyeon Shin, Uiwon Hwang, Sungroh Yoon

2024-03-12Test-time Adaptation
PaperPDFCode(official)

Abstract

Test-time adaptation (TTA) fine-tunes pre-trained deep neural networks for unseen test data. The primary challenge of TTA is limited access to the entire test dataset during online updates, causing error accumulation. To mitigate it, TTA methods have utilized the model output's entropy as a confidence metric that aims to determine which samples have a lower likelihood of causing error. Through experimental studies, however, we observed the unreliability of entropy as a confidence metric for TTA under biased scenarios and theoretically revealed that it stems from the neglect of the influence of latent disentangled factors of data on predictions. Building upon these findings, we introduce a novel TTA method named Destroy Your Object (DeYO), which leverages a newly proposed confidence metric named Pseudo-Label Probability Difference (PLPD). PLPD quantifies the influence of the shape of an object on prediction by measuring the difference between predictions before and after applying an object-destructive transformation. DeYO consists of sample selection and sample weighting, which employ entropy and PLPD concurrently. For robust adaptation, DeYO prioritizes samples that dominantly incorporate shape information when making predictions. Our extensive experiments demonstrate the consistent superiority of DeYO over baseline methods across various scenarios, including biased and wild. Project page is publicly available at https://whitesnowdrop.github.io/DeYO/.

Results

TaskDatasetMetricValueModel
Domain AdaptationImageNet-CMean Accuracy48.6DeYO

Related Papers

Helping CLIP See Both the Forest and the Trees: A Decomposition and Description Approach2025-07-04DC-TTA: Divide-and-Conquer Framework for Test-Time Adaptation of Interactive Segmentation2025-06-29Recognizing Surgical Phases Anywhere: Few-Shot Test-time Adaptation and Task-graph Guided Refinement2025-06-25Keeping Medical AI Healthy: A Review of Detection and Correction Methods for System Degradation2025-06-20Exploring Audio Cues for Enhanced Test-Time Video Model Adaptation2025-06-14Test-Time Adaptation for Generalizable Task Progress Estimation2025-06-11SUTA-LM: Bridging Test-Time Adaptation and Language Model Rescoring for Robust ASR2025-06-10E-BATS: Efficient Backpropagation-Free Test-Time Adaptation for Speech Foundation Models2025-06-08