TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Improving Image Clustering with Artifacts Attenuation via ...

Improving Image Clustering with Artifacts Attenuation via Inference-Time Attention Engineering

Kazumoto Nakamura, Yuji Nozawa, Yu-Chieh Lin, Kengo Nakata, Youyang Ng

2024-10-07Image ClusteringClustering
PaperPDF

Abstract

The goal of this paper is to improve the performance of pretrained Vision Transformer (ViT) models, particularly DINOv2, in image clustering task without requiring re-training or fine-tuning. As model size increases, high-norm artifacts anomaly appears in the patches of multi-head attention. We observe that this anomaly leads to reduced accuracy in zero-shot image clustering. These artifacts are characterized by disproportionately large values in the attention map compared to other patch tokens. To address these artifacts, we propose an approach called Inference-Time Attention Engineering (ITAE), which manipulates attention function during inference. Specifically, we identify the artifacts by investigating one of the Query-Key-Value (QKV) patches in the multi-head attention and attenuate their corresponding attention values inside the pretrained models. ITAE shows improved clustering accuracy on multiple datasets by exhibiting more expressive features in latent space. Our findings highlight the potential of ITAE as a practical solution for reducing artifacts in pretrained ViT models and improving model performance in clustering tasks without the need for re-training or fine-tuning.

Results

TaskDatasetMetricValueModel
Image ClusteringCIFAR-10ARI0.7946ITAE
Image ClusteringCIFAR-10Accuracy0.8449ITAE
Image ClusteringCIFAR-10NMI0.8682ITAE
Image ClusteringTiny-ImageNetARI0.5227ITAE
Image ClusteringTiny-ImageNetAccuracy0.6823ITAE
Image ClusteringTiny-ImageNetNMI0.8178ITAE
Image ClusteringCIFAR-100ARI0.5053ITAE
Image ClusteringCIFAR-100Accuracy0.6502ITAE
Image ClusteringCIFAR-100NMI0.771ITAE
Image ClusteringSTL-10ARI0.7594ITAE
Image ClusteringSTL-10Accuracy0.8276ITAE
Image ClusteringSTL-10NMI0.8818ITAE

Related Papers

Tri-Learn Graph Fusion Network for Attributed Graph Clustering2025-07-18Ranking Vectors Clustering: Theory and Applications2025-07-16Car Object Counting and Position Estimation via Extension of the CLIP-EBC Framework2025-07-11GNN-ViTCap: GNN-Enhanced Multiple Instance Learning with Vision Transformers for Whole Slide Image Classification and Captioning2025-07-09Consistency and Inconsistency in $K$-Means Clustering2025-07-08MC-INR: Efficient Encoding of Multivariate Scientific Simulation Data using Meta-Learning and Clustered Implicit Neural Representations2025-07-03Supercm: Revisiting Clustering for Semi-Supervised Learning2025-06-30Temporal Rate Reduction Clustering for Human Motion Segmentation2025-06-26