Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.


Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.


C3: Cross-instance guided Contrastive Clustering

Mohammadreza Sadeghi, Hadi Hojjati, Narges Armanfard

2022-11-14 · Deep Clustering · Image Clustering · Clustering · Contrastive Learning

Paper · PDF · Code (official)

Abstract

Clustering is the task of gathering similar data samples into clusters without using any predefined labels. It has been widely studied in the machine learning literature, and recent advances in deep learning have revived interest in the field. Contrastive clustering (CC) models are a staple of deep clustering in which positive and negative pairs for each data instance are generated through data augmentation. CC models aim to learn a feature space where the instance-level and cluster-level representations of positive pairs are grouped together. Despite improving the state of the art, these algorithms ignore cross-instance patterns, which carry essential information for improving clustering performance. This increases the model's false-negative-pair rate while decreasing its true-positive-pair rate. In this paper, we propose a novel contrastive clustering method, Cross-instance guided Contrastive Clustering (C3), that considers cross-sample relationships to increase the number of positive pairs and mitigate the impact of false-negative, noisy, and anomalous samples on the learned representation of the data. In particular, we define a new loss function that identifies similar instances using the instance-level representation and encourages them to aggregate together. Moreover, we propose a novel weighting method to select negative samples more efficiently. Extensive experimental evaluations show that our proposed method can outperform state-of-the-art algorithms on benchmark computer vision datasets: we improve the clustering accuracy by 6.6%, 3.3%, 5.0%, 1.3%, and 0.3% on CIFAR-10, CIFAR-100, ImageNet-10, ImageNet-Dogs, and Tiny-ImageNet, respectively.

Results

Task              Dataset           Metric    Value  Model
Image Clustering  ImageNet-10       ARI       0.861  C3
Image Clustering  ImageNet-10       Accuracy  0.942  C3
Image Clustering  ImageNet-10       NMI       0.905  C3
Image Clustering  CIFAR-10          ARI       0.707  C3
Image Clustering  CIFAR-10          Accuracy  0.838  C3
Image Clustering  CIFAR-10          NMI       0.748  C3
Image Clustering  Tiny-ImageNet     ARI       0.065  C3
Image Clustering  Tiny-ImageNet     Accuracy  0.141  C3
Image Clustering  Tiny-ImageNet     NMI       0.335  C3
Image Clustering  CIFAR-100         ARI       0.275  C3
Image Clustering  CIFAR-100         Accuracy  0.451  C3
Image Clustering  CIFAR-100         NMI       0.434  C3
Image Clustering  Imagenet-dog-15   ARI       0.280  C3
Image Clustering  Imagenet-dog-15   Accuracy  0.434  C3
Image Clustering  Imagenet-dog-15   NMI       0.448  C3

Related Papers

Tri-Learn Graph Fusion Network for Attributed Graph Clustering (2025-07-18)
SemCSE: Semantic Contrastive Sentence Embeddings Using LLM-Generated Summaries For Scientific Abstracts (2025-07-17)
HapticCap: A Multimodal Dataset and Task for Understanding User Experience of Vibration Haptic Signals (2025-07-17)
Overview of the TalentCLEF 2025: Skill and Job Title Intelligence for Human Capital Management (2025-07-17)
SGCL: Unifying Self-Supervised and Supervised Learning for Graph Recommendation (2025-07-17)
Ranking Vectors Clustering: Theory and Applications (2025-07-16)
Similarity-Guided Diffusion for Contrastive Sequential Recommendation (2025-07-16)
LLM-Driven Dual-Level Multi-Interest Modeling for Recommendation (2025-07-15)