TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Twin Contrastive Learning for Online Clustering

Twin Contrastive Learning for Online Clustering

Yunfan Li, Mouxing Yang, Dezhong Peng, Taihao Li, Jiantao Huang, Xi Peng

2022-10-21Short Text ClusteringDeep ClusteringOnline ClusteringImage ClusteringClusteringContrastive Learning
PaperPDFCode(official)Code

Abstract

This paper proposes to perform online clustering by conducting twin contrastive learning (TCL) at the instance and cluster level. Specifically, we find that when the data is projected into a feature space with a dimensionality of the target cluster number, the rows and columns of its feature matrix correspond to the instance and cluster representation, respectively. Based on the observation, for a given dataset, the proposed TCL first constructs positive and negative pairs through data augmentations. Thereafter, in the row and column space of the feature matrix, instance- and cluster-level contrastive learning are respectively conducted by pulling together positive pairs while pushing apart the negatives. To alleviate the influence of intrinsic false-negative pairs and rectify cluster assignments, we adopt a confidence-based criterion to select pseudo-labels for boosting both the instance- and cluster-level contrastive learning. As a result, the clustering performance is further improved. Besides the elegant idea of twin contrastive learning, another advantage of TCL is that it could independently predict the cluster assignment for each instance, thus effortlessly fitting online scenarios. Extensive experiments on six widely-used image and text benchmarks demonstrate the effectiveness of TCL. The code will be released on GitHub.

Results

TaskDatasetMetricValueModel
Text ClusteringStackoverflowAcc88.2TCL
Text ClusteringStackoverflowNMI0.786TCL
Text ClusteringBiomedicalAcc49.8TCL
Text ClusteringBiomedicalNMI42.9TCL
Image ClusteringImageNet-10ARI0.837TCL
Image ClusteringImageNet-10Accuracy0.895TCL
Image ClusteringImageNet-10NMI0.875TCL
Image ClusteringCIFAR-10ARI0.78TCL
Image ClusteringCIFAR-10Accuracy0.887TCL
Image ClusteringCIFAR-10NMI0.819TCL
Image ClusteringCIFAR-100ARI0.357TCL
Image ClusteringCIFAR-100Accuracy0.531TCL
Image ClusteringCIFAR-100NMI0.529TCL
Image ClusteringSTL-10ARI0.757TCL
Image ClusteringSTL-10Accuracy0.868TCL
Image ClusteringSTL-10NMI0.799TCL
Image ClusteringImagenet-dog-15ARI0.516TCL
Image ClusteringImagenet-dog-15Accuracy0.644TCL
Image ClusteringImagenet-dog-15NMI0.623TCL

Related Papers

Tri-Learn Graph Fusion Network for Attributed Graph Clustering2025-07-18SemCSE: Semantic Contrastive Sentence Embeddings Using LLM-Generated Summaries For Scientific Abstracts2025-07-17HapticCap: A Multimodal Dataset and Task for Understanding User Experience of Vibration Haptic Signals2025-07-17Overview of the TalentCLEF 2025: Skill and Job Title Intelligence for Human Capital Management2025-07-17SGCL: Unifying Self-Supervised and Supervised Learning for Graph Recommendation2025-07-17Ranking Vectors Clustering: Theory and Applications2025-07-16Similarity-Guided Diffusion for Contrastive Sequential Recommendation2025-07-16LLM-Driven Dual-Level Multi-Interest Modeling for Recommendation2025-07-15