TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Learning Representation for Clustering via Prototype Scatt...

Learning Representation for Clustering via Prototype Scattering and Positive Sampling

Zhizhong Huang, Jie Chen, Junping Zhang, Hongming Shan

2021-11-23Deep ClusteringRepresentation LearningImage ClusteringClusteringContrastive Learning
PaperPDFCode(official)

Abstract

Existing deep clustering methods rely on either contrastive or non-contrastive representation learning for downstream clustering task. Contrastive-based methods thanks to negative pairs learn uniform representations for clustering, in which negative pairs, however, may inevitably lead to the class collision issue and consequently compromise the clustering performance. Non-contrastive-based methods, on the other hand, avoid class collision issue, but the resulting non-uniform representations may cause the collapse of clustering. To enjoy the strengths of both worlds, this paper presents a novel end-to-end deep clustering method with prototype scattering and positive sampling, termed ProPos. Specifically, we first maximize the distance between prototypical representations, named prototype scattering loss, which improves the uniformity of representations. Second, we align one augmented view of instance with the sampled neighbors of another view -- assumed to be truly positive pair in the embedding space -- to improve the within-cluster compactness, termed positive sampling alignment. The strengths of ProPos are avoidable class collision issue, uniform representations, well-separated clusters, and within-cluster compactness. By optimizing ProPos in an end-to-end expectation-maximization framework, extensive experimental results demonstrate that ProPos achieves competing performance on moderate-scale clustering benchmark datasets and establishes new state-of-the-art performance on large-scale datasets. Source code is available at \url{https://github.com/Hzzone/ProPos}.

Results

TaskDatasetMetricValueModel
Image ClusteringImageNet-10ARI0.918ProPos*
Image ClusteringImageNet-10Accuracy0.962ProPos*
Image ClusteringImageNet-10Image Size224ProPos*
Image ClusteringImageNet-10NMI0.908ProPos*
Image ClusteringImageNet-10ARI0.906ProPos
Image ClusteringImageNet-10Accuracy0.956ProPos
Image ClusteringImageNet-10Image Size96ProPos
Image ClusteringImageNet-10NMI0.896ProPos
Image ClusteringImagenet-dog-15ARI0.675ProPos*
Image ClusteringImagenet-dog-15Accuracy0.775ProPos*
Image ClusteringImagenet-dog-15Image Size224ProPos*
Image ClusteringImagenet-dog-15NMI0.737ProPos*
Image ClusteringImagenet-dog-15ARI0.627ProPos
Image ClusteringImagenet-dog-15Accuracy0.745ProPos
Image ClusteringImagenet-dog-15Image Size96ProPos
Image ClusteringImagenet-dog-15NMI0.692ProPos

Related Papers

Touch in the Wild: Learning Fine-Grained Manipulation with a Portable Visuo-Tactile Gripper2025-07-20Tri-Learn Graph Fusion Network for Attributed Graph Clustering2025-07-18Spectral Bellman Method: Unifying Representation and Exploration in RL2025-07-17Boosting Team Modeling through Tempo-Relational Representation Learning2025-07-17SemCSE: Semantic Contrastive Sentence Embeddings Using LLM-Generated Summaries For Scientific Abstracts2025-07-17HapticCap: A Multimodal Dataset and Task for Understanding User Experience of Vibration Haptic Signals2025-07-17Overview of the TalentCLEF 2025: Skill and Job Title Intelligence for Human Capital Management2025-07-17SGCL: Unifying Self-Supervised and Supervised Learning for Graph Recommendation2025-07-17