Rethinking Graph Auto-Encoder Models for Attributed Graph Clustering

Nairouz Mrabah, Mohamed Bouguessa, Mohamed Fawzi Touati, Riadh Ksantini

2021-07-19 | Graph Clustering | Node Clustering | Clustering

Abstract

Most recent graph clustering methods have resorted to Graph Auto-Encoders (GAEs) to perform joint clustering and embedding learning. However, two critical issues have been overlooked. First, the accumulative error, inflicted by learning with noisy clustering assignments, degrades the effectiveness and robustness of the clustering model. This problem is called Feature Randomness. Second, reconstructing the adjacency matrix sets the model to learn irrelevant similarities for the clustering task. This problem is called Feature Drift. Interestingly, the theoretical relation between the aforementioned problems has not yet been investigated. We study these issues from two aspects: (1) there is a trade-off between Feature Randomness and Feature Drift when clustering and reconstruction are performed at the same level, and (2) the problem of Feature Drift is more pronounced for GAE models, compared with vanilla auto-encoder models, due to the graph convolutional operation and the graph decoding design. Motivated by these findings, we reformulate the GAE-based clustering methodology. Our solution is two-fold. First, we propose a sampling operator $\Xi$ that triggers a protection mechanism against the noisy clustering assignments. Second, we propose an operator $\Upsilon$ that triggers a correction mechanism against Feature Drift by gradually transforming the reconstructed graph into a clustering-oriented one. As principal advantages, our solution grants a considerable improvement in clustering effectiveness and robustness and can be easily tailored to existing GAE models.
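
The abstract does not spell out how $\Xi$ and $\Upsilon$ are defined, so the following is only a minimal sketch of the two ideas under stated assumptions. It assumes that "protection against noisy clustering assignments" can be approximated by keeping only nodes whose soft assignment is confident and well separated from the runner-up cluster, and that "transforming the reconstructed graph into a clustering-oriented one" can be approximated by dropping edges between reliable nodes assigned to different clusters. The function names, the thresholds `min_confidence` and `min_margin`, and the edge-dropping rule are all hypothetical and are not taken from the paper.

```python
def select_reliable_nodes(soft_assignments, min_confidence=0.9, min_margin=0.5):
    """Hypothetical stand-in for a sampling operator such as Xi.

    Keeps the indices of nodes whose highest cluster probability is at least
    `min_confidence` and exceeds the second highest by at least `min_margin`.
    Both thresholds are illustrative assumptions.
    """
    reliable = []
    for idx, probs in enumerate(soft_assignments):
        ranked = sorted(probs, reverse=True)
        top = ranked[0]
        runner_up = ranked[1] if len(ranked) > 1 else 0.0
        if top >= min_confidence and (top - runner_up) >= min_margin:
            reliable.append(idx)
    return reliable


def hard_labels(soft_assignments):
    """Assign each node to its most probable cluster."""
    return [max(range(len(p)), key=p.__getitem__) for p in soft_assignments]


def drop_cross_cluster_edges(edges, labels, reliable):
    """Hypothetical stand-in for a graph-correction operator such as Upsilon.

    Removes an edge only when both endpoints are reliable and carry different
    cluster labels; every other edge is kept. Edges are returned as a set of
    (small, large) index pairs.
    """
    reliable = set(reliable)
    kept = set()
    for u, v in edges:
        both_reliable = u in reliable and v in reliable
        if both_reliable and labels[u] != labels[v]:
            continue  # edge contradicts the confident clustering: drop it
        kept.add((min(u, v), max(u, v)))
    return kept


# Toy example: three nodes, two of them confidently assigned.
soft = [[0.95, 0.03, 0.02], [0.50, 0.45, 0.05], [0.10, 0.85, 0.05]]
reliable = select_reliable_nodes(soft, min_confidence=0.8, min_margin=0.5)
labels = hard_labels(soft)
target_edges = drop_cross_cluster_edges([(0, 2), (0, 1), (1, 2)], labels, reliable)
```

In this toy example the uncertain node 1 is excluded from the reliable set, so its edges are left alone, while the edge between the two confident nodes in different clusters is removed. Applying such a rule repeatedly as more nodes become reliable is one way to read "gradually" in the abstract, though the paper's actual schedule may differ.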

Results

| Task | Dataset | Metric | Value | Model |
| --- | --- | --- | --- | --- |
| Graph Clustering | Pubmed | ACC | 74 | R-GMM-VGAE |
| Graph Clustering | Pubmed | ARI | 37.9 | R-GMM-VGAE |
| Graph Clustering | Pubmed | NMI | 33.4 | R-GMM-VGAE |
| Graph Clustering | Pubmed | ACC | 71.4 | R-DGAE |
| Graph Clustering | Pubmed | ARI | 34.6 | R-DGAE |
| Graph Clustering | Pubmed | NMI | 34.4 | R-DGAE |
| Graph Clustering | Cora | ACC | 76.7 | R-GMM-VGAE |
| Graph Clustering | Cora | ARI | 57.9 | R-GMM-VGAE |
| Graph Clustering | Cora | NMI | 57.3 | R-GMM-VGAE |
| Graph Clustering | Cora | ACC | 73.7 | R-DGAE |
| Graph Clustering | Cora | ARI | 54.1 | R-DGAE |
| Graph Clustering | Cora | NMI | 56 | R-DGAE |
| Graph Clustering | Citeseer | ACC | 70.5 | R-DGAE |
| Graph Clustering | Citeseer | ARI | 47.1 | R-DGAE |
| Graph Clustering | Citeseer | NMI | 45 | R-DGAE |
| Graph Clustering | Citeseer | ACC | 68.9 | R-GMM-VGAE |
| Graph Clustering | Citeseer | ARI | 43.9 | R-GMM-VGAE |
| Graph Clustering | Citeseer | NMI | 42 | R-GMM-VGAE |
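
The page does not define the metrics. Assuming ACC denotes the usual clustering accuracy, where predicted cluster ids are matched one-to-one to ground-truth classes so as to maximise agreement, it can be sketched as below. The function name `clustering_accuracy` is hypothetical, and the brute-force search over permutations is a simplification chosen to keep the example dependency-free; evaluation code typically uses the Hungarian algorithm for this matching, and brute force is only practical for a handful of clusters.

```python
from itertools import permutations


def clustering_accuracy(true_labels, pred_labels):
    """Fraction of nodes labelled correctly under the best one-to-one
    mapping from predicted cluster ids to ground-truth class ids."""
    if len(true_labels) != len(pred_labels) or not true_labels:
        raise ValueError("label lists must be non-empty and of equal length")

    true_ids = sorted(set(true_labels))
    pred_ids = sorted(set(pred_labels))
    # Pad with placeholder classes so every cluster can get a distinct target
    # even when there are more predicted clusters than true classes.
    targets = true_ids + [None] * max(0, len(pred_ids) - len(true_ids))

    # counts[(cluster, cls)] = number of nodes in `cluster` with class `cls`.
    counts = {}
    for cls, cluster in zip(true_labels, pred_labels):
        counts[(cluster, cls)] = counts.get((cluster, cls), 0) + 1

    best_hits = 0
    for assignment in permutations(targets, len(pred_ids)):
        hits = sum(counts.get((c, t), 0) for c, t in zip(pred_ids, assignment))
        best_hits = max(best_hits, hits)
    return best_hits / len(true_labels)


# Cluster ids are arbitrary, so a relabelled perfect clustering scores 1.0.
perfect = clustering_accuracy([0, 0, 1, 1], [1, 1, 0, 0])
# One of six nodes is placed in the wrong cluster.
partial = clustering_accuracy([0, 0, 1, 1, 2, 2], [1, 1, 0, 0, 2, 0])
```

The function returns a fraction in [0, 1]; the values in the table appear to be percentages, so a result would be multiplied by 100 before comparing against them.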
