Label Deconvolution for Node Representation Learning on Large-scale Attributed Graphs against Learning Bias

Zhihao Shi, Jie Wang, Fanghua Lu, Hanzhu Chen, Defu Lian, Zheng Wang, Jieping Ye, Feng Wu

2023-09-26Representation Learning

Abstract

Node representation learning on attributed graphs -- whose nodes are associated with rich attributes (e.g., texts and protein sequences) -- plays a crucial role in many important downstream tasks. To encode the attributes and graph structures simultaneously, recent studies integrate pre-trained models with graph neural networks (GNNs), where pre-trained models serve as node encoders (NEs) to encode the attributes. As jointly training large NEs and GNNs on large-scale graphs suffers from severe scalability issues, many methods propose to train NEs and GNNs separately. Consequently, they do not take feature convolutions in GNNs into consideration in the training phase of NEs, leading to a significant learning bias from that by the joint training. To address this challenge, we propose an efficient label regularization technique, namely Label Deconvolution (LD), to alleviate the learning bias by a novel and highly scalable approximation to the inverse mapping of GNNs. The inverse mapping leads to an objective function that is equivalent to that by the joint training, while it can effectively incorporate GNNs in the training phase of NEs against the learning bias. More importantly, we show that LD converges to the optimal objective function values by thejoint training under mild assumptions. Experiments demonstrate LD significantly outperforms state-of-the-art methods on Open Graph Benchmark datasets.

Results

Task	Dataset	Metric	Value	Model
Node Property Prediction	ogbn-arxiv	Number of params	140438868	LD+REVGAT
Node Property Prediction	ogbn-products	Number of params	110636896	LD+GIANT+SAGN+SCR
Node Property Prediction	ogbn-products	Number of params	144331677	LD+GAMLP
Node Property Prediction	ogbn-proteins	Number of params	664233700	LD+GAT

Label Deconvolution for Node Representation Learning on Large-scale Attributed Graphs against Learning Bias

Abstract

Results

Related Papers

Label Deconvolution for Node Representation Learning on Large-scale Attributed Graphs against Learning Bias

Abstract

Results

Related Papers