Zhe Chen, Hao Tan, Tao Wang, Tianrun Shen, Tong Lu, Qiuying Peng, Cheng Cheng, Yue Qi
This paper presents a novel transformer architecture for graph representation learning. The core insight of our method is to fully consider the information propagation among nodes and edges in a graph when building the attention module in the transformer blocks. Specifically, we propose a new attention mechanism called Graph Propagation Attention (GPA). It explicitly passes information among nodes and edges in three ways, i.e., node-to-node, node-to-edge, and edge-to-node, which is essential for learning graph-structured data. On this basis, we design an effective transformer architecture named Graph Propagation Transformer (GPTrans) to further facilitate learning on graph data. We verify the performance of GPTrans in a wide range of graph learning experiments on several benchmark datasets. The results show that our method outperforms many state-of-the-art transformer-based graph models. The code will be released at https://github.com/czczup/GPTrans.
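As a rough illustration of how the three propagation paths described above could interact inside a single attention block, the PyTorch sketch below treats node-to-node propagation as standard multi-head attention over node features, edge-to-node propagation as a per-head additive bias derived from edge features, and node-to-edge propagation as an edge update driven by the attention maps. The module name, tensor shapes, and projection layers are assumptions made for illustration only; the authoritative implementation is in the repository linked above.

```python
# Hypothetical sketch of the three propagation paths named in the abstract.
# Not the official GPTrans code; shapes and mixing layers are assumptions.
import torch
import torch.nn as nn


class GraphPropagationAttentionSketch(nn.Module):
    def __init__(self, dim: int, num_heads: int = 8):
        super().__init__()
        self.num_heads = num_heads
        self.head_dim = dim // num_heads
        self.scale = self.head_dim ** -0.5
        self.qkv = nn.Linear(dim, dim * 3)            # node queries/keys/values
        self.edge_bias = nn.Linear(dim, num_heads)    # edge features -> per-head attention bias
        self.edge_update = nn.Linear(num_heads, dim)  # attention maps -> edge feature update
        self.node_proj = nn.Linear(dim, dim)

    def forward(self, x: torch.Tensor, e: torch.Tensor):
        # x: node features (B, N, C); e: edge features (B, N, N, C)
        B, N, C = x.shape
        q, k, v = (
            self.qkv(x)
            .reshape(B, N, 3, self.num_heads, self.head_dim)
            .permute(2, 0, 3, 1, 4)
        )

        # Node-to-node: scaled dot-product attention between node features.
        attn = (q @ k.transpose(-2, -1)) * self.scale            # (B, H, N, N)
        # Edge-to-node: edge features injected as an additive attention bias.
        attn = attn + self.edge_bias(e).permute(0, 3, 1, 2)
        attn = attn.softmax(dim=-1)

        # Node-to-edge: edge features refreshed from the attention maps.
        e = e + self.edge_update(attn.permute(0, 2, 3, 1))       # (B, N, N, C)

        # Aggregate values back to node features.
        x = (attn @ v).transpose(1, 2).reshape(B, N, C)
        return self.node_proj(x), e


if __name__ == "__main__":
    # Toy usage: 4 graphs padded to 16 nodes, 64-dimensional features.
    gpa = GraphPropagationAttentionSketch(dim=64, num_heads=8)
    x = torch.randn(4, 16, 64)
    e = torch.randn(4, 16, 16, 64)
    x_out, e_out = gpa(x, e)
    print(x_out.shape, e_out.shape)  # torch.Size([4, 16, 64]) torch.Size([4, 16, 16, 64])
```

The key point of the sketch is that node and edge representations are both read and both written within one attention block, rather than edges serving only as a static bias.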
| Task | Dataset | Metric | Value | Model |
|---|---|---|---|---|
| Graph Regression | PCQM4Mv2-LSC | Test MAE | 0.0821 | GPTrans-L |
| Graph Regression | PCQM4Mv2-LSC | Validation MAE | 0.0809 | GPTrans-L |
| Graph Regression | PCQM4Mv2-LSC | Test MAE | 0.0842 | GPTrans-T |
| Graph Regression | PCQM4Mv2-LSC | Validation MAE | 0.0833 | GPTrans-T |
| Graph Regression | ZINC-500k | MAE | 0.077 | GPTrans-Nano |
| Graph Regression | PCQM4M-LSC | Validation MAE | 0.1151 | GPTrans-L |
| Node Classification | CLUSTER | Accuracy | 78.07 | GPTrans-Nano |