Xiaocheng Yang, Mingyu Yan, Shirui Pan, Xiaochun Ye, Dongrui Fan
Heterogeneous graph neural networks (HGNNs) have powerful capability to embed rich structural and semantic information of a heterogeneous graph into node representations. Existing HGNNs inherit many mechanisms from graph neural networks (GNNs) over homogeneous graphs, especially the attention mechanism and the multi-layer structure. These mechanisms bring excessive complexity, but seldom work studies whether they are really effective on heterogeneous graphs. This paper conducts an in-depth and detailed study of these mechanisms and proposes Simple and Efficient Heterogeneous Graph Neural Network (SeHGNN). To easily capture structural information, SeHGNN pre-computes the neighbor aggregation using a light-weight mean aggregator, which reduces complexity by removing overused neighbor attention and avoiding repeated neighbor aggregation in every training epoch. To better utilize semantic information, SeHGNN adopts the single-layer structure with long metapaths to extend the receptive field, as well as a transformer-based semantic fusion module to fuse features from different metapaths. As a result, SeHGNN exhibits the characteristics of simple network structure, high prediction accuracy, and fast training speed. Extensive experiments on five real-world heterogeneous graphs demonstrate the superiority of SeHGNN over the state-of-the-arts on both accuracy and training speed.
| Task | Dataset | Metric | Value | Model |
|---|---|---|---|---|
| Node Classification | IMDB (Heterogeneous Node Classification) | Macro-F1 | 66.63 | SeHGNN |
| Node Classification | IMDB (Heterogeneous Node Classification) | Micro-F1 | 68.21 | SeHGNN |
| Node Classification | Freebase (Heterogeneous Node Classification) | Macro-F1 | 50.71 | SeHGNN |
| Node Classification | Freebase (Heterogeneous Node Classification) | Micro-F1 | 63.41 | SeHGNN |
| Node Classification | DBLP (Heterogeneous Node Classification) | Macro-F1 | 94.86 | SeHGNN |
| Node Classification | DBLP (Heterogeneous Node Classification) | Micro-F1 | 95.24 | SeHGNN |
| Node Classification | ACM (Heterogeneous Node Classification) | Claim Classification Macro-F1 | 93.95 | SeHGNN |
| Node Classification | ACM (Heterogeneous Node Classification) | Micro-F1 | 93.87 | SeHGNN |
| Node Classification | OAG-Venue | MRR | 29.11 | SeHGNN |
| Node Classification | OAG-Venue | NDCG | 46.75 | SeHGNN |
| Node Classification | OAG-L1-Field | MRR | 84.95 | SeHGNN |
| Node Classification | OAG-L1-Field | NDCG | 86.01 | SeHGNN |
| Node Property Prediction | ogbn-mag | Number of params | 8371231 | SeHGNN (ComplEx embs) |
| Node Property Prediction | ogbn-mag | Number of params | 8371231 | SeHGNN |