Keyulu Xu, Weihua Hu, Jure Leskovec, Stefanie Jegelka
Graph Neural Networks (GNNs) are an effective framework for representation learning of graphs. GNNs follow a neighborhood aggregation scheme, where the representation vector of a node is computed by recursively aggregating and transforming representation vectors of its neighboring nodes. Many GNN variants have been proposed and have achieved state-of-the-art results on both node and graph classification tasks. However, despite GNNs revolutionizing graph representation learning, there is limited understanding of their representational properties and limitations. Here, we present a theoretical framework for analyzing the expressive power of GNNs to capture different graph structures. Our results characterize the discriminative power of popular GNN variants, such as Graph Convolutional Networks and GraphSAGE, and show that they cannot learn to distinguish certain simple graph structures. We then develop a simple architecture that is provably the most expressive among the class of GNNs and is as powerful as the Weisfeiler-Lehman graph isomorphism test. We empirically validate our theoretical findings on a number of graph classification benchmarks, and demonstrate that our model achieves state-of-the-art performance.
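
The neighborhood aggregation scheme described above can be illustrated with a minimal pure-Python sketch. The abstract does not spell out any update rule, so everything here is an assumption made for illustration: the helper names (`aggregate`, `sum_update`, `mean_update`, `max_update`), the scalar node features, the `eps` parameter, and the two toy star graphs. The learned transformation (for example an MLP) that a real GNN layer would apply is omitted. The sketch shows one case where mean- and max-style aggregation give the same result for two different neighborhoods, while a sum-style aggregation separates them.

```python
from statistics import mean


def aggregate(graph, features, update):
    """One round of neighborhood aggregation over an adjacency-list graph.

    Each node's new value is update(own feature, list of neighbor features).
    """
    return {v: update(features[v], [features[u] for u in graph[v]]) for v in graph}


def sum_update(h_v, neighbor_feats, eps=0.0):
    # Sum-style update (assumed form): weighted self feature plus neighbor sum.
    return (1.0 + eps) * h_v + sum(neighbor_feats)


def mean_update(h_v, neighbor_feats):
    # Mean-style update: average over the node and its neighbors.
    return mean([h_v] + neighbor_feats)


def max_update(h_v, neighbor_feats):
    # Max-style update: maximum over the node and its neighbors.
    return max([h_v] + neighbor_feats)


# Hypothetical toy graphs: a center node 0 with two leaves vs. three leaves.
star2 = {0: [1, 2], 1: [0], 2: [0]}
star3 = {0: [1, 2, 3], 1: [0], 2: [0], 3: [0]}


def constant_features(graph):
    # Every node starts with the same scalar feature.
    return {v: 1.0 for v in graph}


def center_values(update):
    # Value of the center node (node 0) in each graph after one round.
    a = aggregate(star2, constant_features(star2), update)[0]
    b = aggregate(star3, constant_features(star3), update)[0]
    return a, b


center_sum = center_values(sum_update)
center_mean = center_values(mean_update)
center_max = center_values(max_update)

print("sum: ", center_sum)
print("mean:", center_mean)
print("max: ", center_max)
```

With these inputs the sum-style update gives the two centers different values (3.0 and 4.0), while the mean- and max-style updates give 1.0 for both. This is only a toy instance of the kind of structure an aggregator can fail to distinguish, not a reproduction of the paper's analysis.
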
| Task | Dataset | Metric | Value | Model |
|---|---|---|---|---|
| Graph Regression | PCQM4Mv2-LSC | Test MAE | 0.1218 | GIN |
| Graph Regression | PCQM4Mv2-LSC | Validation MAE | 0.1195 | GIN |
| Graph Regression | ZINC-500k | MAE | 0.526 | GIN |
| Graph Classification | COX2 | Accuracy (10-fold) | 81.13 | GIN-0 |
| Graph Classification | CIFAR10 100k | Accuracy (%) | 53.28 | GIN |
| Graph Classification | REDDIT-B | Accuracy | 92.4 | GIN-0 |
| Node Classification | PATTERN 100k | Accuracy (%) | 85.59 | GIN |
| Graph Property Prediction | ogbg-molhiv | Number of params | 3336306 | GIN+virtual node |
| Graph Property Prediction | ogbg-molhiv | Number of params | 1885206 | GIN |
| Graph Property Prediction | ogbg-code2 | Number of params | 13841815 | GIN+virtual node |
| Graph Property Prediction | ogbg-code2 | Number of params | 12390715 | GIN |
| Graph Property Prediction | ogbg-ppa | Number of params | 3288042 | GIN+virtual node |
| Graph Property Prediction | ogbg-ppa | Number of params | 1836942 | GIN |
| Graph Property Prediction | ogbg-molpcba | Number of params | 3374533 | GIN+virtual node |
| Graph Property Prediction | ogbg-molpcba | Number of params | 1923433 | GIN |