Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

PGT-I: Scaling Spatiotemporal GNNs with Memory-Efficient Distributed Training

Seth Ockerman, Amal Gueroudji, Tanwi Mallick, Yixuan He, Line Pouchard, Robert Ross, Shivaram Venkataraman

2025-07-15 · graph partitioning
Abstract

Spatiotemporal graph neural networks (ST-GNNs) are powerful tools for modeling spatial and temporal data dependencies. However, their applications have been limited primarily to small-scale datasets because of memory constraints. While distributed training offers a solution, current frameworks lack support for spatiotemporal models and overlook the properties of spatiotemporal data. Informed by a scaling study on a large-scale workload, we present PyTorch Geometric Temporal Index (PGT-I), an extension to PyTorch Geometric Temporal that integrates distributed data parallel training and two novel strategies: index-batching and distributed-index-batching. Our index techniques exploit spatiotemporal structure to construct snapshots dynamically at runtime, significantly reducing memory overhead, while distributed-index-batching extends this approach by enabling scalable processing across multiple GPUs. Our techniques enable the first-ever training of an ST-GNN on the entire PeMS dataset without graph partitioning, reducing peak memory usage by up to 89% and achieving up to a 13.1x speedup over standard DDP with 128 GPUs.
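
The idea of constructing snapshots at runtime from indices, rather than copying every sliding window ahead of time, can be illustrated with a small NumPy sketch. This is not the PGT-I API: `materialized_windows`, `IndexBatcher`, and `shard_starts` are hypothetical names, the `(time, nodes, features)` layout and the equal input/target horizon are assumptions, and the strided split across ranks is only one plausible way to divide window indices among workers.

```python
import numpy as np


def materialized_windows(data, horizon):
    """Baseline: copy every (input, target) sliding window up front."""
    n = data.shape[0] - 2 * horizon + 1
    x = np.stack([data[i:i + horizon] for i in range(n)])
    y = np.stack([data[i + horizon:i + 2 * horizon] for i in range(n)])
    return x, y


class IndexBatcher:
    """Keep one copy of the series; store only window start indices."""

    def __init__(self, data, horizon):
        self.data = data            # shape: (time, nodes, features)
        self.horizon = horizon
        self.starts = np.arange(data.shape[0] - 2 * horizon + 1)

    def __len__(self):
        return len(self.starts)

    def batch(self, idx):
        """Build the snapshots for the requested windows on demand."""
        offsets = np.arange(self.horizon)
        # (batch, horizon) matrix of time steps for the input windows
        x_idx = self.starts[idx][:, None] + offsets
        x = self.data[x_idx]                 # (batch, horizon, nodes, features)
        y = self.data[x_idx + self.horizon]  # targets follow the inputs
        return x, y


def shard_starts(num_windows, rank, world_size):
    """Assumed scheme: each rank takes a strided slice of window indices."""
    return np.arange(num_windows)[rank::world_size]


# Toy series: 20 time steps, 3 nodes, 2 features per node.
data = np.arange(20 * 3 * 2, dtype=np.float64).reshape(20, 3, 2)
horizon = 4

x_full, y_full = materialized_windows(data, horizon)
batcher = IndexBatcher(data, horizon)
picked = np.array([0, 5, 12])
x_batch, y_batch = batcher.batch(picked)
```

In this toy setup the index-based batches match the pre-materialized windows, while the only persistent storage is the original array plus one integer per window. The memory saving grows with the horizon, since the baseline stores each time step roughly `2 * horizon` times.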
