TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Inductive Representation Learning on Large Graphs

Inductive Representation Learning on Large Graphs

William L. Hamilton, Rex Ying, Jure Leskovec

2017-06-07NeurIPS 2017 12Node Classification on Non-Homophilic (Heterophilic) GraphsRepresentation LearningGraph RegressionGraph ClassificationNode ClassificationLink Property PredictionNode Property PredictionLink Prediction
PaperPDFCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCode(official)CodeCodeCodeCode

Abstract

Low-dimensional embeddings of nodes in large graphs have proved extremely useful in a variety of prediction tasks, from content recommendation to identifying protein functions. However, most existing approaches require that all nodes in the graph are present during training of the embeddings; these previous approaches are inherently transductive and do not naturally generalize to unseen nodes. Here we present GraphSAGE, a general, inductive framework that leverages node feature information (e.g., text attributes) to efficiently generate node embeddings for previously unseen data. Instead of training individual embeddings for each node, we learn a function that generates embeddings by sampling and aggregating features from a node's local neighborhood. Our algorithm outperforms strong baselines on three inductive node-classification benchmarks: we classify the category of unseen nodes in evolving information graphs based on citation and Reddit post data, and we show that our algorithm generalizes to completely unseen graphs using a multi-graph dataset of protein-protein interactions.

Results

TaskDatasetMetricValueModel
Graph RegressionZINC-500kMAE0.398GraphSage
Graph ClassificationCIFAR10 100kAccuracy (%)66.08GraphSage
Node ClassificationFacebookAccuracy38.9GraphSAGE (Hamilton et al., [2017a])
Node ClassificationBrazil Air-TrafficAccuracy0.404GraphSAGE (Hamilton et al., [2017a])
Node ClassificationPPIF161.2GraphSAGE
Node ClassificationWiki-VoteAccuracy24.5GraphSAGE (Hamilton et al., [2017a])
Node ClassificationCiteSeer with Public Split: fixed 20 nodes per classAccuracy67.2GraphSAGE
Node ClassificationEurope Air-TrafficAccuracy27.2GraphSAGE (Hamilton et al., [2017a])
Node ClassificationFlickrAccuracy0.641GraphSAGE (Hamilton et al., [2017a])
Node ClassificationUSA Air-TrafficAccuracy31.6GraphSAGE (Hamilton et al., [2017a])
Node ClassificationPATTERN 100kAccuracy (%)50.516GraphSage
Link Property Predictionogbl-ddiNumber of params1421057GraphSAGE
Link Property Predictionogbl-citation2Number of params460289Full-batch GraphSAGE
Link Property Predictionogbl-citation2Number of params460289NeighborSampling (SAGE aggr)
Link Property Predictionogbl-collabNumber of params460289GraphSAGE (val as input)
Link Property Predictionogbl-collabNumber of params460289GraphSAGE
Link Property Predictionogbl-collabNumber of params460289GraphSAGE (val as input)
Link Property Predictionogbl-ppaNumber of params424449GraphSAGE
ClassificationCIFAR10 100kAccuracy (%)66.08GraphSage
Node Property Predictionogbn-arxivNumber of params218664GraphSAGE
Node Property Predictionogbn-papers100MNumber of params5755172GraphSAGE_res_incep
Node Property Predictionogbn-productsNumber of params103983GraphSAGE + C&S + node2vec
Node Property Predictionogbn-productsNumber of params206895NeighborSampling (SAGE aggr)
Node Property Predictionogbn-productsNumber of params206895Full-batch GraphSAGE
Node Property Predictionogbn-proteinsNumber of params193136GraphSAGE
Node Property Predictionogbn-magNumber of params154366772NeighborSampling (R-GCN aggr)

Related Papers

Touch in the Wild: Learning Fine-Grained Manipulation with a Portable Visuo-Tactile Gripper2025-07-20Spectral Bellman Method: Unifying Representation and Exploration in RL2025-07-17Boosting Team Modeling through Tempo-Relational Representation Learning2025-07-17Similarity-Guided Diffusion for Contrastive Sequential Recommendation2025-07-16Are encoders able to learn landmarkers for warm-starting of Hyperparameter Optimization?2025-07-16Language-Guided Contrastive Audio-Visual Masked Autoencoder with Automatically Generated Audio-Visual-Text Triplets from Videos2025-07-16A Mixed-Primitive-based Gaussian Splatting Method for Surface Reconstruction2025-07-15Dual Dimensions Geometric Representation Learning Based Document Dewarping2025-07-11