Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Open Graph Benchmark: Datasets for Machine Learning on Graphs

Weihua Hu, Matthias Fey, Marinka Zitnik, Yuxiao Dong, Hongyu Ren, Bowen Liu, Michele Catasta, Jure Leskovec

2020-05-02 | NeurIPS 2020 12 | Knowledge Graphs | Node Property Prediction

Abstract

We present the Open Graph Benchmark (OGB), a diverse set of challenging and realistic benchmark datasets to facilitate scalable, robust, and reproducible graph machine learning (ML) research. OGB datasets are large-scale, encompass multiple important graph ML tasks, and cover a diverse range of domains, ranging from social and information networks to biological networks, molecular graphs, source code ASTs, and knowledge graphs. For each dataset, we provide a unified evaluation protocol using meaningful application-specific data splits and evaluation metrics. In addition to building the datasets, we also perform extensive benchmark experiments for each dataset. Our experiments suggest that OGB datasets present significant challenges of scalability to large-scale graphs and out-of-distribution generalization under realistic data splits, indicating fruitful opportunities for future research. Finally, OGB provides an automated end-to-end graph ML pipeline that simplifies and standardizes the process of graph data loading, experimental setup, and model evaluation. OGB will be regularly updated and welcomes inputs from the community. OGB datasets as well as data loaders, evaluation scripts, baseline code, and leaderboards are publicly available at https://ogb.stanford.edu .
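The abstract's idea of a unified evaluation protocol (a fixed, application-specific split plus a fixed metric) can be illustrated with a small sketch. This is not OGB's API: the `Node` record, the `year` attribute, the cutoff years, the accuracy metric, and the majority-label baseline are all assumptions made up for illustration, and the data is a toy example rather than any OGB dataset.

```python
from collections import Counter
from dataclasses import dataclass
from typing import Callable, Dict, List, Sequence


@dataclass(frozen=True)
class Node:
    """Hypothetical node record; `year` is an assumed attribute used for splitting."""
    node_id: int
    year: int
    label: int


def split_by_year(nodes: Sequence[Node], valid_year: int, test_year: int) -> Dict[str, List[int]]:
    """Assign nodes to train/valid/test by a time cutoff instead of at random."""
    split: Dict[str, List[int]] = {"train": [], "valid": [], "test": []}
    for node in nodes:
        if node.year >= test_year:
            split["test"].append(node.node_id)
        elif node.year >= valid_year:
            split["valid"].append(node.node_id)
        else:
            split["train"].append(node.node_id)
    return split


def accuracy(y_true: Sequence[int], y_pred: Sequence[int]) -> float:
    """Fraction of predictions that match the true labels."""
    if len(y_true) != len(y_pred) or not y_true:
        raise ValueError("y_true and y_pred must be non-empty and of equal length")
    return sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)


def evaluate(nodes: Sequence[Node], split: Dict[str, List[int]],
             predict: Callable[[Node], int]) -> Dict[str, float]:
    """Score one predictor on every part of the split with the same metric."""
    by_id = {node.node_id: node for node in nodes}
    return {
        name: accuracy([by_id[i].label for i in ids], [predict(by_id[i]) for i in ids])
        for name, ids in split.items()
    }


# Toy data (made up): six nodes with a year and a binary label.
nodes = [
    Node(0, 2015, 0), Node(1, 2016, 1), Node(2, 2017, 0),
    Node(3, 2018, 1), Node(4, 2019, 1), Node(5, 2019, 0),
]
split = split_by_year(nodes, valid_year=2018, test_year=2019)

# Baseline: always predict the most common label among the training nodes.
train_labels = [n.label for n in nodes if n.node_id in set(split["train"])]
majority = Counter(train_labels).most_common(1)[0][0]
scores = evaluate(nodes, split, predict=lambda node: majority)
print(split)
print(scores)
```

Because the split and the metric are fixed up front, any two models scored through `evaluate` are compared on the same held-out nodes, which is the property a shared benchmark protocol is meant to guarantee.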

Results

Task                      Dataset          Metric            Value      Model
Link Property Prediction  ogbl-ddi         Number of params  1224193    Matrix Factorization
Link Property Prediction  ogbl-citation2   Number of params  281113505  Matrix Factorization
Link Property Prediction  ogbl-collab      Number of params  60514049   Matrix Factorization
Link Property Prediction  ogbl-ppa         Number of params  147662849  Matrix Factorization
Node Property Prediction  ogbn-arxiv       Number of params  110120     MLP
Node Property Prediction  ogbn-papers100M  Number of params  144044     MLP
Node Property Prediction  ogbn-products    Number of params  103727     MLP
Node Property Prediction  ogbn-proteins    Number of params  96880      MLP
Node Property Prediction  ogbn-mag         Number of params  188509     MLP

Related Papers

SMART: Relation-Aware Learning of Geometric Representations for Knowledge Graphs (2025-07-17)
Topic Modeling and Link-Prediction for Material Property Discovery (2025-07-08)
Graph Collaborative Attention Network for Link Prediction in Knowledge Graphs (2025-07-05)
Context-Driven Knowledge Graph Completion with Semantic-Aware Relational Message Passing (2025-06-29)
Active Inference AI Systems for Scientific Discovery (2025-06-26)
Enhancing LLM Tool Use with High-quality Instruction Data from Knowledge Graph (2025-06-26)
Generating Reliable Adverse event Profiles for Health through Automated Integrated Data (GRAPH-AID): A Semi-Automated Ontology Building Approach (2025-06-25)
Inference Scaled GraphRAG: Improving Multi Hop Question Answering on Knowledge Graphs (2025-06-24)