TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Analyzing Learned Molecular Representations for Property P...

Analyzing Learned Molecular Representations for Property Prediction

Kevin Yang, Kyle Swanson, Wengong Jin, Connor Coley, Philipp Eiden, Hua Gao, Angel Guzman-Perez, Timothy Hopper, Brian Kelley, Miriam Mathea, Andrew Palmer, Volker Settels, Tommi Jaakkola, Klavs Jensen, Regina Barzilay

2019-04-02Molecular Property PredictionPrediction
PaperPDFCode(official)CodeCodeCode

Abstract

Advancements in neural machinery have led to a wide range of algorithmic solutions for molecular property prediction. Two classes of models in particular have yielded promising results: neural networks applied to computed molecular fingerprints or expert-crafted descriptors, and graph convolutional neural networks that construct a learned molecular representation by operating on the graph structure of the molecule. However, recent literature has yet to clearly determine which of these two methods is superior when generalizing to new chemical space. Furthermore, prior research has rarely examined these new models in industry research settings in comparison to existing employed models. In this paper, we benchmark models extensively on 19 public and 16 proprietary industrial datasets spanning a wide variety of chemical endpoints. In addition, we introduce a graph convolutional model that consistently matches or outperforms models using fixed molecular descriptors as well as previous graph neural architectures on both public and proprietary datasets. Our empirical findings indicate that while approaches based on these representations have yet to reach the level of experimental reproducibility, our proposed model nevertheless offers significant improvements over models currently used in industrial workflows.

Results

TaskDatasetMetricValueModel
Molecular Property PredictionFreeSolvRMSE2.082D-MPNN
Molecular Property PredictionclintoxROC-AUC90.6D-MPNN
Molecular Property PredictionToxCastROC-AUC65.5D-MPNN
Molecular Property PredictionLipophilicityRMSE0.683D-MPNN
Molecular Property PredictionQM7MAE103.5D-MPNN
Molecular Property PredictionBBBPROC-AUC71D-MPNN
Molecular Property PredictionQM9MAE0.00814D-MPNN
Molecular Property PredictionQM8MAE0.019D-MPNN
Molecular Property PredictionSIDERROC-AUC57D-MPNN
Molecular Property PredictionTox21ROC-AUC75.9D-MPNN
Molecular Property PredictionBACEROC-AUC80.9D-MPNN
Molecular Property PredictionESOLRMSE1.05D-MPNN
Atomistic DescriptionFreeSolvRMSE2.082D-MPNN
Atomistic DescriptionclintoxROC-AUC90.6D-MPNN
Atomistic DescriptionToxCastROC-AUC65.5D-MPNN
Atomistic DescriptionLipophilicityRMSE0.683D-MPNN
Atomistic DescriptionQM7MAE103.5D-MPNN
Atomistic DescriptionBBBPROC-AUC71D-MPNN
Atomistic DescriptionQM9MAE0.00814D-MPNN
Atomistic DescriptionQM8MAE0.019D-MPNN
Atomistic DescriptionSIDERROC-AUC57D-MPNN
Atomistic DescriptionTox21ROC-AUC75.9D-MPNN
Atomistic DescriptionBACEROC-AUC80.9D-MPNN
Atomistic DescriptionESOLRMSE1.05D-MPNN

Related Papers

Multi-Strategy Improved Snake Optimizer Accelerated CNN-LSTM-Attention-Adaboost for Trajectory Prediction2025-07-21Generative Click-through Rate Prediction with Applications to Search Advertising2025-07-15Conformation-Aware Structure Prediction of Antigen-Recognizing Immune Proteins2025-07-11Foundation models for time series forecasting: Application in conformal prediction2025-07-09Predicting Graph Structure via Adapted Flux Balance Analysis2025-07-08Speech Quality Assessment Model Based on Mixture of Experts: System-Level Performance Enhancement and Utterance-Level Challenge Analysis2025-07-08A Wireless Foundation Model for Multi-Task Prediction2025-07-08Acquiring and Adapting Priors for Novel Tasks via Neural Meta-Architectures2025-07-07