TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Molecular Fingerprints Are Strong Models for Peptide Funct...

Molecular Fingerprints Are Strong Models for Peptide Function Prediction

Jakub Adamczyk, Piotr Ludynia, Wojciech Czech

2025-01-29Graph RegressionGraph Classification
PaperPDFCodeCodeCode(official)Code

Abstract

We study the effectiveness of molecular fingerprints for peptide property prediction and demonstrate that domain-specific feature extraction from molecular graphs can outperform complex and computationally expensive models such as GNNs, pretrained sequence-based transformers and multimodal ensembles, even without hyperparameter tuning. To this end, we perform a thorough evaluation on 126 datasets, achieving state-of-the-art results on LRGB and 5 other peptide function prediction benchmarks. We show that models based on count variants of ECFP, Topological Torsion, and RDKit molecular fingerprints and LightGBM as classification head are remarkably robust. The strong performance of molecular fingerprints, which are intrinsically very short-range feature encoders, challenges the presumed importance of long-range interactions in peptides. Our conclusion is that the use of molecular fingerprints for larger molecules, such as peptides, can be a computationally feasible, low-parameter, and versatile alternative to sophisticated deep learning models.

Results

TaskDatasetMetricValueModel
Graph RegressionPeptides-structMAE0.2432ECFP + LightGBM
Graph RegressionPeptides-structMAE0.2438TT + LightGBM
Graph RegressionPeptides-structMAE0.2459RDKit + LightGBM
Graph ClassificationPeptides-funcAP0.746ECFP + LightGBM
Graph ClassificationPeptides-funcAP0.7318TT + LightGBM
Graph ClassificationPeptides-funcAP0.7311RDKit + LightGBM
ClassificationPeptides-funcAP0.746ECFP + LightGBM
ClassificationPeptides-funcAP0.7318TT + LightGBM
ClassificationPeptides-funcAP0.7311RDKit + LightGBM

Related Papers

Density-aware Walks for Coordinated Campaign Detection2025-06-16Positional Encoding meets Persistent Homology on Graphs2025-06-06Graph Neural Networks for Jamming Source Localization2025-06-01Weisfeiler and Leman Follow the Arrow of Time: Expressive Power of Message Passing in Temporal Event Graphs2025-05-30Improving the Effective Receptive Field of Message-Passing Neural Networks2025-05-29A Benchmark Dataset for Graph Regression with Homogeneous and Multi-Relational Variants2025-05-29Graph Style Transfer for Counterfactual Explainability2025-05-23Scalable Graph Generative Modeling via Substructure Sequences2025-05-22