TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Energy-based View of Retrosynthesis

Energy-based View of Retrosynthesis

Ruoxi Sun, Hanjun Dai, Li Li, Steven Kearnes, Bo Dai

2020-07-14Drug DiscoveryRetrosynthesisSingle-step retrosynthesis
PaperPDF

Abstract

Retrosynthesis -- the process of identifying a set of reactants to synthesize a target molecule -- is of vital importance to material design and drug discovery. Existing machine learning approaches based on language models and graph neural networks have achieved encouraging results. In this paper, we propose a framework that unifies sequence- and graph-based methods as energy-based models (EBMs) with different energy functions. This unified perspective provides critical insights about EBM variants through a comprehensive assessment of performance. Additionally, we present a novel dual variant within the framework that performs consistent training over Bayesian forward- and backward-prediction by constraining the agreement between the two directions. This model improves state-of-the-art performance by 9.6% for template-free approaches where the reaction type is unknown.

Results

TaskDatasetMetricValueModel
Single-step retrosynthesisUSPTO-50kTop-1 accuracy65.7Dual-TF (reaction class as prior)
Single-step retrosynthesisUSPTO-50kTop-10 accuracy85.9Dual-TF (reaction class as prior)
Single-step retrosynthesisUSPTO-50kTop-3 accuracy81.9Dual-TF (reaction class as prior)
Single-step retrosynthesisUSPTO-50kTop-5 accuracy84.7Dual-TF (reaction class as prior)
Single-step retrosynthesisUSPTO-50kTop-1 accuracy53.6Dual-TF (reaction class unknown)
Single-step retrosynthesisUSPTO-50kTop-10 accuracy77Dual-TF (reaction class unknown)
Single-step retrosynthesisUSPTO-50kTop-3 accuracy70.7Dual-TF (reaction class unknown)
Single-step retrosynthesisUSPTO-50kTop-5 accuracy74.6Dual-TF (reaction class unknown)

Related Papers

Assay2Mol: large language model-based drug design using BioAssay context2025-07-16A Graph-in-Graph Learning Framework for Drug-Target Interaction Prediction2025-07-15Graph Learning2025-07-08DeepRetro: Retrosynthetic Pathway Discovery using Iterative LLM Reasoning2025-07-07Exploring Modularity of Agentic Systems for Drug Discovery2025-06-27Diverse Mini-Batch Selection in Reinforcement Learning for Efficient Chemical Exploration in de novo Drug Design2025-06-26Large Language Model Agent for Modular Task Execution in Drug Discovery2025-06-26PocketVina Enables Scalable and Highly Accurate Physically Valid Docking through Multi-Pocket Conditioning2025-06-24