TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/SemiRetro: Semi-template framework boosts deep retrosynthe...

SemiRetro: Semi-template framework boosts deep retrosynthesis prediction

Zhangyang Gao, Cheng Tan, Lirong Wu, Stan Z. Li

2022-02-12PredictionGraph LearningRetrosynthesis
PaperPDF

Abstract

Recently, template-based (TB) and template-free (TF) molecule graph learning methods have shown promising results to retrosynthesis. TB methods are more accurate using pre-encoded reaction templates, and TF methods are more scalable by decomposing retrosynthesis into subproblems, i.e., center identification and synthon completion. To combine both advantages of TB and TF, we suggest breaking a full-template into several semi-templates and embedding them into the two-step TF framework. Since many semi-templates are reduplicative, the template redundancy can be reduced while the essential chemical knowledge is still preserved to facilitate synthon completion. We call our method SemiRetro, introduce a new GNN layer (DRGAT) to enhance center identification, and propose a novel self-correcting module to improve semi-template classification. Experimental results show that SemiRetro significantly outperforms both existing TB and TF methods. In scalability, SemiRetro covers 98.9\% data using 150 semi-templates, while previous template-based GLN requires 11,647 templates to cover 93.3\% data. In top-1 accuracy, SemiRetro exceeds template-free G2G 4.8\% (class known) and 6.0\% (class unknown). Besides, SemiRetro has better training efficiency than existing methods.

Results

TaskDatasetMetricValueModel
Single-step retrosynthesisUSPTO-50kTop-1 accuracy65.8SemiRetro (reaction class as prior)
Single-step retrosynthesisUSPTO-50kTop-10 accuracy92.8SemiRetro (reaction class as prior)
Single-step retrosynthesisUSPTO-50kTop-3 accuracy85.7SemiRetro (reaction class as prior)
Single-step retrosynthesisUSPTO-50kTop-5 accuracy89.8SemiRetro (reaction class as prior)
Single-step retrosynthesisUSPTO-50kTop-1 accuracy54.9SemiRetro (reaction class unknown)
Single-step retrosynthesisUSPTO-50kTop-10 accuracy84.1SemiRetro (reaction class unknown)
Single-step retrosynthesisUSPTO-50kTop-3 accuracy75.3SemiRetro (reaction class unknown)
Single-step retrosynthesisUSPTO-50kTop-5 accuracy80.4SemiRetro (reaction class unknown)

Related Papers

Multi-Strategy Improved Snake Optimizer Accelerated CNN-LSTM-Attention-Adaboost for Trajectory Prediction2025-07-21SGCL: Unifying Self-Supervised and Supervised Learning for Graph Recommendation2025-07-17Generative Click-through Rate Prediction with Applications to Search Advertising2025-07-15A Graph-in-Graph Learning Framework for Drug-Target Interaction Prediction2025-07-15Graph World Model2025-07-14Federated Learning with Graph-Based Aggregation for Traffic Forecasting2025-07-13Conformation-Aware Structure Prediction of Antigen-Recognizing Immune Proteins2025-07-11Foundation models for time series forecasting: Application in conformal prediction2025-07-09