TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/MARS: A Motif-based Autoregressive Model for Retrosynthesi...

MARS: A Motif-based Autoregressive Model for Retrosynthesis Prediction

Jiahan Liu, Chaochao Yan, Yang Yu, Chan Lu, Junzhou Huang, Le Ou-Yang, Peilin Zhao

2022-09-27Drug DiscoveryPredictionRetrosynthesisSingle-step retrosynthesisGraph Generation
PaperPDF

Abstract

Retrosynthesis is a major task for drug discovery. It is formulated as a graph-generating problem by many existing approaches. Specifically, these methods firstly identify the reaction center, and break target molecule accordingly to generate synthons. Reactants are generated by either adding atoms sequentially to synthon graphs or directly adding proper leaving groups. However, both two strategies suffer since adding atoms results in a long prediction sequence which increases generation difficulty, while adding leaving groups can only consider the ones in the training set which results in poor generalization. In this paper, we propose a novel end-to-end graph generation model for retrosynthesis prediction, which sequentially identifies the reaction center, generates the synthons, and adds motifs to the synthons to generate reactants. Since chemically meaningful motifs are bigger than atoms and smaller than leaving groups, our method enjoys lower prediction complexity than adding atoms and better generalization than adding leaving groups. Experiments on a benchmark dataset show that the proposed model significantly outperforms previous state-of-the-art algorithms.

Results

TaskDatasetMetricValueModel
Single-step retrosynthesisUSPTO-50kTop-1 accuracy66.2MARS (reaction class as prior)
Single-step retrosynthesisUSPTO-50kTop-10 accuracy92.9MARS (reaction class as prior)
Single-step retrosynthesisUSPTO-50kTop-3 accuracy85.8MARS (reaction class as prior)
Single-step retrosynthesisUSPTO-50kTop-5 accuracy90.2MARS (reaction class as prior)
Single-step retrosynthesisUSPTO-50kTop-1 accuracy54.6MARS (reaction class unknown)
Single-step retrosynthesisUSPTO-50kTop-10 accuracy88.5MARS (reaction class unknown)
Single-step retrosynthesisUSPTO-50kTop-3 accuracy76.4MARS (reaction class unknown)
Single-step retrosynthesisUSPTO-50kTop-5 accuracy83.3MARS (reaction class unknown)

Related Papers

Multi-Strategy Improved Snake Optimizer Accelerated CNN-LSTM-Attention-Adaboost for Trajectory Prediction2025-07-21NGTM: Substructure-based Neural Graph Topic Model for Interpretable Graph Generation2025-07-17Assay2Mol: large language model-based drug design using BioAssay context2025-07-16A Graph-in-Graph Learning Framework for Drug-Target Interaction Prediction2025-07-15Generative Click-through Rate Prediction with Applications to Search Advertising2025-07-15Conformation-Aware Structure Prediction of Antigen-Recognizing Immune Proteins2025-07-11GNN-CNN: An Efficient Hybrid Model of Convolutional and Graph Neural Networks for Text Representation2025-07-10Foundation models for time series forecasting: Application in conformal prediction2025-07-09