Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.


Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.


A Self-feedback Knowledge Elicitation Approach for Chemical Reaction Predictions

PengFei Liu, Jun Tao, Zhixiang Ren

2024-04-15 · Drug Discovery · Retrosynthesis · Large Language Model · Chemical Reaction Prediction · Language Modelling

Paper · PDF · Code (official)

Abstract

The task of chemical reaction prediction (CRP) plays a pivotal role in advancing drug discovery and materials science. Its effectiveness, however, is constrained by the vast and uncertain chemical reaction space and by the difficulty of capturing reaction selectivity, largely because existing methods fail to exploit the knowledge inherent in the data. To address these challenges, we introduce a data-curated self-feedback knowledge elicitation approach. The method begins with iterative optimization of molecular representations, which facilitates the extraction of knowledge about chemical reaction types (RTs). We then employ adaptive prompt learning to infuse this prior knowledge into a large language model (LLM). As a result, we achieve significant gains: a 14.2% increase in retrosynthesis prediction accuracy, a 74.2% increase in reagent prediction accuracy, and an expanded capability for handling multi-task chemical reactions. This work offers a novel paradigm for knowledge elicitation in scientific research and showcases the untapped potential of LLMs in CRP.
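The "Morgan FTS" metric reported below is the Tanimoto (Jaccard) similarity between Morgan circular fingerprints of the predicted and reference molecules. A minimal sketch of the similarity computation, assuming fingerprints are already available as sets of "on" bit indices (real pipelines generate them with a cheminformatics toolkit such as RDKit, which is not used here):

```python
# Tanimoto similarity between two bit-set fingerprints:
# |A ∩ B| / |A ∪ B|. The toy fingerprints below are illustrative only,
# not real Morgan fingerprints.

def tanimoto(fp_a: set, fp_b: set) -> float:
    """Tanimoto (Jaccard) similarity of two fingerprint bit sets."""
    if not fp_a and not fp_b:
        return 1.0  # two empty fingerprints are trivially identical
    return len(fp_a & fp_b) / len(fp_a | fp_b)

# Hypothetical predicted vs. reference fingerprints (bit indices).
pred = {1, 4, 9, 16, 25}
ref = {1, 4, 9, 16, 36}
print(round(tanimoto(pred, ref), 3))  # 4 shared bits / 6 total -> 0.667
```

A score of 1.0 therefore means the two fingerprints are identical, which for Morgan fingerprints usually (though not always) indicates the same molecule.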

Results

| Task                         | Dataset         | Metric     | Value | Model   |
|------------------------------|-----------------|------------|-------|---------|
| Chemical Reaction Prediction | Mol-Instruction | Exact      | 0.674 | SLM4CRP |
| Chemical Reaction Prediction | Mol-Instruction | METEOR     | 0.901 | SLM4CRP |
| Chemical Reaction Prediction | Mol-Instruction | Morgan FTS | 0.854 | SLM4CRP |
| Chemical Reaction Prediction | Mol-Instruction | Validity   | 0.998 | SLM4CRP |
| Forward Reaction Prediction  | Mol-Instruction | Exact      | 0.945 | SLM4CRP |
| Forward Reaction Prediction  | Mol-Instruction | METEOR     | 0.993 | SLM4CRP |
| Forward Reaction Prediction  | Mol-Instruction | Morgan FTS | 0.986 | SLM4CRP |
| Forward Reaction Prediction  | Mol-Instruction | Validity   | 0.997 | SLM4CRP |
| Reagent Prediction           | Mol-Instruction | Exact      | 0.284 | SLM4CRP |
| Reagent Prediction           | Mol-Instruction | METEOR     | 0.744 | SLM4CRP |
| Reagent Prediction           | Mol-Instruction | Morgan FTS | 0.649 | SLM4CRP |
| Reagent Prediction           | Mol-Instruction | Validity   | 1.000 | SLM4CRP |
| Retrosynthesis               | Mol-Instruction | Exact      | 0.757 | SLM4CRP |
| Retrosynthesis               | Mol-Instruction | METEOR     | 0.950 | SLM4CRP |
| Retrosynthesis               | Mol-Instruction | Morgan FTS | 0.905 | SLM4CRP |
| Retrosynthesis               | Mol-Instruction | Validity   | 0.994 | SLM4CRP |
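The "Exact" metric above is the fraction of predictions whose output string matches the reference exactly. A minimal sketch of that computation; note that in practice both sides are first canonicalized (e.g. SMILES canonicalization via a toolkit like RDKit) so that chemically identical strings compare equal — that preprocessing step is assumed and omitted here:

```python
# Exact-match accuracy over paired prediction/reference strings.
# The SMILES strings below are toy examples, not model outputs.

def exact_match(preds, refs) -> float:
    """Fraction of predictions identical to their reference string."""
    if len(preds) != len(refs):
        raise ValueError("prediction/reference lists must align")
    hits = sum(p == r for p, r in zip(preds, refs))
    return hits / len(refs)

preds = ["CCO", "c1ccccc1", "CC(=O)O", "CCN"]
refs = ["CCO", "c1ccccc1", "CC(=O)OC", "CCN"]
print(exact_match(preds, refs))  # 3 of 4 match -> 0.75
```

Exact match is the strictest of the four metrics, which is why it sits well below METEOR and Morgan FTS for the harder tasks (e.g. 0.284 vs. 0.744 for reagent prediction).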

Related Papers

Visual-Language Model Knowledge Distillation Method for Image Quality Assessment (2025-07-21)
DENSE: Longitudinal Progress Note Generation with Temporal Modeling of Heterogeneous Clinical Notes Across Hospital Visits (2025-07-18)
GeoReg: Weight-Constrained Few-Shot Regression for Socio-Economic Estimation using LLM (2025-07-17)
The Generative Energy Arena (GEA): Incorporating Energy Awareness in Large Language Model (LLM) Human Evaluations (2025-07-17)
Inverse Reinforcement Learning Meets Large Language Model Post-Training: Basics, Advances, and Opportunities (2025-07-17)
Rethinking the Embodied Gap in Vision-and-Language Navigation: A Holistic Study of Physical and Visual Disparities (2025-07-17)
Making Language Model a Hierarchical Classifier and Generator (2025-07-17)
VisionThink: Smart and Efficient Vision Language Model via Reinforcement Learning (2025-07-17)