GIT-Mol: A Multi-modal Large Language Model for Molecular Science with Graph, Image, and Text

PengFei Liu, Yiming Ren, Jun Tao, Zhixiang Ren

2023-08-14Drug Discovery Image Captioning Large Language Model Text-based de novo Molecule Generation Language Modelling Molecule Captioning

Paper PDF Code(official)

Abstract

Large language models have made significant strides in natural language processing, enabling innovative applications in molecular science by processing textual representations of molecules. However, most existing language models cannot capture the rich information with complex molecular structures or images. In this paper, we introduce GIT-Mol, a multi-modal large language model that integrates the Graph, Image, and Text information. To facilitate the integration of multi-modal molecular data, we propose GIT-Former, a novel architecture that is capable of aligning all modalities into a unified latent space. We achieve a 5%-10% accuracy increase in properties prediction and a 20.2% boost in molecule generation validity compared to the baselines. With the any-to-language molecular translation strategy, our model has the potential to perform more downstream tasks, such as compound name recognition and chemical reaction prediction.

Results

Task	Dataset	Metric	Value	Model
Drug Discovery	clintox	AUC	0.883	GIT-Mol(G+S)
Drug Discovery	BACE	AUC	0.8108	GIT-Mol(G+S)
Drug Discovery	Tox21	AUC	0.759	GIT-Mol(G+S)
Drug Discovery	BBBP	AUC	0.739	GIT-Mol(G+S)
Drug Discovery	ToxCast	AUC	0.668	GIT-Mol(G+S)
Drug Discovery	SIDER	AUC	0.634	GIT-Mol(G+S)
Drug Discovery	ChEBI-20	BLEU	75.6	GIT-Mol-caption
Drug Discovery	ChEBI-20	Exact Match	5.1	GIT-Mol-caption
Drug Discovery	ChEBI-20	Levenshtein	26.315	GIT-Mol-caption
Drug Discovery	ChEBI-20	MACCS FTS	73.8	GIT-Mol-caption
Drug Discovery	ChEBI-20	Morgan FTS	51.9	GIT-Mol-caption
Drug Discovery	ChEBI-20	RDK FTS	58.2	GIT-Mol-caption
Drug Discovery	ChEBI-20	Validity	92.8	GIT-Mol-caption
Image Captioning	ChEBI-20	BLEU	0.924	GIT-Mol
Image Captioning	ChEBI-20	Exact	0.461	GIT-Mol
Image Captioning	ChEBI-20	Levenshtein	6.575	GIT-Mol
Image Captioning	ChEBI-20	MACCS FTS	0.962	GIT-Mol
Image Captioning	ChEBI-20	Morgan FTS	0.894	GIT-Mol
Image Captioning	ChEBI-20	RDK FTS	0.906	GIT-Mol
Image Captioning	ChEBI-20	Validity	0.899	GIT-Mol
Text-based de novo Molecule Generation	ChEBI-20	BLEU	75.6	GIT-Mol-caption
Text-based de novo Molecule Generation	ChEBI-20	Exact Match	5.1	GIT-Mol-caption
Text-based de novo Molecule Generation	ChEBI-20	Levenshtein	26.315	GIT-Mol-caption
Text-based de novo Molecule Generation	ChEBI-20	MACCS FTS	73.8	GIT-Mol-caption
Text-based de novo Molecule Generation	ChEBI-20	Morgan FTS	51.9	GIT-Mol-caption
Text-based de novo Molecule Generation	ChEBI-20	RDK FTS	58.2	GIT-Mol-caption
Text-based de novo Molecule Generation	ChEBI-20	Validity	92.8	GIT-Mol-caption

GIT-Mol: A Multi-modal Large Language Model for Molecular Science with Graph, Image, and Text

Abstract

Results

Related Papers

GIT-Mol: A Multi-modal Large Language Model for Molecular Science with Graph, Image, and Text

Abstract

Results

Related Papers