Jinho Chang, Jong Chul Ye
With the emergence of diffusion models as the frontline of generative models, many researchers have proposed molecule generation techniques with conditional diffusion models. However, the unavoidable discreteness of a molecule makes it difficult for a diffusion model to connect raw data with highly complex conditions like natural language. To address this, we present a novel latent diffusion model dubbed LDMol for text-conditioned molecule generation. LDMol comprises a molecule autoencoder that produces a learnable and structurally informative feature space, and a natural language-conditioned latent diffusion model. In particular, recognizing that multiple SMILES notations can represent the same molecule, we employ a contrastive learning strategy to extract feature space that is aware of the unique characteristics of the molecule structure. LDMol outperforms the existing baselines on the text-to-molecule generation benchmark, suggesting a potential for diffusion models can outperform autoregressive models in text data generation with a better choice of the latent domain. Furthermore, we show that LDMol can be applied to downstream tasks such as molecule-to-text retrieval and text-guided molecule editing, demonstrating its versatility as a diffusion model.
| Task | Dataset | Metric | Value | Model |
|---|---|---|---|---|
| Drug Discovery | ChEBI-20 | BLEU | 92.6 | LDMol |
| Drug Discovery | ChEBI-20 | Exact Match | 53.3 | LDMol |
| Drug Discovery | ChEBI-20 | Frechet ChemNet Distance (FCD) | 0.2 | LDMol |
| Drug Discovery | ChEBI-20 | Levenshtein | 6.75 | LDMol |
| Drug Discovery | ChEBI-20 | MACCS FTS | 97.3 | LDMol |
| Drug Discovery | ChEBI-20 | Morgan FTS | 93.1 | LDMol |
| Drug Discovery | ChEBI-20 | RDK FTS | 95 | LDMol |
| Drug Discovery | ChEBI-20 | Validity | 94.1 | LDMol |
| Text-based de novo Molecule Generation | ChEBI-20 | BLEU | 92.6 | LDMol |
| Text-based de novo Molecule Generation | ChEBI-20 | Exact Match | 53.3 | LDMol |
| Text-based de novo Molecule Generation | ChEBI-20 | Frechet ChemNet Distance (FCD) | 0.2 | LDMol |
| Text-based de novo Molecule Generation | ChEBI-20 | Levenshtein | 6.75 | LDMol |
| Text-based de novo Molecule Generation | ChEBI-20 | MACCS FTS | 97.3 | LDMol |
| Text-based de novo Molecule Generation | ChEBI-20 | Morgan FTS | 93.1 | LDMol |
| Text-based de novo Molecule Generation | ChEBI-20 | RDK FTS | 95 | LDMol |
| Text-based de novo Molecule Generation | ChEBI-20 | Validity | 94.1 | LDMol |