approved_drug_target

Approved Drug SMILES and Protein Sequence Dataset

Textscc-by-nc-4.0Introduced 2025-04-18

This dataset provides a curated collection of approved drug Simplified Molecular Input Line Entry System (SMILES) strings and their associated protein sequences. Each small molecule has been approved by at least one regulatory body, ensuring the safety and relevance of the data for computational applications. The dataset includes 1,660 approved small molecules and their 2,093 related protein targets.