a. Drug Discovery and Design Center, State Key Laboratory of Drug Research, Shanghai Institute of Materia Medica Chinese Academy of Sciences, Shanghai, 201203, China;
b. University of Chinese Academy of Sciences, Beijing, 100049, China;
c. Nanjing University of Chinese Medicine, Nanjing, 210023, China;
d. Lingang Laboratory, Shanghai, 200031, China;
e. School of Information Science and Technology, ShanghaiTech University, Shanghai, 201210, China;
f. Shanghai Institute for Advanced Immunochemical Studies and School of Life Science and Technology, ShanghaiTech University, Shanghai, 201210, China;
g. School of Pharmaceutical Science and Technology, Hangzhou Institute for Advanced Study, Hangzhou, 330106, China;
h. College of Computer and Information Engineering, Dezhou University, Dezhou, Shandong, 253023, China;
i. College of Agriculture and Biological Science, Dali University, Dali, Yunnan, 671003, China
Funds:
This work was supported by National Key Research and Development Program of China (2022YFC3400504 to M.Y.Z. and 2023YFC2305904 to M.Y.Z.), the Strategic Priority Research Program of the Chinese Academy of sciences (XDB0830203 to X.T.L. and XDB0830200 to M.Y.Z.), National Natural Science Foundation of China (82204278 to X.T.L., 31960198 to L.Q.Y., T2225002 to M.Y.Z. and 82273855 to M.Y.Z.), SIMM-SHUTCM Traditional Chinese Medicine Innovation Joint Research Program (E2G805H to M.Y.Z.), Shanghai Municipal Science and Technology Major Project, and Key Technologies R&
D Program of Guangdong Province (2023B1111030004 to M.Y.Z.). We also acknowledge Shanghai Supercomputer Center for providing computing resources.
Kirsten rat sarcoma viral oncogene homolog (KRAS) protein inhibitors are a promising class of therapeutics, but research on molecules that effectively penetrate the blood-brain barrier (BBB) remains limited, which is crucial for treating central nervous system (CNS) malignancies. Although molecular generation models have recently advanced drug discovery, they often overlook the complexity of biological and chemical factors, leaving room for improvement. In this study, we present a structure-constrained molecular generation workflow designed to optimize lead compounds for both drug efficacy and drug absorption properties. Our approach utilizes a variational autoencoder (VAE) generative model integrated with reinforcement learning for multi-objective optimization. This method specifically aims to enhance BBB permeability while maintaining high-affinity substructures of KRAS inhibitors. To support this, we incorporate a specialized KRAS BBB predictor based on active learning and an affinity predictor employing comparative learning models. Additionally, we introduce two novel metrics, the knowledge-integrated reproduction score (KIRS) and the composite diversity score (CDS), to assess structural performance and biological relevance. Retrospective validation with KRAS inhibitors, AMG510 and MRTX849, demonstrates the framework’s effectiveness in optimizing BBB permeability and highlights its potential for real-world drug development applications. This study provides a robust framework for accelerating the structural enhancement of lead compounds, advancing the drug development process across diverse targets.