Shanghai Frontiers Science Center of Optogenetic Techniques for Cell Metabolism, Shanghai Key Laboratory of New Drug Design, School of Pharmacy, East China University of Science and Technology, Shanghai, 200237, China
Funds:
This work was supported by the National Key Research and Development Program of China (Grant 2023YFF1204904), the National Natural Science Foundation of China (Grants U23A20530 and 82173746) and Shanghai Frontiers Science Center of Optogenetic Techniques for Cell Metabolism (Shanghai Municipal Education Commission).
pKa significantly influences the ADMET properties of molecules and is a crucial indicator in drug research. Given the rapid and accurate characteristics of computational methods, their role in predicting drug properties is increasingly important. Although many pKa prediction models currently exist, they often focus on enhancing model precision while neglecting interpretability. In this study, we present GraFpKa, a pKa prediction model using graph neural networks (GNNs) and molecular fingerprints. The results show that our acidic and basic models achieved mean absolute errors (MAEs) of 0.621 and 0.402, respectively, on the test set, demonstrating good predictive performance. Notably, to improve interpretability, GraFpKa also incorporates Integrated Gradients (IGs), providing a clearer visual description of the atoms significantly affecting the pKa values. The high reliability and interpretability of GraFpKa ensure accurate pKa predictions while also facilitating a deeper understanding of the relationship between molecular structure and pKa values, making it a valuable tool in the field of pKa prediction.