Sinan Wang, Huiru Xiang, Xinyuan Pan, Jianyang Pan, Lu Zhao, Yi Wang, Shaoqing Cui, Yu Tang. Integrating biogravimetric analysis and machine learning for systematic studies of botanical materials: From bioactive constituent identification to production area prediction[J]. Journal of Pharmaceutical Analysis. doi: 10.1016/j.jpha.2025.101222
Citation:
Sinan Wang, Huiru Xiang, Xinyuan Pan, Jianyang Pan, Lu Zhao, Yi Wang, Shaoqing Cui, Yu Tang. Integrating biogravimetric analysis and machine learning for systematic studies of botanical materials: From bioactive constituent identification to production area prediction[J]. Journal of Pharmaceutical Analysis. doi: 10.1016/j.jpha.2025.101222
Sinan Wang, Huiru Xiang, Xinyuan Pan, Jianyang Pan, Lu Zhao, Yi Wang, Shaoqing Cui, Yu Tang. Integrating biogravimetric analysis and machine learning for systematic studies of botanical materials: From bioactive constituent identification to production area prediction[J]. Journal of Pharmaceutical Analysis. doi: 10.1016/j.jpha.2025.101222
Citation:
Sinan Wang, Huiru Xiang, Xinyuan Pan, Jianyang Pan, Lu Zhao, Yi Wang, Shaoqing Cui, Yu Tang. Integrating biogravimetric analysis and machine learning for systematic studies of botanical materials: From bioactive constituent identification to production area prediction[J]. Journal of Pharmaceutical Analysis. doi: 10.1016/j.jpha.2025.101222
Integrating biogravimetric analysis and machine learning for systematic studies of botanical materials: From bioactive constituent identification to production area prediction
a Pharmaceutical Informatics Institute, College of Pharmaceutical Sciences, Zhejiang University, Hangzhou 310058, China;
b Innovation Institute for Artificial Intelligence in Medicine of Zhejiang University, Hangzhou 310018, China;
c National Key Laboratory of Chinese Medicine Modernization, Innovation Center of Yangtze River Delta, Zhejiang University, Jiaxing 314100, China;
d Department of Biosystems Engineering and Food Science, Zhejiang University, 866 Yuhangtang Road, Hangzhou 310058, China
Funds:
This work was supported by the Project of the National Key Research and Development Program of China (2022YFC3501802), NSFC (82204614), and Zhejiang Provincial Basic Public Welfare Research Program (LTGC24H280002). The authors wish to thank Xiao Li and Prof. Tianxiang Li (Tianjin University of Traditional Chinese Medicine) for collecting and identifying all the CR samples. We thank Yingniang Li from the Core Facilities, Zhejiang University School of Medicine for their technical support.
In general, bioassay-guided fractionation and isolation of bioactive constituents from botanical materials frequently ended up with the reward of a single compound. However, botanical materials typically exert their therapeutic actions through multi- pathway effects due to the intrinsic complex nature of chemical constituents. In addition, the content of bioactive compounds in botanical materials is largely dependent on humidity, temperature, soil, especially geographical origins, from which rapid and accurate identification of plant materials is pressingly needed. These long-standing obstacles collectively impede the deep exploitation and application of these versatile natural sources. To address the challenges, a new paradigm integrating biogravimetric analyses and machine learning-driven origin classification (BAMLOC) was developed. The biogravimetric analyses are based on absolute qHNMR quantification and in vivo zebrafish model-assisted activity index calculation, by which bioactive substance groups jointly responsible for the bioactivities in all fractions are pinpointed before any isolation effort. To differentiate origin-different botanical materials varying in the content of bioactive substance groups, principal component analysis, linear discriminant analysis, and hierarchical cluster analysis in conjunction with supervised support vector machine are employed to classify and predict production areas based on the detection of volatile organic compounds by E-nose and GC-MS. Expanding BAMLOC to Codonopsis Radix enables the identification of polyacetylenes and pyrrolidine alkaloids as the bioactive substance group for immune restoration effect and accurately determines the origins of plants. This study advances the toolbox for the discovery of bioactive compounds from complex mixtures and lays a more definitive foundation for the in-depth utilization of botanical materials.