1 Shanghai Engineering Research Center of Molecular Therapeutics and New Drug Development, Shanghai Frontiers Science Center of Molecule Intelligent Syntheses, School of Chemistry and Molecular Engineering, East China Normal University, Shanghai, 200062, China;
2 School of Computer Science and Engineering, Central South University, Changsha, Hunan 410083, China;
3 School of Science, Department of Basic Medicine and Clinical Pharmacy, State Key Laboratory of Natural Medicines, China Pharmaceutical University, Nanjing, 210009, China;
4 School of Physics and Electronics, Shandong Normal University, Jinan, 250014, China;
5 Innovation Center for Artificial Intelligence and Drug Discovery, East China Normal University, Shanghai, 200062, China;
6 Lingang Laboratory, Shanghai, 200062, China;
7 Chongqing Key Laboratory of Precision Optics, Chongqing Institute of East China Normal University, Chongqing, 401120, China;
8 New York University-East China Normal University Center for Computational Chemistry, New York University Shanghai, Shanghai, 200062, China
Funds:
This work was supported by the National Natural Science Foundation of China (Grant Nos. 22273023, 12474285 and 22373116), the National Key R&D Program of China (Grant No. 2019YFA0905200), Shanghai Municipal Natural Science Foundation (Grant No. 23ZR1418200), Natural Science Foundation of Chongqing, China (Grant No. CSTB2023NSCQ-MSX0616), Shanghai Frontiers Science Center of Molecule Intelligent Syntheses, Shanghai Future Discipline Program (Quantum Science and Technology), Shanghai Municipal Education Commission's "Artificial Intelligence-Driven Research Paradigm Reform and Discipline Advancement Program" and the Fundamental Research Funds for the Central Universities. We also thank the Supercomputer Center of East China Normal University (ECNU Multifunctional Platform for Innovation 001) for providing computer resources.
The identification and optimization of mutations in nanobodies are crucial for enhancing their therapeutic potential in disease prevention and control. However, this process is often complex and time-consuming, which limit its widespread application in practice. In this study, we developed a workflow, named Evolutionary-Nanobody (EvoNB), to predict key mutation sites of nanobodies by combining protein language models (PLMs) and molecular dynamic (MD) simulations. By fine-tuning the ESM2 model on a large-scale nanobody dataset, the ability of EvoNB to capture specific sequence features of nanobodies was significantly enhanced. The fine-tuned EvoNB model demonstrated higher predictive accuracy in the conserved framework and highly variable complementarity-determining regions of nanobodies. Additionally, we selected four widely representative nanobody–antigen complexes to verify the predicted effects of mutations. MD simulations analyzed the energy changes caused by these mutations to predict their impact on binding affinity to the targets. The results showed that multiple mutations screened by EvoNB significantly enhanced the binding affinity between nanobody and its target, further validating the potential of this workflow for designing and optimizing nanobody mutations. Additionally, sequence-based predictions are generally less dependent on structural absence, allowing them to be more easily integrated with tools for structural predictions, such as AlphaFold 3. Through mutation prediction and systematic analysis of key sites, we can quickly predict the most promising variants for experimental validation without relying on traditional evolutionary or selection processes. The EvoNB workflow provides an effective tool for the rapid optimization of nanobodies and facilitates the application of PLMs in the biomedical field.