Abstract
Drug de novo design has developed rapidly and innovatively in recent years. In this way, novel drug structures can be designed, which has made outstanding contributions to the field of drug design. Improving the quality of generated drugs is the unifying goal of most published experimental papers, and the sheer volume of data provides a solid foundation for achieving this goal. However, as the amount of drug data becomes larger and larger, we have to consider the time cost of such experiments. Correspondingly, when we are studying a small number of drugs with specific properties, the fitting ability of the model becomes critical. As a result, some models perform wildly differently on different datasets. In view of SMILES (Simplified molecular input line entry system) experiment data of different data sets, this paper proposes a “Unified Multilayer SRU De novo drug design acceleration Model” (USD) based on multi-layer Simple Recurrent Unit (SRU). Aiming at the problem that the amount of data in deep learning has a greater impact on the effect of experimental training, this experiment trains the SMILES data in the DugBank database (small data amount) and ChEMBL (big data amount), and finally generates novel drug molecular data. Both the efficiency of molecular generation and the time cost of model training and data generation have been greatly improved. A series of comparative experiments have proved that USD has a good balance ability between drug generation quality and time cost, which proves that this new research direction has certain experimental and reference value.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Mouchlis, V.D., Melagraki, G., Zacharia, L.C., Afantitis, A., et al.: Computer-aided drug design of β-secretase, γ-secretase and anti-tau inhibitors for the discovery of novel Alzheimer’s therapeutics. Int. J. Mol. Sci. 21, 703 (2020). Author, F., Author, S.: Title of a proceedings paper. In: Editor, F., Editor, S. (eds.) CONFERENCE 2016, LNCS, vol. 9999, pp. 1–13. Springer, Heidelberg (2016)
Schneider, P., Schneider, G.: De novo design at the edge of Chaos. J. Med. Chem. 59, 4077–4086 (2016)
Mouchlis, V.D., Afantitis, A., Serra, A., et al.: Advances in de novo drug design: from conventional to machine learning methods. Int. J. Mol. Sci. 22, 1676 (2021)
Marcus, O., Thomas, B., Ola, E., Hongming, C.: Molecular de-novo design through deep reinforcement learning. J. Cheminform. 48 (2017)
Segler, M.H.S., Kogej, T., Tyrchan, C., Waller, M.P.: Generating focused molecule libraries for drug discovery with recurrent neural networks. ACS Central Sci. 4(1), 120–131 (2017)
Daniel, M., Lukas, F., Francesca, G., Gisbert, S.: De novo design of bioactive small molecules by artificial intelligence. Mol. Inform. 37 (2018)
Gupta, A., Müller, A.T., Huisman, B.J.H., Fuchs, J.A., Schneider, P., Schneider, G.: Generative recurrent networks for de novo drug design. Mol. Inform. 37 (2018)
Niclas, S., Göran, F., Alexander, K., Gunnar, M., Jonas, B.: Deep reinforcement learning for multiparameter optimization in de novo drug design. J. Chem. Inf. Model. 59, 3166–3176 (2019)
Prykhodko, O., et al.: A de novo molecular generation method using latent vector based generative adversarial network. J. Cheminform. 74 (2019)
Zheng, S., Yan, X., Gu, Q., et al.: QBMG: quasi-biogenic molecule generator with deep recurrent neural network. J. Cheminform. 11 (2019)
Li, Y., Hu, J., Wang, Y., et al.: DeepScaffold: a comprehensive tool for scaffold-based de novo drug discovery using deep learning. J. Chem. Inf. Model. 60, 77–91 (2019)
Yasonik, J.: Multiobjective de novo drug design with recurrent neural networks and nondominated sorting. J. Cheminform. 12 (2020)
Tao, L., Yu, Z.,Sida, I.W., Hui, D., Yoav, A.: Simple recurrent units for highly parallelizable recurrence. In: Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing (2018)
Jhoti, H., Williams, G., Rees, D.C., Murray, C.W.: The ‘rule of three’ for fragment-based drug discovery: where are we now? Nat. Rev. Drug Discov. 12 (2013)
Deb, K., Pratap, A., Agarwal, S., Meyarivan, T.: A fast and elitist multiobjective genetic algorithm: NSGA-II. IEEE Trans. Evol. Comput. 6, 182–197 (2002)
Fonseca, C.M., Fleming, P.J.: Genetic algorithms for multiobjective optimization: formulation discussion and generalization. In: Proceedings of the 5th International Conference on Genetic Algorithms, Urbana-Champaign, IL, USA, June 1993 (1993)
RDKit: Open-Source Cheminformatics. https://www.rdkit.org
ChEMBL: A manually curated database of bioactive molecules with drug-like properties. https://www.ebi.ac.uk/chembl/
PubChem: The world’s largest collection of freely accessible chemical information. https://pubchem.ncbi.nlm.nih.gov/
BindingDB: A public, web-accessible database. https://www.bindingdb.org/bind/index.jsp.
Wishart, D.S., et al.: DrugBank 5.0: a major update to the DrugBank database for 2018. Nucl. Acids Res. (2017). https://doi.org/10.1093/nar/gkx1037
Morris, G.M., et al.: Autodock4 and AutoDockTools4: automated docking with selective receptor flexibility. J. Comput. Chem. 16, 2785–2791 (2009)
Acknowledgment
This work is supported by the National Natural Science Foundation of China (No.61972299). The authors thank the members of Machine Learning and Artificial Intelligence Laboratory, School of Computer Science and Technology, Wuhan University of Science and Technology, for their helpful discussion within seminars.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2023 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Li, Z., Hu, J., Zhang, X. (2023). De Novo Drug Design Using Unified Multilayer Simple Recurrent Unit Model. In: Huang, DS., Premaratne, P., **, B., Qu, B., Jo, KH., Hussain, A. (eds) Advanced Intelligent Computing Technology and Applications. ICIC 2023. Lecture Notes in Computer Science, vol 14088. Springer, Singapore. https://doi.org/10.1007/978-981-99-4749-2_54
Download citation
DOI: https://doi.org/10.1007/978-981-99-4749-2_54
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-99-4748-5
Online ISBN: 978-981-99-4749-2
eBook Packages: Computer ScienceComputer Science (R0)