Abstract
In recent years, the ability of feature extraction of deep learning models has increased. This ability is used to extract high-level features from minimum pre-processing. We have used Convolutional Neural Networks (CNN), Convolutional Neural Networks Long Short-Term Memory (CNN-LSTM), and Convolutional Neural Networks-Bidirectional Long Short-Term Memory (CNN-Bi-LSTM) for performing the classification of DNA sequences. While doing the classification, we considered the sequences as text data. To represent sequences as input, we used one-hot vectors. The needed position data of nucleotides is preserved by doing this. This project has used the DNA-binding protein sequence dataset, and various metrics are used to evaluate the models. The conclusion we came to after overviewing the results is that CNN-Bidirectional LSTM showed high accuracy of 99.5%
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Schmidt MF (2022) DNA: Blueprint of the proteins. In: Chemical biology. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-662-64412-6
Bailey J (2022) Nucleosides, nucleotides, polynucleotides (RNA and DNA) and the genetic code. In: Inventive geniuses who changed the world. Springer, Cham. https://doi.org/10.1007/978-3-030-81381-9
Aslan MF, Unlersen MF, Sabanci K, Durdu A (2021) CNN based transfer learning-BiLSTM network: a novel approach for COVID-19 infection detection. Appl Soft Comput 98:106912
Zhang YQ, Ji S, Li S, Yizhou (2020) DeepSite: bidirectional LSTM and CNN models for predicting DNA–protein binding. Int J Mach Learn Cybern 11. https://doi.org/10.1007/s13042-01900990-x
Gunasekaran H, Ramalakshmi K, Rex Macedo Arokiaraj A, Deepa Kanmani S, Venkatesan C, Suresh Gnana Dhas C (2021) Analysis of DNA sequence classification using CNN and hybrid models. Comput Math Methods Med 2021:1835056. PMID: 34306171; PMCID: PMC8285202. https://doi.org/10.1155/2021/1835056
Shadab S, Alam Khan MT, Neezi NA, Adilina S, Shatabda S (2020) DeepDBP: deep neural networks for identification of DNA-binding proteins. Inf Med Unlocked 19:100318
Trabelsi A, Chaabane M, Ben-Hur A (2019) Comprehensive evaluation of deep learning architectures for prediction of DNA/RNA sequence binding specificities. Bioinformatics 35(14):i269–i277. https://doi.org/10.1093/bioinformatics/btz339
Mohammed A, Mahmoud B, Guo P. DNA sequence classification based on MLP with PILAE algorithm
Dhiman G, Juneja S, Viriyasitavat W, Mohafez H, Hadizadeh M, Islam MA, El Bayoumy I, Gulati K (2022) A Novel machine-learning-based hybrid CNN model for tumor identification in medical image processing. Sustainability 14:1447. https://doi.org/10.3390/su14031447
Li J, Huang Q, Ren S, Jiang L, Deng B, Qin Y (2023) A novel medical text classification model with Kalman filter for clinical decision making. Comput Methods Programs Biomed 200:105917. https://doi.org/10.1016/j.cmpb.2021.105917
Li G, Du X, Li X, Zou L, Zhang G, Wu Z (2021) Prediction of DNA binding proteins using local features and long-term dependencies with primary sequences based on deep learning. PeerJ 9:e11262. PMID: 33986992; PMCID: PMC8101451. https://doi.org/10.7717/peerj.11262
Tasdelen, A., Sen, B. A hybrid CNN-LSTM model for pre-miRNA classification.Sci Rep 11, 14125 (2021). https://doi.org/10.1038/s41598-021-93656-0
Abraham MA, Srinivasan H, Namboori C, Krishnan (2019) Healthcare security using blockchain for pharmacogenomics. J Int Pharm Res 6:529–533
Nguyen N, Tran V, Ngo D, Phan D, Lumbanraja F, Faisal M, Abapihi B, Kubo M, Satou K (2016) DNA sequence classification by convolutional neural network. J Biomed Sci Eng 9:280–286. https://doi.org/10.4236/jbise.2016.95021
Oviya IR, Spandana C, Krithika S, Priyadharshini AR (2022) Chest X-ray pathology detection using deep learning and transfer learning. In: 2022 IEEE 7th International conference on recent advances and innovations in engineering (ICRAIE), Mangalore, India, pp 25–30. https://doi.org/10.1109/ICRAIE56454.2022.10054329
Hu S, Ma R, Wang H (2019) An improved deep learning method for predicting DNA-binding proteins based on contextual features in amino acid sequences. PLoS ONE 14(11):e0225317. https://doi.org/10.1371/journal.pone.0225317
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2023 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Siva Jyothi Natha Reddy, B., Yadav, S., Venkatakrishnan, R., Oviya, I.R. (2023). Comparison of Deep Learning Approaches for DNA-Binding Protein Classification Using CNN and Hybrid Models. In: Tripathi, A.K., Anand, D., Nagar, A.K. (eds) Proceedings of World Conference on Artificial Intelligence: Advances and Applications. WWCA 1997. Algorithms for Intelligent Systems. Springer, Singapore. https://doi.org/10.1007/978-981-99-5881-8_7
Download citation
DOI: https://doi.org/10.1007/978-981-99-5881-8_7
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-99-5880-1
Online ISBN: 978-981-99-5881-8
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)