Comparison of Deep Learning Approaches for DNA-Binding Protein Classification Using CNN and Hybrid Models

  • Conference paper
  • First Online:
Proceedings of World Conference on Artificial Intelligence: Advances and Applications (WWCA 1997)

Abstract

In recent years, the ability of feature extraction of deep learning models has increased. This ability is used to extract high-level features from minimum pre-processing. We have used Convolutional Neural Networks (CNN), Convolutional Neural Networks Long Short-Term Memory (CNN-LSTM), and Convolutional Neural Networks-Bidirectional Long Short-Term Memory (CNN-Bi-LSTM) for performing the classification of DNA sequences. While doing the classification, we considered the sequences as text data. To represent sequences as input, we used one-hot vectors. The needed position data of nucleotides is preserved by doing this. This project has used the DNA-binding protein sequence dataset, and various metrics are used to evaluate the models. The conclusion we came to after overviewing the results is that CNN-Bidirectional LSTM showed high accuracy of 99.5%

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic
EUR 32.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or Ebook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 219.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Hardcover Book
USD 279.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free ship** worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Similar content being viewed by others

References

  1. Schmidt MF (2022) DNA: Blueprint of the proteins. In: Chemical biology. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-662-64412-6

  2. Bailey J (2022) Nucleosides, nucleotides, polynucleotides (RNA and DNA) and the genetic code. In: Inventive geniuses who changed the world. Springer, Cham. https://doi.org/10.1007/978-3-030-81381-9

  3. Aslan MF, Unlersen MF, Sabanci K, Durdu A (2021) CNN based transfer learning-BiLSTM network: a novel approach for COVID-19 infection detection. Appl Soft Comput 98:106912

    Article  Google Scholar 

  4. Zhang YQ, Ji S, Li S, Yizhou (2020) DeepSite: bidirectional LSTM and CNN models for predicting DNA–protein binding. Int J Mach Learn Cybern 11. https://doi.org/10.1007/s13042-01900990-x

  5. Gunasekaran H, Ramalakshmi K, Rex Macedo Arokiaraj A, Deepa Kanmani S, Venkatesan C, Suresh Gnana Dhas C (2021) Analysis of DNA sequence classification using CNN and hybrid models. Comput Math Methods Med 2021:1835056. PMID: 34306171; PMCID: PMC8285202. https://doi.org/10.1155/2021/1835056

  6. Shadab S, Alam Khan MT, Neezi NA, Adilina S, Shatabda S (2020) DeepDBP: deep neural networks for identification of DNA-binding proteins. Inf Med Unlocked 19:100318

    Google Scholar 

  7. Trabelsi A, Chaabane M, Ben-Hur A (2019) Comprehensive evaluation of deep learning architectures for prediction of DNA/RNA sequence binding specificities. Bioinformatics 35(14):i269–i277. https://doi.org/10.1093/bioinformatics/btz339

    Article  Google Scholar 

  8. Mohammed A, Mahmoud B, Guo P. DNA sequence classification based on MLP with PILAE algorithm

    Google Scholar 

  9. Dhiman G, Juneja S, Viriyasitavat W, Mohafez H, Hadizadeh M, Islam MA, El Bayoumy I, Gulati K (2022) A Novel machine-learning-based hybrid CNN model for tumor identification in medical image processing. Sustainability 14:1447. https://doi.org/10.3390/su14031447

    Article  Google Scholar 

  10. Li J, Huang Q, Ren S, Jiang L, Deng B, Qin Y (2023) A novel medical text classification model with Kalman filter for clinical decision making. Comput Methods Programs Biomed 200:105917. https://doi.org/10.1016/j.cmpb.2021.105917

    Article  Google Scholar 

  11. Li G, Du X, Li X, Zou L, Zhang G, Wu Z (2021) Prediction of DNA binding proteins using local features and long-term dependencies with primary sequences based on deep learning. PeerJ 9:e11262. PMID: 33986992; PMCID: PMC8101451. https://doi.org/10.7717/peerj.11262

  12. Tasdelen, A., Sen, B. A hybrid CNN-LSTM model for pre-miRNA classification.Sci Rep 11, 14125 (2021). https://doi.org/10.1038/s41598-021-93656-0

  13. Abraham MA, Srinivasan H, Namboori C, Krishnan (2019) Healthcare security using blockchain for pharmacogenomics. J Int Pharm Res 6:529–533

    Google Scholar 

  14. Nguyen N, Tran V, Ngo D, Phan D, Lumbanraja F, Faisal M, Abapihi B, Kubo M, Satou K (2016) DNA sequence classification by convolutional neural network. J Biomed Sci Eng 9:280–286. https://doi.org/10.4236/jbise.2016.95021

    Article  Google Scholar 

  15. Oviya IR, Spandana C, Krithika S, Priyadharshini AR (2022) Chest X-ray pathology detection using deep learning and transfer learning. In: 2022 IEEE 7th International conference on recent advances and innovations in engineering (ICRAIE), Mangalore, India, pp 25–30. https://doi.org/10.1109/ICRAIE56454.2022.10054329

  16. Hu S, Ma R, Wang H (2019) An improved deep learning method for predicting DNA-binding proteins based on contextual features in amino acid sequences. PLoS ONE 14(11):e0225317. https://doi.org/10.1371/journal.pone.0225317

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to I. R. Oviya .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2023 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Siva Jyothi Natha Reddy, B., Yadav, S., Venkatakrishnan, R., Oviya, I.R. (2023). Comparison of Deep Learning Approaches for DNA-Binding Protein Classification Using CNN and Hybrid Models. In: Tripathi, A.K., Anand, D., Nagar, A.K. (eds) Proceedings of World Conference on Artificial Intelligence: Advances and Applications. WWCA 1997. Algorithms for Intelligent Systems. Springer, Singapore. https://doi.org/10.1007/978-981-99-5881-8_7

Download citation

Publish with us

Policies and ethics

Navigation