Comparative Analysis of CatBoost Against Machine Learning Algorithms for Classification of Altered NSL-KDD

  • Conference paper
  • First Online:
Proceedings of the Fifth International Conference on Trends in Computational and Cognitive Engineering (TCCE 2023)

Part of the book series: Lecture Notes in Networks and Systems ((LNNS,volume 961))

  • 23 Accesses

Abstract

The importance of information security, especially in network environments, has increased as a result of the expanding volume of data stored in computer systems. The categorization of data instances into two classes using binary classification, a fundamental machine learning activity, facilitates efficient decision-making and risk assessment. The performance of standard classifiers, logistic regression, Gaussian NB, and support vector machine (SVM) is compared with that of CatBoost, a gradient boosting-based classifier known for handling categorical variables and reducing overfitting. The NSL-KDD dataset is used for the evaluation. Results show that CatBoost performs better than conventional classifiers, with higher accuracy, precision, and recall. CatBoost's ability to classify network connections accurately is demonstrated by its training and testing accuracy results, which surpass 97%. Furthermore, its high true positive rate and low false positive rate attest to its competence in accurately identifying network threats.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic
EUR 32.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or Ebook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Chapter
EUR 29.95
Price includes VAT (Germany)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
EUR 171.19
Price includes VAT (Germany)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
EUR 213.99
Price includes VAT (Germany)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free ship** worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Perwej Y, Abbas SQ, Dixit JP, Akhtar N, Jaiswal AK (2021) A systematic literature review on the cyber security. Int J Scient Res Managem 9(12):669–710

    Article  Google Scholar 

  2. Sarker IH (2021) Machine learning: algorithms, real-world applications and research directions. SN Comput Sci 2(3):160

    Article  Google Scholar 

  3. Tavallaee M, Bagheri E, Lu W, Ghorbani AA (2009) A detailed analysis of the KDD CUP 99 data set. In: 2009 IEEE symposium on computational intelligence for security and defense applications, July, IEEE, pp 1–6

    Google Scholar 

  4. Hancock JT, Khoshgoftaar TM (2020) CatBoost for big data: an interdisciplinary

    Google Scholar 

  5. Kasongo SM (2023) A deep learning technique for intrusion detection system using a recurrent neural networks based framework. Comput Commun 199:113–125

    Article  Google Scholar 

  6. Ding Y, Zhai Y (2018) Intrusion detection system for NSL-KDD dataset using convolutional neural networks. In: Proceedings of the 2018 2nd international conference on computer science and artificial intelligence, December, pp 81–85

    Google Scholar 

  7. Hota HS, Shrivas AK (2014) Decision tree techniques applied on NSL-KDD data and its comparison with various feature selection techniques. In: Advanced computing, networking and informatics-Volume 1: advanced computing and informatics proceedings of the second international conference on advanced computing, networking and informatics (ICACNI-2014). Springer International Publishing, pp 205–211

    Google Scholar 

  8. Umer MA, Junejo KN, Jilani MT, Mathur AP (2022) Machine learning for intrusion detection in industrial control systems: applications, challenges, and recommendations. Int J Crit Infrastruct Prot 38:100516

    Article  Google Scholar 

  9. Saheed YK, Abiodun AI, Misra S, Holone MK, Colomo-Palacios R (2022) A machine learning-based intrusion detection for detecting internet of things network attacks. Alex Eng J 61(12):9395–9409

    Article  Google Scholar 

  10. Zhang C, Jia D, Wang L, Wang W, Liu F, Yang A (2022) Comparative research on network intrusion detection methods based on machine learning. Comput Secur 102861

    Google Scholar 

  11. Dhananjay B, Sivaraman J (2021) Analysis and classification of heart rate using CatBoost feature ranking model. Biomed Signal Process Control 68:102610

    Article  Google Scholar 

  12. Jabeur SB, Gharib C, Mefteh-Wali S, Arfi WB (2021) CatBoost model and artificial intelligence techniques for corporate failure prediction. Technol Forecast Soc Chang 166:120658

    Article  Google Scholar 

  13. Hancock JT, Khoshgoftaar TM (2020) CatBoost for big data: an interdisciplinary review. J Big Data 7(1):1–45

    Article  Google Scholar 

Download references

Acknowledgements

The authors thank Sharmin Sultana and Nayer Sultana for their analysis and guidance.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Nadia Ahmed Sharna .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2024 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Sharna, N.A., Islam, E. (2024). Comparative Analysis of CatBoost Against Machine Learning Algorithms for Classification of Altered NSL-KDD. In: Kaiser, M.S., Singh, R., Bandyopadhyay, A., Mahmud, M., Ray, K. (eds) Proceedings of the Fifth International Conference on Trends in Computational and Cognitive Engineering. TCCE 2023. Lecture Notes in Networks and Systems, vol 961. Springer, Singapore. https://doi.org/10.1007/978-981-97-1923-5_24

Download citation

Publish with us

Policies and ethics

Navigation