Abstract
The importance of information security, especially in network environments, has increased as a result of the expanding volume of data stored in computer systems. The categorization of data instances into two classes using binary classification, a fundamental machine learning activity, facilitates efficient decision-making and risk assessment. The performance of standard classifiers, logistic regression, Gaussian NB, and support vector machine (SVM) is compared with that of CatBoost, a gradient boosting-based classifier known for handling categorical variables and reducing overfitting. The NSL-KDD dataset is used for the evaluation. Results show that CatBoost performs better than conventional classifiers, with higher accuracy, precision, and recall. CatBoost's ability to classify network connections accurately is demonstrated by its training and testing accuracy results, which surpass 97%. Furthermore, its high true positive rate and low false positive rate attest to its competence in accurately identifying network threats.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Perwej Y, Abbas SQ, Dixit JP, Akhtar N, Jaiswal AK (2021) A systematic literature review on the cyber security. Int J Scient Res Managem 9(12):669–710
Sarker IH (2021) Machine learning: algorithms, real-world applications and research directions. SN Comput Sci 2(3):160
Tavallaee M, Bagheri E, Lu W, Ghorbani AA (2009) A detailed analysis of the KDD CUP 99 data set. In: 2009 IEEE symposium on computational intelligence for security and defense applications, July, IEEE, pp 1–6
Hancock JT, Khoshgoftaar TM (2020) CatBoost for big data: an interdisciplinary
Kasongo SM (2023) A deep learning technique for intrusion detection system using a recurrent neural networks based framework. Comput Commun 199:113–125
Ding Y, Zhai Y (2018) Intrusion detection system for NSL-KDD dataset using convolutional neural networks. In: Proceedings of the 2018 2nd international conference on computer science and artificial intelligence, December, pp 81–85
Hota HS, Shrivas AK (2014) Decision tree techniques applied on NSL-KDD data and its comparison with various feature selection techniques. In: Advanced computing, networking and informatics-Volume 1: advanced computing and informatics proceedings of the second international conference on advanced computing, networking and informatics (ICACNI-2014). Springer International Publishing, pp 205–211
Umer MA, Junejo KN, Jilani MT, Mathur AP (2022) Machine learning for intrusion detection in industrial control systems: applications, challenges, and recommendations. Int J Crit Infrastruct Prot 38:100516
Saheed YK, Abiodun AI, Misra S, Holone MK, Colomo-Palacios R (2022) A machine learning-based intrusion detection for detecting internet of things network attacks. Alex Eng J 61(12):9395–9409
Zhang C, Jia D, Wang L, Wang W, Liu F, Yang A (2022) Comparative research on network intrusion detection methods based on machine learning. Comput Secur 102861
Dhananjay B, Sivaraman J (2021) Analysis and classification of heart rate using CatBoost feature ranking model. Biomed Signal Process Control 68:102610
Jabeur SB, Gharib C, Mefteh-Wali S, Arfi WB (2021) CatBoost model and artificial intelligence techniques for corporate failure prediction. Technol Forecast Soc Chang 166:120658
Hancock JT, Khoshgoftaar TM (2020) CatBoost for big data: an interdisciplinary review. J Big Data 7(1):1–45
Acknowledgements
The authors thank Sharmin Sultana and Nayer Sultana for their analysis and guidance.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2024 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Sharna, N.A., Islam, E. (2024). Comparative Analysis of CatBoost Against Machine Learning Algorithms for Classification of Altered NSL-KDD. In: Kaiser, M.S., Singh, R., Bandyopadhyay, A., Mahmud, M., Ray, K. (eds) Proceedings of the Fifth International Conference on Trends in Computational and Cognitive Engineering. TCCE 2023. Lecture Notes in Networks and Systems, vol 961. Springer, Singapore. https://doi.org/10.1007/978-981-97-1923-5_24
Download citation
DOI: https://doi.org/10.1007/978-981-97-1923-5_24
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-97-1922-8
Online ISBN: 978-981-97-1923-5
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)