Abstract
Food-borne diseases have a high worldwide occurrence, substantially impacting public health and the social economy. Most food-borne diseases are contagious or poisonous and are caused by bacteria, viruses or chemicals that enter the body via contaminated food. The most prevalent harmful bacteria (Salmonella, Escherichia coli, Campylobacter, Clostridium and Listeria) and viruses (norovirus) may cause acute poisoning or chronic disorders such as cancer. Thus, the detection of pathogenic organisms is crucial for the safety of food. Artificial intelligence has recently been an effective technique for predicting pathogens spreading food-borne diseases. This study compares and contrasts the accuracy of many popular methods for making predictions about the pathogens in food-borne diseases, including decision trees, random forests, k-Nearest Neighbors, stochastic gradient descent and extremely randomized trees, along with an ensemble model incorporating all of these approaches. In addition, principal component analysis and scaling methods were used to normalize and rescale the values of the target variable in order to increase the prediction rate. The performance of classification systems has been examined using precision, accuracy, recall, F1-score and root mean square error (RMSE). The experimental results demonstrate that the suggested new ensemble model beat all other classifiers and achieved the average highest 97.26% accuracy, 0.22 RMSE value, 97.77% recall, 97.66% precision and 98.44% F1-Score. This research investigates the predictability of pathogens in food-borne diseases using ensemble learning techniques.
Similar content being viewed by others
Data Availability
Not applicable.
References
Finger JA, Baroni WS, Maffei DF, Bastos DH, Pinto UM (2019) Overview of foodborne disease outbreaks in Brazil from 2000 to 2018. Foods 8(10):434
Sharif MK, Javed K, Nasir A (2018) Foodborne illness: threats and control. Foodborne diseases. Academic Press, Cambridge, pp 501–523
Jung Y, Jang H, Matthews KR (2014) Effect of the food production chain from farm practices to vegetable processing on outbreak incidence. Microb Biotechnol 7(6):517–527
Kaur I, Garg R, Kaur T, Mathur G (2023) Using artificial intelligence to predict clinical requirements in healthcare. J Pharm Negat Results 2023:4177–4180
Vidyadharani G, Vijaya Bhavadharani HK, Sathishnath P, Ramanathan S, Sariga P, Sandhya A, Sugumar S (2022) Present and pioneer methods of early detection of food borne pathogens. J Food Sci Technol 59(6):2087–2107
Torgerson PR, Devleesschauwer B, Praet N, Speybroeck N, Willingham AL, Kasuga F, de Silva N (2015) World Health Organization estimates of the global and regional disease burden of 11 foodborne parasitic diseases, 2010: a data synthesis. PLoS Med 12(12):e1001920
Vilne B, Meistere I, Grantiņa-Ieviņa L, Ķibilds J (2019) Machine learning approaches for epidemiological investigations of food-borne disease outbreaks. Front Microbiol 10:1722
Kadariya J, Smith TC, Thapaliya D (2014) Staphylococcus aureus and staphylococcal food-borne disease: an ongoing challenge in public health. BioMed Res Int 2014:1–9
Wang H, Cui W, Guo Y, Du Y, Zhou Y (2021) Machine learning prediction of foodborne disease pathogens: algorithm development and validation study. JMIR Med Inform 9(1):e24924
Pandey SK, Bhandari AK (2023) A systematic review of modern approaches in healthcare systems for lung cancer detection and classification. Archiv Comput Methods Eng 30:1–20
Kumar Y, Gupta S (2023) Deep transfer learning approaches to predict glaucoma, cataract, choroidal neovascularization, diabetic macular edema, drusen and healthy eyes: an experimental review. Archiv Comput Methods Eng 30(1):521–541
Heredia N, García S (2018) Animals as sources of food-borne pathogens: a review. Animal nutrition 4(3):250–255
Saravanan A, Kumar PS, Hemavathy RV, Jeevanantham S, Kamalesh R, Sneha S, Yaashikaa PR (2021) Methods of detection of food-borne pathogens: a review. Environ Chem Lett 19:189–207
Chukwu EE, Nwaokorie FO, Coker AO, Avila-Campos MJ, Ogunsola FT (2019) 16S rRNA gene sequencing: a practical approach to confirming the identity of food borne bacteria. IFE J Sci 21(3):13–25
Koul A, Bawa RK, Kumar Y (2023) Artificial intelligence techniques to predict the airway disorders illness: a systematic review. Archiv Comput Methods Eng 30(2):831–864
Hu W, Feng K, Jiang A, **u Z, Lao Y, Li Y, Long Y (2020) An in situ-synthesized gene chip for the detection of food-borne pathogens on fresh-cut cantaloupe and lettuce. Front Microbiol 10:3089
Nesakumar N, Lakshmanakumar M, Srinivasan S, Jayalatha Jbb A, Balaguru Rayappan JB (2021) Principles and recent advances in biosensors for pathogens detection. ChemistrySelect 6(37):10063–10091
Zheng S, Yang Q, Yang H, Zhang Y, Guo W, Zhang W (2023) An ultrasensitive and specific ratiometric electrochemical biosensor based on SRCA-CRISPR/Cas12a system for detection of Salmonella in food. Food Control 146:109528
Chenar SS, Deng Z (2021) Hybrid modeling and prediction of oyster norovirus outbreaks. J Water Health 19(2):254–266
Zhang P, Cui W, Wang H, Du Y, Zhou Y (2021) High-efficiency machine learning method for identifying foodborne disease outbreaks and confounding factors. Foodborne Pathog Dis 18(8):590–598
Chenar SS, Deng Z (2018) Development of artificial intelligence approach to forecasting oyster norovirus outbreaks along Gulf of Mexico coast. Environ Int 111:212–223
Min HJ, Mina HA, Deering AJ, Bae E (2021) Development of a smartphone-based lateral-flow imaging system using machine-learning classifiers for detection of Salmonella spp. J Microbiol Methods 188:106288
Nguyen M, Long SW, McDermott PF, Olsen RJ, Olson R, Stevens RL, Davis JJ (2018) Using machine learning to predict antimicrobial minimum inhibitory concentrations and associated genomic features for nontyphoidal Salmonella. bioRxiv 2018:380782
Polat H, Topalcengiz Z, Danyluk MD (2020) Prediction of Salmonella presence and absence in agricultural surface waters by artificial intelligence approaches. J Food Saf 40(1):e12733
Amado TM, Bunuan MR, Chicote RF, Espenida SMC, Masangcay HL, Ventura CH, Enriquez LAC (2019) Development of predictive models using machine learning algorithms for food adulterants bacteria detection. 2019 IEEE 11th international conference on humanoid, nanotechnology, information technology, communication and control, environment, and management (HNICEM). IEEE, New York, pp 1–6
Lupolova N, Dallman TJ, Holden NJ, Gally DL (2017) Patchy promiscuity: machine learning applied to predict the host specificity of Salmonella enterica and Escherichia coli. Microb Genom. https://doi.org/10.1099/mgen.0.000135
Hiura S, Koseki S, Koyama K (2021) Prediction of population behavior of Listeria monocytogenes in food using machine learning and a microbial growth and survival database. Sci Rep 11(1):1–11
Njage PMK, Henri C, Leekitcharoenphon P, Mistou MY, Hendriksen RS, Hald T (2019) Machine learning methods as a tool for predicting risk of illness applying next-generation sequencing data. Risk Anal 39(6):1397–1413
Borujeni MS, Ghaderi-Zefrehei M, Ghanegolmohammadi F, Ansari-Mahyari S (2018) A novel LSSVM based algorithm to increase accuracy of bacterial growth modeling. Iran J Biotech 16(2):105
Bandoy DJ, Weimer BC (2020) Biological machine learning combined with campylobacter population genomics reveals virulence gene allelic variants cause disease. Microorganisms 8(4):549
Hill AA, Crotta M, Wall B, Good L, O’Brien SJ, Guitian J (2017) Towards an integrated food safety surveillance system: a simulation study to explore the potential of combining genomic and epidemiological metadata. Royal Soc Open Sci 4(3):160721
Maharana A, Cai K, Hellerstein J, Hswen Y, Munsell M, Staneva V, Nsoesie EO (2019) Detecting reports of unsafe foods in consumer product reviews. JAMIA Open 2(3):330–338
Olm MR, Bhattacharya N, Crits-Christoph A, Firek BA, Baker R, Song YS, Banfield JF (2019) Necrotizing enterocolitis is preceded by increased gut bacterial replication, Klebsiella, and fimbriae-encoding bacteria. Sci Adv 5(12):eaax5727
Nogales A, Morón RD, García-Tejedor ÁJ (2020) Food safety risk prediction with Deep Learning models using categorical embeddings on European Union data. Preprint at https://arxiv.org/abs/2009.06704
Ahsan MM, Mahmud MA, Saha PK, Gupta KD, Siddique Z (2021) Effect of data scaling methods on machine learning algorithms and model performance. Technologies 9(3):52
Rudra T, Paul P (2021) Heart disease prediction using traditional machine learning.
Kaur I, Sandhu AK, Kumar Y (2022) A hybrid deep transfer learning approach for the detection of vector-borne diseases. 2022 5th international conference on contemporary computing and informatics (IC3I). IEEE, New York, pp 2189–2194
Peng T, Chen X, Wan M, ** L, Wang X, Du X, Yang X (2021) The prediction of hepatitis E through ensemble learning. Int J Environ Res Public Health 18(1):159
Nogales A, Díaz-Morón R, García-Tejedor ÁJ (2022) A comparison of neural and non-neural machine learning models for food safety risk prediction with European Union RASFF data. Food Control 134:108697
Wheeler NE (2019) Tracing outbreaks with machine learning. Nat Rev Microbiol 17(5):269–269
Martínez-García PM, López-Solanilla E, Ramos C, Rodríguez-Palenzuela P (2016) Prediction of bacterial associations with plants using a supervised machine-learning approach. Environ Microbiol 18(12):4847–4861
Bhardwaj P, Bhandari G, Kumar Y, Gupta S (2022) An investigational approach for the prediction of gastric cancer using artificial intelligence techniques: a systematic review. Archiv Comput Methods Eng 29:1–22
Lumogdang CFD, Wata MG, Loyola SJS, Angelia RE, Angelia HLP (2019) Supervised machine learning approach for pork meat freshness identification. Proceedings of the 2019 6th international conference on bioinformatics research and applications. ACM, New York, pp 1–6
Chowdhury NH, Reaz MBI, Haque F, Ahmad S, Ali SHM, Bakar AAA, Bhuiyan MAS (2021) Performance analysis of conventional machine learning algorithms for identification of chronic kidney disease in type 1 diabetes mellitus patients. Diagnostics 11(12):2267
Kader MS, Ahmed F, Akter J (2021) Machine learning techniques to precaution of emerging disease in the poultry industry. 2021 24th international conference on computer and information technology (ICCIT). IEEE, New York, pp 1–6
Rani P, Kumar R, Jain A (2021) Coronary artery disease diagnosis using extra tree-support vector machine: ET-SVMRBF. Int J Comput Appl Technol 66(2):209–218
Ali L, Niamat A, Khan JA, Golilarz NA, **ngzhong X, Noor A, Bukhari SAC (2019) An optimized stacked support vector machines based expert system for the effective prediction of heart failure. IEEE Access 7:54007–54014
Weller DL, Love T, Wiedmann M (2021) Interpretability versus accuracy: a comparison of machine learning models built using different algorithms, performance measures, and features to predict E. coli levels in agricultural water. Front Artif Intell 4:19
Goyal P, Gopala Krishna DN, Jain D, Rathi M (2021) Foodborne disease outbreak prediction using deep learning. Innovations in computational intelligence and computer vision. Springer, Singapore, pp 165–172
Kaur I, Kumar Y, Sandhu AK, Ijaz MF (2023) Predictive modeling of epidemic diseases based on vector-borne diseases using artificial intelligence techniques. Computational intelligence in medical decision making and diagnosis. CRC Press, Boca Raton, pp 81–100
Singh PD, Kaur R, Singh KD, Dhiman G (2021) A novel ensemble-based classifier for detecting the COVID-19 disease for infected patients. Inf Syst Front 23(6):1385–1401
Kaur I, Sandhu AK, Kumar Y (2022) Artificial intelligence techniques for predictive modeling of vector-borne diseases and its pathogens: a systematic review. Archiv Comput Methods Eng 29:1–31
Kaur I, Sandhu AK, Kumar Y (2021) Analyzing and minimizing the effects of Vector-borne diseases using machine and deep learning techniques: a systematic review. 2021 sixth international conference on image information processing (ICIIP), vol 6. IEEE, New York, pp 69–74
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Competing interests
Non-financial interests and authors have no conflict of interests.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Kumar, Y., Kaur, I. & Mishra, S. Foodborne Disease Symptoms, Diagnostics, and Predictions Using Artificial Intelligence-Based Learning Approaches: A Systematic Review. Arch Computat Methods Eng 31, 553–578 (2024). https://doi.org/10.1007/s11831-023-09991-0
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11831-023-09991-0