Symptom Based Health Status Prediction via Decision Tree, KNN, XGBoost, LDA, SVM, and Random Forest

  • Conference paper
  • First Online:
Computational Intelligence, Data Analytics and Applications (ICCIDA 2022)

Abstract

Machine learning applications in health science become more important and necessary every day. With the help of these systems, the load of the medical staff will be lessened and faults because of a missing point, or tiredness will decrease. It should not be forgotten that the last decision lies with the professionals, and these systems will only help in decision-making. Predicting diseases with the help of machine learning algorithm can lessen the load of the medical staff. This paper proposes a machine learning model that analyzes healthcare data from a variety of diseases and shows the result from the best resulting algorithm in the model. It is aimed to have a system that facilitates the diagnosis of diseases caused by the density of data in the health field by using these algorithms of previously diagnosed symptoms, thus resulting in doctors going a faster way while diagnosing the disease and have a prediction about the diseases of people who do not have the condition to go to the hospital. In this way, it can ease the burden on health systems. The disease outcome corresponding to the 11 symptoms found in the data set used is previously experienced results. During the study, different ML algorithms such as Decision Tree, Random Forest, KNN, XGBoost, SVM, LDA were tried and compatibility/performance comparisons were made on the dataset used. The results are presented in a table. As a result of these comparisons and evaluations, it was seen that Random Forest Algorithm gave the best performance. While data was being processed, input parameters were provided to each model, and disease was taken as output. Within this limited resource, our model has reached an accuracy rate of 98%.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Springer+ Basic
EUR 32.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or Ebook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now
Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 259.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 329.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free ship** worldwide - see info
Hardcover Book
USD 329.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free ship** worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Similar content being viewed by others

References

  1. Akhtar, N.: Heart Disease Prediction (2021)

    Google Scholar 

  2. Jany Shabu, S.L., Nithin, M.S., Santhosh, M., Roobini, M.S., Mohana Prasad, K., Joshila Grace, L.K.: Skin disease prediction. J. Comput. Theor. Nanosci. 17(8), 3458–3462 (2020)

    Article  Google Scholar 

  3. Shilimkar, G., Shivam, P.: Disease prediction using machine learning. Int. J. Sci. Res. Sci. Technol. 8(3), 551–555 (2021)

    Google Scholar 

  4. Tamal, M.A., Islam, M.S., Ahmmed, M.J., Aziz, M.A., Miah, P., Karim, M.R.: Heart disease prediction based on external factors: a machine learning approach. Int. J. Adv. Comput. Sci. Appl. 10 (2019) https://doi.org/10.14569/IJACSA.2019.0101260

  5. Rajora, H., Punn, N.S., Sonbhadra, S.K., Agarwal, S.: Web based disease prediction and recommender system (2021)

    Google Scholar 

  6. John, R.: An application of machine learning in IVF: comparing the accuracy of classification algorithms for the prediction of twins. Gynecol. Obstet. 9(497), 0932–2161 (2019). https://doi.org/10.4172/2161-0932.1000497

    Article  Google Scholar 

  7. Lee, R., Chitnis, C.: Improving health-care systems by disease prediction. In: 2018 International Conference on Computational Science and Computational Intelligence (CSCI), pp. 726–731 (2018). https://doi.org/10.1109/CSCI46756.2018.00145

  8. Shetty, S.V., Karthik, G.A., Ashwin, M.: Symptom based health prediction using data mining. In: 2019 International Conference on Communication and Electronics Systems (ICCES), pp. 744–749 (2019). https://doi.org/10.1109/ICCES45898.2019.9002132

  9. Joshi, T.N., Chawan, P.M.: Logistic regression and SVM based diabetes prediction system. Int. J. Technol. Res. Eng. 5, 4347–4350 (2018)

    Google Scholar 

  10. Lafta, R., Zhang, J., Tao, X., Li, Y., Tseng, V.S.: An intelligent recommender system based on short-term risk prediction for heart disease patients. In: 2015 IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology (WI-IAT), vol. 3, pp. 102–105. IEEE (2015). https://doi.org/10.1109/WI-IAT.2015.47

  11. Baig, M., Nadeem, M.: Diabetes prediction using machine learning algorithms (2020). https://doi.org/10.13140/RG.2.2.18158.64328

  12. https://www.kaggle.com/itachi9604/disease-symptom-description-dataset

  13. Mujumdar, A., Vaidehi, V.: Diabetes prediction using machine learning algorithms. Int. Conf. Recent Trends Adv. Comput. ICRTAC 165, 292–299 (2019)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Elif Meriç .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2023 The Author(s), under exclusive license to Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Meriç, E., Özer, Ç. (2023). Symptom Based Health Status Prediction via Decision Tree, KNN, XGBoost, LDA, SVM, and Random Forest. In: García Márquez, F.P., Jamil, A., Eken, S., Hameed, A.A. (eds) Computational Intelligence, Data Analytics and Applications. ICCIDA 2022. Lecture Notes in Networks and Systems, vol 643. Springer, Cham. https://doi.org/10.1007/978-3-031-27099-4_15

Download citation

Publish with us

Policies and ethics

Navigation