Log in

The classification of medical and botanical data through majority voting using artificial neural network

  • Original Research
  • Published:
International Journal of Information Technology Aims and scope Submit manuscript

Abstract

Data classification has many approaches in data mining and machine learning. The artificial neural network (ANN) is applied to classify the data that might belong to various domains like chemical, botanical, medical, spatial, textual, and image. In this work, an ANN technique is applied to the 7 Life sciences (botanical and medical) data sets extracted from public data repositories. Various optimization approaches like exhaustive validation, cross-validation, and multiple seeding are used to discover the most optimized networks for the given datasets. Finally, voting predicts the class where the whole dataset is used as a test set instead of folds. The results obtained by the proposed approach outperform other approaches on all the datasets. Cleveland’s heart, Statlog heart, Dermatology, Hepatitis, Seeds, Abalone and Vertebral Column data sets (all of UCI) after applying the voting showed the accuracy of 94.61%, 93.7%, 99.73%, 96.77%, 99.05%, 89.37% and 90.32% respectively. In the future deep neural network may be used to improve the results.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Subscribe and save

Springer+ Basic
EUR 32.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or Ebook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4

Similar content being viewed by others

Data availability

Data will be available on request to the corresponding author.

References

  1. Khanday AMUD, Rabani ST, Khan QR, Rouf N, Mohi Ud Din M (2020) Machine learning based approaches for detecting covid 19 using clinical text data. Int J Inf Technol 12(3):731–739

  2. Aziz RM, Baluch MF, Patel S, Ganie AH (2022) Lgbm: a machine learning approach for ethereum fraud detection. Int J Inf Technol 1–11

  3. Khanday AMUD, Khan QR, Rabani ST (2021) Identifying propaganda from online social networks during covid 19 using machine learning techniques. Int J Inf Technol 13(1):115–122

    Google Scholar 

  4. Song C-H (2022) A hybrid sem and ann approach to predict the individual cloud computing adoption based on the utaut2. Int J Inf Technol 1–15

  5. Khanday AMUD, Bhushan B, Jhaveri RH, Khan QR, Raut R, Rabani ST (2022) Nnpcov19: artificial neural network-based propaganda identification on social media in covid-19 era. Mob Inf Syst 2022:1–10

    Google Scholar 

  6. Tripathi K, Vyas RG, Gupta AK (2019) Deep learning through convolutional neural networks for classification of image: a novel approach using hyper filter. Int J Comput Sci Eng 7(6):164–168

    Google Scholar 

  7. Tripathi K, Vyas RG, Gupta AK (2018) The classification of data: a novel artificial neural network (ann) approach through exhaustive validation and weight initialization. Int J Comput Sci Eng 6(5):241–254

    Google Scholar 

  8. Repository UML. “Uci machine learning repository” [Online]. http://archive.ics.uci.edu/ml/datasets

  9. Unal Y, Polat K, Kocer HE (2014) Pairwise fcm based feature weighting for improved classification of vertebral column disorders. Comput Biol Med 46:61–70

    Article  Google Scholar 

  10. Lee S-H (2015) Feature selection based on the center of gravity of bswfms using newfm. Eng Appl Artif Intell 45:482–487

    Article  Google Scholar 

  11. Rabani ST, Khanday AMUD, Khan QR, Hajam UA, Imran AS, Kastrati Z (2023) Detecting suicidality on social media: machine learning at rescue. Egypt Inform J 24(2):291–302

    Article  Google Scholar 

  12. Paul AK, Shill PC, Rabin MRI, Akhand M (2016) Genetic algorithm based fuzzy decision support system for the diagnosis of heart disease. In: 2016 5th international conference on informatics, electronics and vision (ICIEV). IEEE, pp 145–150

  13. Verma L, Srivastava S, Negi P (2016) A hybrid data mining model to predict coronary artery disease cases using non-invasive clinical data. J Med Syst 40(7):1–7

    Article  Google Scholar 

  14. Gupta A, Kumar R, Arora HS, Raman B (2019) Mifh: a machine intelligence framework for heart disease diagnosis. IEEE Access 8:14659–14674

    Article  Google Scholar 

  15. Abdeldjouad FZ, Brahami M, Matta N (2020) A hybrid approach for heart disease diagnosis and prediction using machine learning techniques. In: International conference on smart homes and health telematics. Springer, pp 299–306

  16. Tougui I, Jilbab A, El Mhamdi J (2020) Heart disease classification using data mining tools and machine learning techniques. Health Technol 10(5):1137–1144

    Article  Google Scholar 

  17. Krishnaiah V, Narsimha G, Chandra NS (2015) Heart disease prediction system using data mining technique by fuzzy k-nn approach. In: Emerging ICT for bridging the future-proceedings of the 49th annual convention of the Computer Society of India (CSI), vol 1. Springer, pp 371–384

  18. Kahramanli H, Allahverdi N (2008) Design of a hybrid system for the diabetes and heart diseases. Expert Syst Appl 35(1–2):82–89

    Article  Google Scholar 

  19. Liu X, Wang X, Su Q, Zhang M, Zhu Y, Wang Q, Wang Q (2017) A hybrid classification system for heart disease diagnosis based on the rfrs method. In: Computational and mathematical methods in medicine, vol 2017

  20. Tomar D, Agarwal S (2014) Feature selection based least square twin support vector machine for diagnosis of heart disease. Int J Bio-Sci Bio-Technol 6(2):69–82

    Article  Google Scholar 

  21. Verma AK, Pal S, Kumar S (2019) Classification of skin disease using ensemble data mining techniques. Asian Pac J Cancer Prev APJCP 20(6):1887

    Article  Google Scholar 

  22. Bascil MS, Temurtas F (2011) A study on hepatitis disease diagnosis using multilayer neural network with Levenberg Marquardt training algorithm. J Med Syst 35(3):433–436

    Article  Google Scholar 

  23. Sabanci K, Akkaya M (2016) Classification of different wheat varieties by using data mining algorithms. Int J Intell Syst Appl Eng 4(2):40–44

    Article  Google Scholar 

  24. Kumar N, Kumar D (2021) An improved grey wolf optimization-based learning of artificial neural network for medical data classification. J Inf Commun Technol 20(2):213–248

    Google Scholar 

  25. Kalagotla SK, Gangashetty SV, Giridhar K (2021) A novel stacking technique for prediction of diabetes. Comput Biol Med 135:104554

    Article  Google Scholar 

  26. Khatri A, Agrawal S, Chatterjee JM (2022) Wheat seed classification: utilizing ensemble machine learning approach. In: Scientific programming, vol 2022

  27. Madhavan J, Salim M, Durairaj U, Kotteeswaran R (2021) Wheat seed classification using neural network pattern recognizer. Mater Today Proc

  28. Rawat J, Virmani J, Singh A, Bhadauria HS, Kumar I, Devgan J (2020) Fab classification of acute leukemia using an ensemble of neural networks. Evolut Intell 1–19

  29. Siouda R, Nemissi M, Seridi H (2022) Diverse activation functions based-hybrid rbf-elm neural network for medical classification. Evolut Intell 1–17

  30. Reddy GT, Reddy M, Lakshmanna K, Rajput DS, Kaluri R, Srivastava G (2020) Hybrid genetic algorithm and a fuzzy logic classifier for heart disease diagnosis. Evolut Intell 13(2):185–196

    Article  Google Scholar 

  31. Saxena S, Mohapatra D, Padhee S, Sahoo GK (2021) “Machine learning algorithms for diabetes detection: a comparative evaluation of performance of algorithms,” Evolutionary Intelligence, pp. 1–17

  32. Reddy SS, Sethi N, Rajender R (2021) Mining of multiple ailments correlated to diabetes mellitus. Evolut Intell 14(2):733–740

    Article  Google Scholar 

  33. Kusy M, Obrzut B, Kluska J (2013) Application of gene expression programming and neural networks to predict adverse events of radical hysterectomy in cervical cancer patients. Med Biol Eng Comput 51(12):1357–1365

    Article  Google Scholar 

  34. Chatterjee S, Dey N, Shi F, Ashour AS, Fong SJ, Sen S (2018) Clinical application of modified bag-of-features coupled with hybrid neural-based classifier in dengue fever classification using gene expression data. Med Biol Eng Comput 56(4):709–720

    Article  Google Scholar 

  35. Bahremand S, Ko HS, Balouchzadeh R, Felix Lee H, Park S, Kwon G (2019) Neural network-based model predictive control for type 1 diabetic rats on artificial pancreas system. Med Biol Eng Comput 57(1):177–191

    Article  Google Scholar 

  36. Yavuz E, Eyupoglu C (2020) An effective approach for breast cancer diagnosis based on routine blood analysis features. Med Biol Eng Comput 58(7):1583–1601

    Article  Google Scholar 

  37. De Bois M, Yacoubi MAE, Ammi M (2021) Glyfe: review and benchmark of personalized glucose predictive models in type 1 diabetes. Med Biol Eng Comput 1–17

  38. Saadatmand S, Salimifard K, Mohammadi R, Marzban M, Naghibzadeh-Tahami A (2022) Predicting the necessity of oxygen therapy in the early stage of covid-19 using machine learning. Med Biol Eng Comput 60(4):957–968

    Article  Google Scholar 

  39. Harikrishnan N, Pranay S, Nagaraj N (2022) Classification of sars-cov-2 viral genome sequences using neurochaos learning. Med Biol Eng Comput 1–11

  40. Mikhailova V, Anbarjafari G (2022) Comparative analysis of classification algorithms on the breast cancer recurrence using machine learning. Med Biol Eng Comput 60(9):2589–2600

    Article  Google Scholar 

  41. Latha CBC, Jeeva SC (2019) Improving the accuracy of prediction of heart disease risk based on ensemble classification techniques. Inform Med Unlocked 16:100203

    Article  Google Scholar 

  42. Subbulakshmi C, Deepa S, Malathi N (2012) Extreme learning machine for two category data classification. In: 2012 IEEE international conference on advanced communication control and computing technologies (ICACCCT). IEEE, pp 458–461

  43. El-Bialy R, Salamay MA, Karam OH, Khalifa ME (2015) Feature analysis of coronary artery heart disease data sets. Proc Comput Sci 65:459–468

    Article  Google Scholar 

  44. Neshat M, Sargolzaei M, Nadjaran Toosi A, Masoumi A (2012) Hepatitis disease diagnosis using hybrid case based reasoning and particle swarm optimization. Int Sch Res Not 2012

  45. Akbar W, Wu W-p, Saleem S, Farhan M, Saleem MA, Javeed A, Ali L (2020) Development of hepatitis disease detection system by exploiting sparsity in linear support vector machine to improve strength of adaboost ensemble model. Mob Inf Syst 2020

  46. Alshdaifat E, Alshdaifat D, Alsarhan A, Hussein F, El-Salhi SMFS et al (2021) The effect of preprocessing techniques, applied to numeric features, on classification algorithms’ performance. Data 6(2):11

    Article  Google Scholar 

  47. Guney S, Kilinc I, Hameed AA, Jamil A (2022) Abalone age prediction using machine learning. In: Mediterranean conference on pattern recognition and artificial intelligence. Springer, pp 329–338

  48. Sahin E, Saul CJ, Ozsarfati E, Yilmaz A (2018) Abalone life phase classification with deep learning. In: 2018 5th international conference on soft computing & machine intelligence (ISCMI). IEEE, pp 163–167

  49. Dubey AK, Choudhary K, Sharma R (2021) Predicting heart disease based on influential features with machine learning. Intell Autom Soft Comput 30(3):929–943

    Article  Google Scholar 

  50. Buscema M, Breda M, Lodwick W (2013) Training with input selection and testing (twist) algorithm: a significant advance in pattern recognition performance of machine learning

  51. Nahar J, Imam T, Tickle KS, Chen Y-PP (2013) Computational intelligence for heart disease diagnosis: a medical knowledge driven approach. Expert Syst Appl 40(1):96–104

    Article  Google Scholar 

  52. Shah SMS, Batool S, Khan I, Ashraf MU, Abbas SH, Hussain SA (2017) Feature extraction through parallel probabilistic principal component analysis for heart disease diagnosis. Phys A 482:796–807

    Article  MATH  Google Scholar 

  53. Vijayashree J, Sultana HP (2018) A machine learning framework for feature selection in heart disease classification using improved particle swarm optimization with support vector machine classifier. Program Comput Softw 44(6):388–397

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Kshitij Tripathi.

Ethics declarations

Conflict of interest

The authors declare that they have no conflict of interest.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Tripathi, K., Khan, F.A., Khanday, A.M.U.D. et al. The classification of medical and botanical data through majority voting using artificial neural network. Int. j. inf. tecnol. 15, 3271–3283 (2023). https://doi.org/10.1007/s41870-023-01361-0

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s41870-023-01361-0

Keywords

Navigation