Abstract
Data classification has many approaches in data mining and machine learning. The artificial neural network (ANN) is applied to classify the data that might belong to various domains like chemical, botanical, medical, spatial, textual, and image. In this work, an ANN technique is applied to the 7 Life sciences (botanical and medical) data sets extracted from public data repositories. Various optimization approaches like exhaustive validation, cross-validation, and multiple seeding are used to discover the most optimized networks for the given datasets. Finally, voting predicts the class where the whole dataset is used as a test set instead of folds. The results obtained by the proposed approach outperform other approaches on all the datasets. Cleveland’s heart, Statlog heart, Dermatology, Hepatitis, Seeds, Abalone and Vertebral Column data sets (all of UCI) after applying the voting showed the accuracy of 94.61%, 93.7%, 99.73%, 96.77%, 99.05%, 89.37% and 90.32% respectively. In the future deep neural network may be used to improve the results.
![](http://media.springernature.com/m312/springer-static/image/art%3A10.1007%2Fs41870-023-01361-0/MediaObjects/41870_2023_1361_Fig1_HTML.png)
![](http://media.springernature.com/m312/springer-static/image/art%3A10.1007%2Fs41870-023-01361-0/MediaObjects/41870_2023_1361_Fig2_HTML.png)
![](http://media.springernature.com/m312/springer-static/image/art%3A10.1007%2Fs41870-023-01361-0/MediaObjects/41870_2023_1361_Fig3_HTML.png)
![](http://media.springernature.com/m312/springer-static/image/art%3A10.1007%2Fs41870-023-01361-0/MediaObjects/41870_2023_1361_Fig4_HTML.png)
Similar content being viewed by others
Data availability
Data will be available on request to the corresponding author.
References
Khanday AMUD, Rabani ST, Khan QR, Rouf N, Mohi Ud Din M (2020) Machine learning based approaches for detecting covid 19 using clinical text data. Int J Inf Technol 12(3):731–739
Aziz RM, Baluch MF, Patel S, Ganie AH (2022) Lgbm: a machine learning approach for ethereum fraud detection. Int J Inf Technol 1–11
Khanday AMUD, Khan QR, Rabani ST (2021) Identifying propaganda from online social networks during covid 19 using machine learning techniques. Int J Inf Technol 13(1):115–122
Song C-H (2022) A hybrid sem and ann approach to predict the individual cloud computing adoption based on the utaut2. Int J Inf Technol 1–15
Khanday AMUD, Bhushan B, Jhaveri RH, Khan QR, Raut R, Rabani ST (2022) Nnpcov19: artificial neural network-based propaganda identification on social media in covid-19 era. Mob Inf Syst 2022:1–10
Tripathi K, Vyas RG, Gupta AK (2019) Deep learning through convolutional neural networks for classification of image: a novel approach using hyper filter. Int J Comput Sci Eng 7(6):164–168
Tripathi K, Vyas RG, Gupta AK (2018) The classification of data: a novel artificial neural network (ann) approach through exhaustive validation and weight initialization. Int J Comput Sci Eng 6(5):241–254
Repository UML. “Uci machine learning repository” [Online]. http://archive.ics.uci.edu/ml/datasets
Unal Y, Polat K, Kocer HE (2014) Pairwise fcm based feature weighting for improved classification of vertebral column disorders. Comput Biol Med 46:61–70
Lee S-H (2015) Feature selection based on the center of gravity of bswfms using newfm. Eng Appl Artif Intell 45:482–487
Rabani ST, Khanday AMUD, Khan QR, Hajam UA, Imran AS, Kastrati Z (2023) Detecting suicidality on social media: machine learning at rescue. Egypt Inform J 24(2):291–302
Paul AK, Shill PC, Rabin MRI, Akhand M (2016) Genetic algorithm based fuzzy decision support system for the diagnosis of heart disease. In: 2016 5th international conference on informatics, electronics and vision (ICIEV). IEEE, pp 145–150
Verma L, Srivastava S, Negi P (2016) A hybrid data mining model to predict coronary artery disease cases using non-invasive clinical data. J Med Syst 40(7):1–7
Gupta A, Kumar R, Arora HS, Raman B (2019) Mifh: a machine intelligence framework for heart disease diagnosis. IEEE Access 8:14659–14674
Abdeldjouad FZ, Brahami M, Matta N (2020) A hybrid approach for heart disease diagnosis and prediction using machine learning techniques. In: International conference on smart homes and health telematics. Springer, pp 299–306
Tougui I, Jilbab A, El Mhamdi J (2020) Heart disease classification using data mining tools and machine learning techniques. Health Technol 10(5):1137–1144
Krishnaiah V, Narsimha G, Chandra NS (2015) Heart disease prediction system using data mining technique by fuzzy k-nn approach. In: Emerging ICT for bridging the future-proceedings of the 49th annual convention of the Computer Society of India (CSI), vol 1. Springer, pp 371–384
Kahramanli H, Allahverdi N (2008) Design of a hybrid system for the diabetes and heart diseases. Expert Syst Appl 35(1–2):82–89
Liu X, Wang X, Su Q, Zhang M, Zhu Y, Wang Q, Wang Q (2017) A hybrid classification system for heart disease diagnosis based on the rfrs method. In: Computational and mathematical methods in medicine, vol 2017
Tomar D, Agarwal S (2014) Feature selection based least square twin support vector machine for diagnosis of heart disease. Int J Bio-Sci Bio-Technol 6(2):69–82
Verma AK, Pal S, Kumar S (2019) Classification of skin disease using ensemble data mining techniques. Asian Pac J Cancer Prev APJCP 20(6):1887
Bascil MS, Temurtas F (2011) A study on hepatitis disease diagnosis using multilayer neural network with Levenberg Marquardt training algorithm. J Med Syst 35(3):433–436
Sabanci K, Akkaya M (2016) Classification of different wheat varieties by using data mining algorithms. Int J Intell Syst Appl Eng 4(2):40–44
Kumar N, Kumar D (2021) An improved grey wolf optimization-based learning of artificial neural network for medical data classification. J Inf Commun Technol 20(2):213–248
Kalagotla SK, Gangashetty SV, Giridhar K (2021) A novel stacking technique for prediction of diabetes. Comput Biol Med 135:104554
Khatri A, Agrawal S, Chatterjee JM (2022) Wheat seed classification: utilizing ensemble machine learning approach. In: Scientific programming, vol 2022
Madhavan J, Salim M, Durairaj U, Kotteeswaran R (2021) Wheat seed classification using neural network pattern recognizer. Mater Today Proc
Rawat J, Virmani J, Singh A, Bhadauria HS, Kumar I, Devgan J (2020) Fab classification of acute leukemia using an ensemble of neural networks. Evolut Intell 1–19
Siouda R, Nemissi M, Seridi H (2022) Diverse activation functions based-hybrid rbf-elm neural network for medical classification. Evolut Intell 1–17
Reddy GT, Reddy M, Lakshmanna K, Rajput DS, Kaluri R, Srivastava G (2020) Hybrid genetic algorithm and a fuzzy logic classifier for heart disease diagnosis. Evolut Intell 13(2):185–196
Saxena S, Mohapatra D, Padhee S, Sahoo GK (2021) “Machine learning algorithms for diabetes detection: a comparative evaluation of performance of algorithms,” Evolutionary Intelligence, pp. 1–17
Reddy SS, Sethi N, Rajender R (2021) Mining of multiple ailments correlated to diabetes mellitus. Evolut Intell 14(2):733–740
Kusy M, Obrzut B, Kluska J (2013) Application of gene expression programming and neural networks to predict adverse events of radical hysterectomy in cervical cancer patients. Med Biol Eng Comput 51(12):1357–1365
Chatterjee S, Dey N, Shi F, Ashour AS, Fong SJ, Sen S (2018) Clinical application of modified bag-of-features coupled with hybrid neural-based classifier in dengue fever classification using gene expression data. Med Biol Eng Comput 56(4):709–720
Bahremand S, Ko HS, Balouchzadeh R, Felix Lee H, Park S, Kwon G (2019) Neural network-based model predictive control for type 1 diabetic rats on artificial pancreas system. Med Biol Eng Comput 57(1):177–191
Yavuz E, Eyupoglu C (2020) An effective approach for breast cancer diagnosis based on routine blood analysis features. Med Biol Eng Comput 58(7):1583–1601
De Bois M, Yacoubi MAE, Ammi M (2021) Glyfe: review and benchmark of personalized glucose predictive models in type 1 diabetes. Med Biol Eng Comput 1–17
Saadatmand S, Salimifard K, Mohammadi R, Marzban M, Naghibzadeh-Tahami A (2022) Predicting the necessity of oxygen therapy in the early stage of covid-19 using machine learning. Med Biol Eng Comput 60(4):957–968
Harikrishnan N, Pranay S, Nagaraj N (2022) Classification of sars-cov-2 viral genome sequences using neurochaos learning. Med Biol Eng Comput 1–11
Mikhailova V, Anbarjafari G (2022) Comparative analysis of classification algorithms on the breast cancer recurrence using machine learning. Med Biol Eng Comput 60(9):2589–2600
Latha CBC, Jeeva SC (2019) Improving the accuracy of prediction of heart disease risk based on ensemble classification techniques. Inform Med Unlocked 16:100203
Subbulakshmi C, Deepa S, Malathi N (2012) Extreme learning machine for two category data classification. In: 2012 IEEE international conference on advanced communication control and computing technologies (ICACCCT). IEEE, pp 458–461
El-Bialy R, Salamay MA, Karam OH, Khalifa ME (2015) Feature analysis of coronary artery heart disease data sets. Proc Comput Sci 65:459–468
Neshat M, Sargolzaei M, Nadjaran Toosi A, Masoumi A (2012) Hepatitis disease diagnosis using hybrid case based reasoning and particle swarm optimization. Int Sch Res Not 2012
Akbar W, Wu W-p, Saleem S, Farhan M, Saleem MA, Javeed A, Ali L (2020) Development of hepatitis disease detection system by exploiting sparsity in linear support vector machine to improve strength of adaboost ensemble model. Mob Inf Syst 2020
Alshdaifat E, Alshdaifat D, Alsarhan A, Hussein F, El-Salhi SMFS et al (2021) The effect of preprocessing techniques, applied to numeric features, on classification algorithms’ performance. Data 6(2):11
Guney S, Kilinc I, Hameed AA, Jamil A (2022) Abalone age prediction using machine learning. In: Mediterranean conference on pattern recognition and artificial intelligence. Springer, pp 329–338
Sahin E, Saul CJ, Ozsarfati E, Yilmaz A (2018) Abalone life phase classification with deep learning. In: 2018 5th international conference on soft computing & machine intelligence (ISCMI). IEEE, pp 163–167
Dubey AK, Choudhary K, Sharma R (2021) Predicting heart disease based on influential features with machine learning. Intell Autom Soft Comput 30(3):929–943
Buscema M, Breda M, Lodwick W (2013) Training with input selection and testing (twist) algorithm: a significant advance in pattern recognition performance of machine learning
Nahar J, Imam T, Tickle KS, Chen Y-PP (2013) Computational intelligence for heart disease diagnosis: a medical knowledge driven approach. Expert Syst Appl 40(1):96–104
Shah SMS, Batool S, Khan I, Ashraf MU, Abbas SH, Hussain SA (2017) Feature extraction through parallel probabilistic principal component analysis for heart disease diagnosis. Phys A 482:796–807
Vijayashree J, Sultana HP (2018) A machine learning framework for feature selection in heart disease classification using improved particle swarm optimization with support vector machine classifier. Program Comput Softw 44(6):388–397
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of interest
The authors declare that they have no conflict of interest.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Tripathi, K., Khan, F.A., Khanday, A.M.U.D. et al. The classification of medical and botanical data through majority voting using artificial neural network. Int. j. inf. tecnol. 15, 3271–3283 (2023). https://doi.org/10.1007/s41870-023-01361-0
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s41870-023-01361-0