Abstract
Malignant epithelial cell tumor also known as cancer is a deadly disease requiring a very costly and complex treatment. Early and accurate diagnosis of tumor plays an important role in reducing the mortality rate. With the rapid development of gene chip technology, gene expression data based tumor classification is helpful for accurate decision-making and has achieved great attention of researchers. Due to gene expression data having the properties of multi-class imbalance, high noise and high-dimensional small samples, in this paper, selective ensemble of doubly weighted fuzzy extreme learning machine (SEN-DWFELM) is presented for tumor classification. In view of good generalization performance of extreme learning machine (ELM), feature weighted fuzzy membership is embedded in ELM for eliminating classification error from noise samples. It considers the influence of feature importance on classification to acquire more accurate fuzzy membership. Simultaneously, by removing features with smaller weights it reduces the dimensionality of samples to improve training efficiency. Considering imbalanced learning, the weighted scheme is also introduced to enhance the effect of minority class samples on classification. Furthermore, doubly weighted fuzzy extreme learning machine (DWFELM) based selective ensemble algorithm is proposed to make classification performance more robust. Partial-based DWFELMs are selected using binary version of an improved whale optimization algorithm, and the selected base DWFELMs are integrated by majority voting. Finally, the proposed SEN-DWFELM is compared with conventional ensemble methods and variants of SEN-DWFELM on various gene expression data. Experimental results show that SEN-DWFELM remarkably outperforms other competitors in accordance with classification performance and can effectively deal with tumor diagnosis problems.
Similar content being viewed by others
References
Chen, W., Sun, K., Zeng, R., et al.: Cancer incidence and mortality in China, 2014. Chin. J. Cancer Res. 30(1), 1–12 (2018)
Kar, S., Sharma, K.D., Maitra, M.: Gene selection from microarray gene expression data for classification of cancer subgroups employing PSO and adaptive K-nearest neighborhood technique. Expert Syst. Appl. 42(1), 612–627 (2015)
Sun, L., Zhang, X.Y., Qian, Y.H., Xu, J.C., Zhang, S.G.: Feature selection using neighborhood entropy-based uncertainty measures for gene expression data classification. Inf. Sci. 502, 18–41 (2019)
Huang, G.B., Zhu, Q.Y., Siew, C.K.: Extreme learning machine: theory and applications. Neurocomputing 70(1), 489–501 (2006)
Wu, C., Li, Y.Q., Zhao, Z.B., Liu, B.: Extreme learning machine with autoencoding receptive fields for image classification. Neural Comput. Appl. 32, 8157–8173 (2020)
Wong, P.K., Huang, W., Vong, C.M., Yang, Z.X.: Adaptive neural tracking control for automotive engine idle speed regulation using extreme learning machine. Neural Comput. Appl. 32, 14399–14409 (2020)
Mohammed, A.A., Minhas, R., Wu, Q.M.J., Sid-Ahmed, M.A.: Human face recognition based on multidimensional PCA and extreme learning machine. Pattern Recognit. 44(10), 2588–2597 (2012)
Kaya, Y., Uyar, M.: A hybrid decision support system based on rough set and extreme learning machine for diagnosis of hepatitis disease. Appl. Soft Comput. 13(8), 3429–3438 (2013)
Lan, Y., Soh, Y.C., Huang, G.B.: Ensemble of online sequential extreme learning machine. Neurocomputing 72(13), 3391–3395 (2009)
Zhou, Z.H., Wu, J., Tang, W.: Ensembling neural networks: many could be better than all. Artif. Intell. 137(1–2), 239–263 (2002)
Shigei, N., Miyajima, H., Maeda, M., et al.: Bagging and AdaBoost algorithms for vector quantization. Neurocomputing 73(1), 106–114 (2009)
Li, K., Kong, X., Lu, Z., Liu, W., Yin, J.: Boosting weighted ELM for imbalanced learning. Neurocomputing 128, 15–21 (2014)
Cao, J.W., Lin, Z.P., Huang, G.B., Liu, N.: Voting based extreme learning machine. Inf. Sci. 185(1), 66–77 (2012)
Lu, H.J., An, C.L., Zheng, E.H., Lu, Y.: Dissimilarity based ensemble of extreme learning machine for gene expression data classification. Neurocomputing 128, 22–30 (2014)
Zhang, W.B., Ji, H.B.: Fuzzy extreme learning machine for classification. Electron. Lett. 49(7), 448–449 (2013)
He, H., Garcia, E.A.: Learning from imbalanced data. IEEE Trans. Knowl. Data Eng. 21(9), 1263–1284 (2009)
Chawla, N.V., Bowyer, K.W., Hall, L.O., Kegelmeyer, W.P.: SMOTE: synthetic minority over-sampling technique. J. Artif. Intell. Res. 16(1), 321–357 (2002)
Liu, X.Y., Wu, J., Zhou, Z.H.: Exploratory undersampling for class-imbalance learning. IEEE Trans. Syst. Man Cybern. Part B 39(2), 539–550 (2009)
Gupta, U., Gupta, D.: Bipolar fuzzy based least squares twin bounded support vector machine. Fuzzy Set. Syst. 449, 120–161 (2022)
Hazarika, B.B., Gupta, D.: Density-weighted support vector machines for binary class imbalance learning. Neural Comput. Appl. 33(9), 4243–4261 (2021)
Gupta, D.: Training primal K-nearest neighbor based weighted twin support vector regression via unconstrained convex minimization. Appl. Intell. 47(3), 962–991 (2017)
Hazarika, B.B., Gupta, D.: Density weighted twin support vector machines for binary class imbalance learning. Neural Process. Lett. 54(2), 1091–1130 (2022)
Hancer, E., Xue, B., Zhang, M.J.: Differential evolution for filter feature selection based on information theory and feature ranking. Knowl.-based Syst. 140, 103–119 (2018)
Mirjalili, S., Lewis, A.: The whale optimization algorithm. Adv. Eng. Softw. 95, 51–67 (2016)
Yan, Z.P., Zhang, J.Z., Zeng, J., Tang, J.L.: Nature-inspired approach: an enhanced whale optimization algorithm for global optimization. Math. Comput. Simul. 185, 17–46 (2021)
Sun, Y.J., Wang, X.L., Chen, Y.H., Liu, Z.J.: A modified whale optimization algorithm for large-scale global optimization problems. Expert Syst. Appl. 114, 563–577 (2018)
Fan, Q., Chen, Z.J., Li, Z., **a, Z.H., Yu, J.Y., Wang, D.Z.: A new improved whale optimization algorithm with joint search mechanisms for high-dimensional global optimization problems. Eng. Comput. 37(3), 1851–1878 (2021)
Wang, J.Z., Du, P., Niu, T., Yang, W.D.: A novel hybrid system based on a new proposed algorithm-multi-objective whale optimization algorithm for wind speed forecasting. Appl. Energy 208, 344–360 (2017)
Aziz, M.A.E., Ewees, A.A., Hassanien, A.E.: Whale optimization algorithm and moth-flame optimization for multilevel thresholding image segmentation. Expert Syst. Appl. 83, 242–256 (2017)
Mafarja, M.M., Mirjalili, S.: Hybrid whale optimization algorithm with simulated annealing for feature selection. Neurocomputing 260, 302–312 (2017)
Gao, L.Y., Ye, M.Q., Lu, X.J., Huang, D.B.: Hybrid method based on information gain and support vector machine for gene selection in cancer classification. Genom. Proteom. Bioinf. 15(6), 389–395 (2017)
Rani, M.J., Devaraj, D.: Two-stage hybrid gene selection using mutual information and genetic algorithm for cancer data classification. J. Med. Syst. 43(8), 235 (2019)
Tavasoli, N., Rezaee, K., Momenzadeh, M., Sehhati, M.: An ensemble soft weighted gene selection-based approach and cancer classification using modified metaheuristic learning. J. Comput. Des. Eng. 8(4), 1172–1189 (2021)
Lu, H.J., Chen, J.Y., Yan, K., **, Q., Xue, Y., Gao, Z.G.: A hybrid feature selection algorithm for gene expression data classification. Neurocomputing 256, 56–62 (2017)
Mondal, M., Semwal, R., Raj, U., Aier, I., Varadwaj, P.K.: An entropy-based classification of breast cancerous genes using microarray data. Neural Comput. Appl. 32(7), 2397–2404 (2020)
Shukla, A.K., Singh, P., Vardhan, M.: Gene selection for cancer types classification using novel hybrid metaheuristics approach. Swarm Evol. Comput. 54, 100661 (2020)
Dabba, A., Tari, A., Meftali, S., Mokhtari, R.: Gene selection and classification of microarray data method based on mutual information and moth flame algorithm. Expert Syst. Appl. 166, 114012 (2021)
Wang, Y., Wang, A.N., Ai, Q., Sun, H.J.: Enhanced kernel-based multilayer fuzzy weighted extreme learning machines. IEEE Access 8, 166246–166260 (2020)
Wang, Y., Wang, A.N., Ai, Q., Sun, H.J.: An adaptive kernel-based weighted extreme learning machine approach for effective detection of Parkinson’s disease. Biomed. Signal Process. Control 38, 400–410 (2017)
Bartlett, P.L.: The sample complexity of pattern classification with neural networks: the size of the weights is more important than the size of the network. IEEE Trans. Inf. Theory 44(2), 525–536 (1998)
Zong, W.W., Huang, G.B., Chen, Y.Q.: Weighted extreme learning machine for imbalance learning. Neurocomputing 101(3), 229–242 (2013)
Palma-Mendoza, R.J., Rodriguez, D., De-Marcos, L.: Distributed ReliefF-based feature selection in spark. Knowl. Inf. Syst. 57(1), 1–20 (2018)
Alotaibi, A.S.: Hybrid model based on ReliefF algorithm and k-nearest neighbor for erythemato-squamous diseases forecasting. Arab. J. Sci. Eng. 47(2), 1299–1307 (2022)
Tizhoosh, R.H.: Opposition-based learning: A new scheme for machine intelligence. In: International Conference on Computational Intelligence for Modelling, Control and Automation, pp. 695–701 (2005)
Rahnamayan, S., Tizhoosh, H.R., Salama, M.M.A.: Quasi-oppositional differential evolution. In: 2007 IEEE Congress on Evolutionary Computation, pp. 2229-2236 (2007)
Kennedy, J., Eberhart, R.: Particle swarm optimization. In: Proceedings of IEEE International Conference on Neural Network, pp. 1942–1948 (1995)
Kennedy, J., Eberhart, R.C.: A discrete binary version of the particle swarm algorithm. In: Proceedings of IEEE International Conference on Systems, Man and Cybernetics, pp. 4104–4108 (1997)
Wang, Y., Wang, A.N., Ai, Q., Sun, H.J.: Ensemble based fuzzy weighted extreme learning machine for gene expression classification. Appl. Intell. 49, 1161–1171 (2019)
Acknowledgements
This work was supported by talent scientific research fund of LIAONING PETROCHEMICAL UNIVERSITY (No.2023XJJL-006).
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Wang, Y. Selective ensemble of doubly weighted fuzzy extreme learning machine for tumor classification. Prog Artif Intell (2024). https://doi.org/10.1007/s13748-024-00319-y
Received:
Accepted:
Published:
DOI: https://doi.org/10.1007/s13748-024-00319-y