Abstract
Ensemble classifiers provide an efficient method to deal with diverse set of applications in various domains. The proposed research signifies the effectiveness of ensemble classifier for computer-aided breast cancer diagnosis. A novel combination of five heterogeneous classifiers namely Naïve Bayes, Decision tree using Gini index, Decision tree using information gain, Support vector machine and Memory based learner are used to make an ensemble framework. Weighted voting technique is used to determine the final prediction where weights are assigned on the basis of classification accuracy. Four different breast cancer datasets are used from online data repositories. Feature selection and various preprocessing techniques are applied on the datasets to enhance the classification accuracy. The analyses of experimental results show that the proposed ensemble technique provided a significant improvement as compared to other classifiers. The best accuracy achieved by proposed ensemble is 97.42 % whereas the best precision and recall is 100 and 98.60 % respectively.
![](http://media.springernature.com/m312/springer-static/image/art%3A10.1007%2Fs11135-014-0090-z/MediaObjects/11135_2014_90_Fig1_HTML.gif)
![](http://media.springernature.com/m312/springer-static/image/art%3A10.1007%2Fs11135-014-0090-z/MediaObjects/11135_2014_90_Fig2_HTML.gif)
![](http://media.springernature.com/m312/springer-static/image/art%3A10.1007%2Fs11135-014-0090-z/MediaObjects/11135_2014_90_Fig3_HTML.gif)
![](http://media.springernature.com/m312/springer-static/image/art%3A10.1007%2Fs11135-014-0090-z/MediaObjects/11135_2014_90_Fig4_HTML.gif)
![](http://media.springernature.com/m312/springer-static/image/art%3A10.1007%2Fs11135-014-0090-z/MediaObjects/11135_2014_90_Fig5_HTML.gif)
Similar content being viewed by others
References
Abonyi, J., Szeifert, F.: Supervised fuzzy clustering for the identification of fuzzy classifiers. Pattern Recognit. Lett. 14(24), 2195–2207 (2003)
American Cancer Society Homepage www.cancer.org (2008) [last accessed: 31 March 2014]
Angeline Christobel, Y., Dr. Sivaprakasam: An Empirical Comparison of Data Mining Classification Methods. Int. J. Comput. Inf. Syst., 3(2), (2011)
Aruna, S., Rajagopalan, S.P., Nandakishore, L.V.: Knowledge based analysis of various statistical tools in detecting breast cancer. CCSEA, CS IT 02, 37–45 (2011)
Ashfaq, A.K., Aljahdali, S., Hussain, S.N.: Comparative prediction performance with support vector machine and random forest classification techniques. Int. J. Comput. Appl. 69(11), 0975–8887 (2013)
Bishop, C.: Neural Networks for Pattern Recognition. Oxford University Press, New York (1995)
Chen, H., Yang, B., Liu, J., Liu, D.: A support vector machine classifier with rough set-based feature selection for breast cancer diagnosis. Expert Syst. Appl. 38, 9014–9022 (2011)
Chen, H.L., Yang, B., et al.: A three-stage expert system based on support vector machines for thyroid disease diagnosis. J. Med. Syst. 36(3), 1953–1963 (2011)
Christobel, A., Sivaprakasam, Y.: An empirical comparison of data mining classification methods. Int. J. Comput. Inf. Syst. 3, 24–28 (2011)
Chunekar, V.N., Ambulgekar, H.P.: Approach of neural network to diagnose breast cancer on three different data set. In: international conference on advances in recent technologies in communication and computing. ARTCom’09 International Conference on IEEE pp 893–895 (2009)
Fan, C., Chang, P., Lin, J., Hsieh, J.C.: A hybrid model combining case-based reasoning and fuzzy decision tree for medical data classification. Appl. Soft Comput. 11, 632–644 (2011)
Goodman, D.E., Boggess, L., Watkins, A.: Artificial immune system classification of multiple-class problems, In: Proceedings of the Artificial Neural Networks in Engineering ANNIE, pp. 179–183 (2002)
Han, J., Kamber, M.: Data mining: concepts and techniques, 3rd edition (2006)
Hastie & Tibshirani: Cross validation and bootstrap, February 25, (2009)
Joachims, T.: Transductive inference for text classification using support vector machines. In: Proceedings of International Conference Machine Learning. Slovenia. pp. 200–209 (1999)
Keles, A., Keles, A., Yavuz, U.: Expert system based on neuro-fuzzy rules for diagnosis breast cancer. Experts syst. Appl. 38, 5719–5726 (2011)
Kovalerchuck, B., Triantaphyllou, E., Ruiz, J.F., Clayton, J.: Fuzzy logic in computer-aided breast-cancer diagnosis: analysis of lobulation. Artif. Intell. Med. 11, 75–85 (1997)
Lavanya, D., Rani, K.U.: Analysis of feature selection with classification: breast cancer datasets. Indian J. Comput. Sci. Eng. (IJCSE), 2(5) (2011)
Lavanya, D., Rani, K.U.: Ensemble decision tree classifier for breast cancer data. In: International Journal of Information Technology Convergence and Services (IJITCS) 2(1) (2012)
Luo, S.T., Cheng, B.W.: Diagnosing breast masses in digital mammography using feature selection and ensemble methods. J. Med. Syst. 36(2), 569–577 (2010)
Maglogiannis, I., Zafiropoulos, E., et al.: An intelligent system for automated breast cancer diagnosis and prognosis using SVM based classifiers. Appl. Intel. 30, 24–36 (2009)
Mitchell, T.: Machine Learning. McGraw-Hill, New York (1997)
Opitz, D., Shavlik, J.: Generating accurate and diverse members of a neuralá1network ensemble. In: Touretzky, D.S., Mozer, M.C., Hasselmo, M.E. (eds.) Advances in Neural Information Processing Systems, Vol 8, pp 535–541 (1996)
E.Osuna, E., Freund, R., Girosi, F.: Training support vector machines: Application to face detection. Proceedings of Computer Vision and Pattern Recognition, Puerto Rico pp. 130–136 (1997)
Pendharkar, P.C., Rodger, J.A., Yaverbaum, G.J., Herman, N., Benner, M.: Associations statistical, mathematical and neural approaches for mining breast cancer patterns. Expert Syst. Appl. 17, 223–232 (1999)
Polat, K., Günes, S.: Breast cancer diagnosis using least square support vector machine. Digit. Signal Process. 17(4), 694–701 (2007)
Polat, K., Gunes, S.: An expert system approach based on principal component analysis and adaptive neuro-fuzzy inference system to diagnosis of diabetes disease. Digit. Signal Process. 17, 702–710 (2007)
Quinlan, J.R.: Improved use of continuous attributes in C4.5. J. Artif. Intell. Res. 4, 77–90 (1996)
Rokach, L.: Ensemble-based classifiers. Artif. Intell. Rev. 33, 1–39 (2010)
Roses, D.F.: Clinical assessment of breast cancer and benign breast disease, Ch. 14. In: Harris, M.N. (ed.) Breast Cancer, pp. 15–26. Churchill Livingstone, Philadelphia (2005)
Salama, G.I., Abdelhalim, M.B., Zeid, M.A.: Breast cancer diagnosis on three different datasets using multi-classifiers. Int. J. Comput. Inf. Technol. 01(01), 764–2277 (2012)
Ster, B., Dobnikar, A.: Neural networks in medical diagnosis: Comparison with other methods. In: Proceedings of the International Conference on Engineering Applications of Neural Networks pp. 427–430 (1996)
Subashini, T.S., Ramalingam, V., Palanivel, S.: Breast mass classification based on cytological patterns using RBFNN and SVM. Expert Syst. Appl. 36, 5284–5290 (2009)
Tsoumakas, G., Angelis, L., Vlahavas, I.: Selective fusion of heterogeneous classifiers. Intell. Data Analysis. 9, 511–525 (2005)
Tsoumakas, G., Angelis, L., Vlahavas, I.: Selective fusion of heterogeneous classifiers. Intell. Data Anal. 9, 511–525 (2005)
Tu, M.C., Shin, D., Shin, D.: Effective diagnosis of heart disease through bagging approach. 2nd international conference on biomedical engineering and informatics. pp 1–4 (2009)
Übeyli, E.D.: Implementing automated diagnostic systems for breast cancer detection. Expert Syst. Appl. 33(4), 1054–1062 (2007)
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Bashir, S., Qamar, U. & Khan, F.H. Heterogeneous classifiers fusion for dynamic breast cancer diagnosis using weighted vote based ensemble. Qual Quant 49, 2061–2076 (2015). https://doi.org/10.1007/s11135-014-0090-z
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11135-014-0090-z