Abstract
Breast cancer mortality can be reduced by identifying the tumour at an early stage. The survival rate increases if the tumour is detected before it has spread to other organs. Mammography can distinguish the various breast tissues along with their area size and criticality parameters. Machine learning algorithms can be applied to these breast tissue features to estimate the chance of tumour recurrence. In this paper, a selective-feature-based improved decision tree algorithm is proposed to predict the chance of breast cancer occurrence. Initially, each descriptive cancer symptom and feature is subjected to a Chi-square test to identify the most contributing features. The selected features are then processed in rank order to generate the feature-adaptive improved decision tree. For each tree node, entropy- and cost-based rules are defined to predict the existence or non-existence of breast cancer. The proposed feature-rank-based improved decision tree is applied to two of the most popular breast cancer datasets from the UCI repository. Comparative results against the decision tree, naive Bayes, random tree and random forest classifiers show that the proposed model predicts breast cancer more accurately.
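The two ingredients named in the abstract, Chi-square feature ranking and node entropy, can be illustrated with a minimal sketch. This is not the authors' implementation: the function names, the toy records and the feature names `node_caps` and `irradiat` are assumptions made for illustration, and the cost-based rules are omitted because the abstract does not define them.

```python
import math
from collections import Counter

def chi_square(feature, labels):
    """Pearson chi-square statistic between a categorical feature and the class."""
    n = len(labels)
    feature_counts = Counter(feature)
    label_counts = Counter(labels)
    observed = Counter(zip(feature, labels))
    stat = 0.0
    for f, fc in feature_counts.items():
        for l, lc in label_counts.items():
            expected = fc * lc / n  # count expected if feature and class were independent
            stat += (observed[(f, l)] - expected) ** 2 / expected
    return stat

def entropy(labels):
    """Shannon entropy (in bits) of the class labels reaching a tree node."""
    n = len(labels)
    return -sum((c / n) * math.log2(c / n) for c in Counter(labels).values())

def rank_features(rows, labels, names):
    """Return feature names sorted by decreasing chi-square score."""
    scores = {
        name: chi_square([row[i] for row in rows], labels)
        for i, name in enumerate(names)
    }
    return sorted(scores, key=scores.get, reverse=True)

# Hypothetical toy records (not taken from the UCI datasets).
names = ["irradiat", "node_caps"]
rows = [("no", "yes"), ("yes", "yes"), ("no", "no"), ("yes", "no")]
labels = ["recurrence", "recurrence", "no-recurrence", "no-recurrence"]

ranked = rank_features(rows, labels, names)
print(ranked)           # features in the order a rank-driven tree would consider them
print(entropy(labels))  # impurity of the root node before any split
```

In this toy data `node_caps` separates the two classes perfectly while `irradiat` is independent of the class, so a rank-ordered tree would consider `node_caps` first.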
Cite this article
Juneja, K., Rana, C. An improved weighted decision tree approach for breast cancer prediction. Int. j. inf. tecnol. 12, 797–804 (2020). https://doi.org/10.1007/s41870-018-0184-2
DOI: https://doi.org/10.1007/s41870-018-0184-2