Log in

An improved weighted decision tree approach for breast cancer prediction

  • Original Research
  • Published:
International Journal of Information Technology Aims and scope Submit manuscript

Abstract

The death rate in case of breast cancer existence can be reduced by identifying the tumour at an early stage. The survival rate can be increased if the tumour is identified initially and not spread to other organs. The mammography is able to recognize the various breast tissues with area size and criticality parameters. The machine learning algorithm can be applied on these breast tissue features to identify the chances of tumour recurrence. In this paper, a selective feature based improved decision tree algorithm is suggested to predict the chances of breast cancer occurence. Initially, each cancer descriptive symptom and features are processed under Chi square test to recognize the most contributing features. The ranked selected features are processed in the same order to generate the feature adaptive improved decision tree. For each tree node, the entropy and cost based rules are defined to predict the existence or non-existence of breast cancer. The proposed feature rank based improved decision tree is applied on two most popular breast cancer datasets taken from the UCI repository. The comparative results against the decision tree, naive bayes, random tree and random forest classifiers shows that the proposed model has predicted the breast cancer more accurately.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Subscribe and save

Springer+ Basic
EUR 32.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or Ebook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3

Similar content being viewed by others

References

  1. Huangfu W, Wang F, Liu L, Long K, Lin X (2016) A breast cancer risk classification model based on the features selected by novel F-Score index for the imbalanced multi-feature dataset. In: International Conference on Cyber-Enabled Distributed Computing and Knowledge Discovery (CyberC), Chengdu, China, pp 198–203

  2. Dutra I, Salvini R, Burnside E, Ferreira P (2016) Interpretable models to predict breast cancer. In: IEEE International Conference on Bioinformatics and Biomedicine (BIBM), Shenzhen, pp 1507–1511

  3. Ngom A, Rueda L, Huy PQ (2016) A new feature selection approach for optimizing prediction models, applied to breast cancer subtype classification. In: IEEE International Conference on Bioinformatics and Biomedicine (BIBM), Shenzhen, pp 1535–1541

  4. Cosma G, Brown D, Pockley AG, Alzubaidi A (2016) Breast cancer diagnosis using a hybrid genetic algorithm for feature selection based on mutual information. In: International Conference on Interactive Technologies and Games (ITAG), Notthingham, pp 70–76

  5. Rajendran GS, Varsha J, Kavitha KR (2016) A correlation based SVM-recursive multiple feature elimination classifier for breast cancer disease using microarray. In: International Conference on Advances in Computing, Communications and Informatics (ICACCI), Jaipur, pp 2677–2683

  6. Singh H, Sharma A, Ohri K (2016) Fuzzy expert system for diagnosis of Breast Cancer In: International Conference on Wireless Communications, Signal Processing and Networking (WiSPNET), Chennai, pp 2487–2492

  7. Sanwal K, Praveen S, Singh I (2016) Breast cancer detection using two-fold genetic evolution of neural network ensembles. In: International Conference on Data Science and Engineering (ICDSE), Cochin, pp 1–6

  8. Bazazeh D, Shubair R (2016) Comparative study of machine learning algorithms for breast cancer detection and diagnosis. In: 5th International Conference on Electronic Devices, Systems and Applications (ICEDSA), Ras Al Khaimah, pp 1–4

  9. Munshi MAR, Sabab SA, Shihab S, Pritom AI (2016) Predicting breast cancer recurrence using effective classification and feature selection technique. In: 19th International Conference on Computer and Information Technology (ICCIT), Dhaka, Bangladesh, pp 310–314

  10. Suryachandra P, Reddy PVS (2016) Comparison of machine learning algorithms for breast cancer. In: International Conference on Inventive Computation Technologies (ICICT), Coimbatore, pp 1–6

  11. Jayaraj T, Sanjana VG, Sachin VPD (2016) A review on neural network and its implementation on breast cancer detection. In: International Conference on Communication and Signal Processing (ICCSP), Melmaruvathur, pp 1727–1730

  12. Sundaram KS, Muthuselvan PS (2016) Prediction of breast cancer using classification rule mining techniques in blood test datasets. In: International Conference on Information Communication and Embedded Systems (ICICES), Chennai, pp 1–5

  13. Lekha A, Bawane N, Rashmi GD (2015) Analysis of efficiency of classification and prediction algorithms (Naïve Bayes) for Breast Cancer dataset. In: International Conference on Emerging Research in Electronics, Computer Science and Technology (ICERECT), Mandya, pp 108–113

  14. Mohamed R, Sewissy AA, Soliman THA (2016) A hybrid analytical hierarchical process and deep neural networks approach for classifying breast cancer. In: 11th International Conference on Computer Engineering and Systems (ICCES), Cairo, pp 212–219

  15. Seno S, Takenaka Y, Noguchi S, Matsuda H, Sota Y (2016) Comparative analysis of transformation methods for gene expression profiles in breast cancer datasets. In: IEEE 16th International Conference on Bioinformatics and Bioengineering (BIBE), Taichung, pp 328–333

  16. Senapati MR, Mohanty AK, Dash S, Dash PK (2013) Local linear wavelet neural network for breast cancer recognition. Neural Comput and Appl 22(1):125–131

    Article  Google Scholar 

  17. Saybani MR, Wah TY, Aghabozorgi SR, Shamshirband S, Kiah MLM, Balas VE (2016) Diagnosing breast cancer with an improved artificial immune recognition system. Soft Comput 20(10):4069–4084

    Article  Google Scholar 

  18. El-Baz AH (2015) Hybrid intelligent system-based rough set and ensemble classifier for breast cancer diagnosis. Neural Comput Appl 26(2):437–446

    Article  Google Scholar 

  19. Sangaiah I (2018) Vincent Antony Kumar A (2018) Improving medical diagnosis performance using hybrid feature selection via relieff and entropy based genetic search (RF-EGA) approach: application to breast cancer prediction. Cluster Comput. https://doi.org/10.1007/s10586-018-1702-5

    Article  Google Scholar 

  20. Alickovic E, Subasi A (2017) Breast cancer diagnosis using GA feature selection and rotation forest. Neural Comput Appl 28(4):753–763

    Article  Google Scholar 

  21. Ahmad F, Isa NA, Hussain Z, Osman MK, Sulaiman SN (2015) A GA-based feature selection and parameter optimization of an ANN in diagnosing breast cancer. Pattern Anal Appl 18(4):861–870

    Article  MathSciNet  Google Scholar 

  22. Agrawal Sanjay, Panda Rutuparna, Dora Ajith Abraham Lingraj (2017) Optimal breast cancer classification using Gauss-Newton representation based algorithm. Expert Syst Appl 85:134–145

    Article  Google Scholar 

  23. Diosan L, Andreica A (2015) Multi-objective breast cancer classification by using multi-expression programming. Appl Intell 43(3):499–511

    Article  Google Scholar 

  24. Breast-cancer. https://archive.ics.uci.edu/ml/machine-learning-databases/breast-cancer/. Retrieved 10 Mar 2017

  25. https://archive.ics.uci.edu/ml/machine-learning-databases/breast-cancer-wisconsin/. Retrieved 10 Mar 2017

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Kapil Juneja.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Juneja, K., Rana, C. An improved weighted decision tree approach for breast cancer prediction. Int. j. inf. tecnol. 12, 797–804 (2020). https://doi.org/10.1007/s41870-018-0184-2

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s41870-018-0184-2

Keywords

Navigation