Customer Churn Prediction in Telecommunications Using Gradient Boosted Trees

  • Conference paper
  • First Online:
International Conference on Innovative Computing and Communications

Part of the book series: Advances in Intelligent Systems and Computing ((AISC,volume 1059))

Abstract

Customer churn is a critical problem faced by many industries these days. It is 5–10 times more valuable to keep a long-term customer than acquiring a new one. This paper addresses the problem of customer churn with respect to telecommunication industry as churn rate is quite high in this industry (ranging from 10 to 60%) in comparison to others. Predicting customer churn in advance can help these companies in retaining their customers. The paper proposes XGBoost algorithm as a model with the best performance among other state-of-the-art algorithms. The previously used models focus more on the accurate prediction of churners as compared to non-churners, whereas the proposed model classifies churners among the total churners correctly and is able to achieve the highest True positive rate of 81% and AUC score of 0.85. Also, concepts of data transformation, feature selection, and data balancing using oversampling are applied for the same.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic
EUR 32.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or Ebook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Chapter
EUR 29.95
Price includes VAT (France)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
EUR 117.69
Price includes VAT (France)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
EUR 158.24
Price includes VAT (France)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free ship** worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Similar content being viewed by others

References

  1. Hand D (2007) Principles of data mining. Drug Saf 30:621–622

    Article  Google Scholar 

  2. Osisanwo FY, Akinsola JE, Awodele O et al (2017) Supervised machine learning algorithms: classification and comparison. Inter J Comput Trends Technol 48:128–138

    Article  Google Scholar 

  3. Dalvi PK, Khandge SK, Deomore A et al (2016) Analysis of customer churn prediction in telecom industry using decision trees and logistic regression. In: Symposium on colossal data analysis and networking (CDAN)

    Google Scholar 

  4. Kisioglu P, Topcu YI (2011) Applying bayesian belief network approach to customer churn analysis: a case study on the telecom industry of Turkey. Expert Syst Appl 38:7151–7157

    Article  Google Scholar 

  5. Brandusoiu I, Toderean G (2013) Churn prediction in the telecommunications sector using support vector machines. Margin 1:x1

    Google Scholar 

  6. Lu N, Lin H, Lu J, Zhang G (2014) A customer churn prediction model in telecom industry using boosting. IEEE Trans Ind Inform 10:1659–1665

    Article  Google Scholar 

  7. Adwan O, Faris H, Jaradat K et al (2014) Predicting customer churn in telecom industry using multilayer preceptron neural networks: modeling and analysis. Life Sci J 11(2):75–81

    Google Scholar 

  8. Huang B, Kechadi MT, Buckley B (2012) Customer churn prediction in telecommunications. Expert Syst Appl 39:1414–1425

    Article  Google Scholar 

  9. **a G-E, ** W-D (2008) Model of customer churn prediction on support vector machine. Syst Eng Theor Pract 28:71–77

    Article  Google Scholar 

  10. Amin A, Al-Obeidat F, Shah B et al (2019) Customer churn prediction in telecommunication industry using data certainty. J Bus Res 94:290–301

    Article  Google Scholar 

  11. Azeem M, Usman M, Fong ACM (2017) A churn prediction model for prepaid customers in telecom using fuzzy classifiers. Telecommun Syst 66:603–614

    Article  Google Scholar 

  12. Chen T, Guestrin C (2016) Xgboost: a scalable tree boosting system. In: Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp 785–794

    Google Scholar 

  13. Maclin R, Opitz D (1997) An empirical evaluation of bagging and boosting. AAAI/IAAI, pp 546–551

    Google Scholar 

  14. Ying C, Qi-Guang M, Jia-Chen L, Lin G (2014) Advance and prospects of adaboost algorithm. Acta Automatica Sinica 39:745–758

    Article  Google Scholar 

  15. Friedman JH (2001) Greedy function approximation: a gradient boosting machine. Ann Stat 1:189–232

    MathSciNet  MATH  Google Scholar 

  16. Pohjalainen V (2017) Predicting service contract churn with decision tree models

    Google Scholar 

  17. Ajit P (2016) Prediction of employee turnover in organizations using machine learning algorithms. Algorithms 4(5):C5

    Google Scholar 

  18. Zhao J, Wang W, Sheng C (2018) Data preprocessing techniques. In: Data driven prediction for industrial process and their applications. Springer, Berlin

    Google Scholar 

  19. Churn in telecom’s dataset. https://www.kaggle.com/becksddf/churn-in-telecoms-dataset. Accessed 20 Sep 2018

  20. Dash M, Liu H (1997) Feature selection for classification. Intell Data Anal 1(3):131–156

    Article  Google Scholar 

  21. Chawla NV, Bowyer KW, Hall LO et al (2002) SMOTE: synthetic minority over-sampling technique. J Artif Intell Res 16:321–357

    Article  Google Scholar 

  22. Jiménez-Valverde A (2012) Insights into the area under the receiver operating characteristic curve (AUC) as a discrimination measure in species distribution modeling. Glob Ecol Biogeogr 2(4):498–507

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Tanu Sharma .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2020 Springer Nature Singapore Pte Ltd.

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Sharma, T., Gupta, P., Nigam, V., Goel, M. (2020). Customer Churn Prediction in Telecommunications Using Gradient Boosted Trees. In: Khanna, A., Gupta, D., Bhattacharyya, S., Snasel, V., Platos, J., Hassanien, A. (eds) International Conference on Innovative Computing and Communications. Advances in Intelligent Systems and Computing, vol 1059. Springer, Singapore. https://doi.org/10.1007/978-981-15-0324-5_20

Download citation

Publish with us

Policies and ethics

Navigation