Predictive Analytics for Targeting Decisions

  • Chapter
  • First Online:
Machine Learning for Data Science Handbook

Abstract

Predictive Analytics (PA) models are assuming increasing role in the world of big data for making decisions in many industries – marketing, banking, insurance, telecommunication, healthcare, cyber, and more. Even in the era of data mining and machine learning, the leading predictive models still belong to the realm of regression. While regression models were originally developed to explain phenomena, find relationships between variables, and draw conclusions, in prediction models the main objective is to build models which are general enough to apply for predicting unseen data, even at the expense of giving up some model accuracy. Therefore, models with good explanation power are not necessarily models with good prediction power, and vice versa. Focusing on regression models, we discuss in this article the differences between explanation and prediction models, propose several principles for building good predictive models, present several performance measures for assessing the quality of the prediction results in classification problems based on logistic regression, and conclude by discussing the deployment process of the model results for decision-making. We end up this article by briefly reviewing the non-parametric decision tree approach for building the PA model.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Springer+ Basic
EUR 32.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or Ebook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now
Chapter
EUR 29.95
Price includes VAT (Germany)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
EUR 213.99
Price includes VAT (Germany)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Hardcover Book
EUR 267.49
Price includes VAT (Germany)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free ship** worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Similar content being viewed by others

Notes

  1. 1.

    In the following we use the terms predictors, explanatory variables, variables, independent variables, attributes and features interchangeably

References

  • Akaike, H. (1974). A New Look at the Statistical Identification Model, IEEE Trans. Auto. Control, 19, pp. 716-723.

    Article  MATH  Google Scholar 

  • Ben-Akiva, M., and Lerman, S.R. (1987). Discrete Choice Analysis, the MIT Press, Cambridge, MA.

    Google Scholar 

  • Benjamini, Y., and Hochberg, Y. (1995). Controlling the False Discovery Rate: A Practical and Powerful Approach to Multiple Testing, J. R. Statistical Society, 57, pp. 289-300.

    MathSciNet  MATH  Google Scholar 

  • Bifet, A., Gavaldà, R., Holmes, G., and Pfahringer, B. (2018). Machine Learning for Data Streams with Practical Examples in Massive Online Analysis, Adaptive Computation and Machine Learning, MIT Press.

    Google Scholar 

  • Breiman, L., Friedman, J., Olshen, R., and Stone, C. (1984), Classification and Regression Trees, Belmont, CA., Wadsworth.

    MATH  Google Scholar 

  • DeGroot, M. H. (1993). Probability and Statistics 3rd edition, Addison-Wesley.

    MATH  Google Scholar 

  • Diaconis, P. (1985). Theories of Data Analysis from Magical Thinking through Classical Statistics. In Exploring data tables, trends and shapes , Hoaglin, D. C., Mosteller, F. and Tukey, J. W. (edts), Wiley, NY, pp. 1–36.

    Google Scholar 

  • Elkan, C. (1997). Boosting and Naïve Bayesian Learning, Technical Report CS97-557, University of San Diego, California.

    Google Scholar 

  • Geman, S., Bienenstock, E., and Doursat, R. (1992). Neural Networks and Bias/Variance Dilema, Nueral Computation, 4, pp. 1–58.

    Article  Google Scholar 

  • Efoymson, M.A. (1960). Multiple Regression Analysis in Mathematical Method for Digital computers, Wiley, NY, pp. 191–203.

    Google Scholar 

  • Goldberg, D. E. (1989). Genetic Algorithms in Search Optimization and Machine Learning. Addison-Wesley Publishing Company Inc.

    MATH  Google Scholar 

  • Hastie, T., Tibshirani, R., and Friedman, J. (2009). The elements of Statistical Learning, Springer, NY.

    Book  MATH  Google Scholar 

  • Hochberg, Y., and Tamhane, A. C. (1987). Multiple Comparison Proceedings, John Wiley & Sons.

    Book  MATH  Google Scholar 

  • Kelleher, J. D., MacNamee, B., and D’Arcy, A. (2015). Fundamentals of Machine Learning for Predictive Data Analytics: Algorithms, Worked Examples, and Case Studies, MIT Press.

    MATH  Google Scholar 

  • Lambert, P. J. (1993). The Distribution and Redistribution of Income, Manchester University Press.

    Google Scholar 

  • Levin, N., and Zahavi, J. (2005). Data Mining for Target Marketing, The Data Mining and Knowledge Discovery Handbook, Maimon O., and Rokach, L., (eds), Springer, NY, pp. 1261–1301.

    Chapter  Google Scholar 

  • Miller, A. J. (2002). Subset Selection in Regression, Chapman and Hall.

    Book  MATH  Google Scholar 

  • Murthy, K.S. (1998). Automatic Construction of Decision Trees from Data: A Multi-Disciplinary Survey, Data Mining and Knowledge Discovery, 2, pp. 45–389.

    Article  Google Scholar 

  • Quinlan, J.R. (1986). Induction of Decision Trees, Machine Learning, 1, pp. 81–106.

    Article  Google Scholar 

  • Quinlan, J.R. (1993). C4.5: Program for Machine Learning, Morgan Kaufman Publishing, CA.

    Google Scholar 

  • Rumelhart, D.E., McClelland, J.L., and Williams, R.J. (1986). Learning Internal Representation by Error Propagation, in Parallel Distributed Processing: Exploring the Microstructure of Cognition, Rumelhart, D.E., McClelland, J.L. and the PDP Researc Group, (eds.), MIT Press, Cambridge, MA.

    Google Scholar 

  • Schwartz, G. (1978). Estimating the Dimension of a Model, Annals of Statistics, 6, pp. 486–494.

    MathSciNet  Google Scholar 

  • Shannon, C.E. (1948). A Mathematical Theory of Communications, Bell System Technical Journal, 27, pp. 379–423 & 623–656.

    Google Scholar 

  • Shmueli, G. (2010). To Explain or to Predict? Statistical Science, 25, pp. 289–310.

    Article  MathSciNet  MATH  Google Scholar 

  • Shmueli, G., and Koppius, O. (2011). Predictive Analytics in Information Systems Research, MIS Quarterly, 34, pp. 553–572.

    Article  Google Scholar 

  • Tukey, J.W. (1977). Some Thoughts on Clinical Trials, Especially Problems of Multiplicity, Science, 198, pp.679–684.

    Google Scholar 

  • Vapnik, V.N. (1995). Support Vector Machines, Machine Learning, 20, pp. 273–297.

    Article  MATH  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Jacob Zahavi .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2023 Springer Nature Switzerland AG

About this chapter

Check for updates. Verify currency and authenticity via CrossMark

Cite this chapter

Zahavi, J. (2023). Predictive Analytics for Targeting Decisions. In: Rokach, L., Maimon, O., Shmueli, E. (eds) Machine Learning for Data Science Handbook. Springer, Cham. https://doi.org/10.1007/978-3-031-24628-9_33

Download citation

Publish with us

Policies and ethics

Navigation