Supervised Machine Learning in a Nutshell

  • Chapter
  • First Online:
Data Science for Entrepreneurship

Part of the book series: Classroom Companion: Business ((CCB))

  • 1075 Accesses

Abstract

This chapter introduces the fundamental to supervised machine learning algorithms, namely the classification and regression problems. We explain each technique using an inspiring example and discuss how the corresponding algorithms work together with the data engineering pipelines. They provide some guidelines for implementing a classification or regression task for other problems and required materials to evaluate the supervised learning being used.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Springer+ Basic
EUR 32.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or Ebook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now
Chapter
EUR 29.95
Price includes VAT (Germany)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
EUR 67.40
Price includes VAT (Germany)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
EUR 64.19
Price includes VAT (Germany)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free ship** worldwide - see info
Hardcover Book
EUR 85.59
Price includes VAT (Germany)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free ship** worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Similar content being viewed by others

Notes

  1. 1.

    7 https://www.kaggle.com/mlg-ulb/creditcardfraud.

  2. 2.

    There is another type of regression, named ordinal regression, where the dependent variables are of ordinal type, each showing a rank assigned to a sample within the dataset.

  3. 3.

    In fact, it is the inverse of the covariance matrix if the data is normalized.

References

  • Beck, A., & Teboulle, M. (2009). A fast iterative shrinkage-thresholding algorithm for linear inverse problems. SIAM Journal on Imaging Sciences, 2(1), 183–202.

    Article  Google Scholar 

  • Bhattacharyya, S., Jha, S., Tharakunnel, K., & Westland, J. C. (2011). Data mining for credit card fraud: A comparative study. Decision Support Systems, 50(3), 602–613.

    Article  Google Scholar 

  • Cunningham, P., & Delany, S. J. (2020). k-nearest neighbour classifiers. ar**v preprint ar**v:2004.04523.

    Google Scholar 

  • Drucker, H., Burges, C. J. C., Kaufman, L., Smola, A. J., & Vapnik, V. (1997). Support vector regression machines. In Advances in neural information processing systems (pp. 155–161).

    Google Scholar 

  • Guyon, I., & Elisseeff, A. (2003). An introduction to variable and feature selection. Journal of Machine Learning Research, 3, 1157–1182.

    Google Scholar 

  • Han, J., Pei, J., & Kamber, M. (2011). Data mining: Concepts and techniques. Elsevier.

    Google Scholar 

  • Hastie, T., Tibshirani, R., & Wainwright, M. (2015). Statistical learning with sparsity: The lasso and generalizations. CRC Press.

    Book  Google Scholar 

  • He, H., & Garcia, E. A. (2009). Learning from imbalanced data. IEEE Transactions on Knowledge and Data Engineering, 21(9), 1263–1284.

    Article  Google Scholar 

  • Hoerl, A. E., Kannard, R. W., & Baldwin, K. F. (1975). Ridge regression: Some simulations. Communications in Statistics-Theory and Methods, 4(2), 105–123.

    Google Scholar 

  • Kim, S.-J., Koh, K., Lustig, M., Boyd, S., & Gorinevsky, D. (2007). An interior-point method for large-scale l1-regularized least squares. IEEE Journal of Selected Topics in Signal Processing, 1(4), 606–617.

    Article  Google Scholar 

  • Mitchell, T. M. (1999). Machine learning and data mining. Communications of the ACM, 42(11), 30–36.

    Article  Google Scholar 

  • Mohammadi, M. (2019). A projection neural network for the generalized lasso. IEEE Transactions on Neural Networks and Learning Systems, 31, 2217–2221.

    Article  Google Scholar 

  • Mohammadi, M., Mousavi, S. H., & Effati, S. (2019). Generalized variant support vector machine. IEEE Transactions on Systems, Man, and Cybernetics: Systems, 51, 2798–2809.

    Article  Google Scholar 

  • Paasch, C. A. W. (2008). Credit card fraud detection using artificial neural networks tuned by genetic algorithms. Hong Kong University of Science and Technology (Hong Kong).

    Book  Google Scholar 

  • Rish, I., et al. (2001). An empirical study of the naive bayes classifier. IJCAI 2001 Workshop on Empirical Methods in Artificial Intelligence, 3, 41–46.

    Google Scholar 

  • Samuel, A. L. (1959). Some studies in machine learning using the game of checkers. IBM Journal of Research and Development, 3(3), 210–229.

    Article  Google Scholar 

  • Shawe-Taylor, J., Cristianini, N., et al. (2004). Kernel methods for pattern analysis. Cambridge University Press.

    Book  Google Scholar 

  • Stone, M. (1974). Cross-validatory choice and assessment of statistical predictions. Journal of the Royal Statistical Society: Series B (Methodological), 36(2), 111–133.

    Google Scholar 

  • Suykens, J. A. K., & Vandewalle, J. (1999). Least squares support vector machine classifiers. Neural Processing Letters, 9(3), 293–300.

    Article  Google Scholar 

  • Whitrow, C., Hand, D. J., Juszczak, P., Weston, D., & Adams, N. M. (2009). Transaction aggregation as a strategy for credit card fraud detection. Data Mining and Knowledge Discovery, 18(1), 30–55.

    Article  Google Scholar 

  • Wright, R. E. (1995). Logistic regression. In L. G. Grimm & P. R. Yarnold (Eds.), Reading and understanding multivariate statistics (pp. 217–244). American Psychological Association.

    Google Scholar 

Further Reading

  • Aldridge, I. (2013). High-frequency trading: A practical guide to algorithmic strategies and trading systems (Vol. 604). John Wiley & Sons.

    Google Scholar 

  • Géron, A. (2019). Hands-on machine learning with Scikit-Learn, Keras, and TensorFlow: Concepts, tools, and techniques to build intelligent systems. O’Reilly Media.

    Google Scholar 

  • Mitchell, T. M. (1997). Machine learning (Vol. 45(37), pp. 870–877). McGraw Hill.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Majid Mohammadi .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2023 Springer Nature Switzerland AG

About this chapter

Check for updates. Verify currency and authenticity via CrossMark

Cite this chapter

Mohammadi, M., Di Nucci, D. (2023). Supervised Machine Learning in a Nutshell. In: Liebregts, W., van den Heuvel, WJ., van den Born, A. (eds) Data Science for Entrepreneurship. Classroom Companion: Business. Springer, Cham. https://doi.org/10.1007/978-3-031-19554-9_6

Download citation

Publish with us

Policies and ethics

Navigation