Abstract
This chapter introduces the fundamental to supervised machine learning algorithms, namely the classification and regression problems. We explain each technique using an inspiring example and discuss how the corresponding algorithms work together with the data engineering pipelines. They provide some guidelines for implementing a classification or regression task for other problems and required materials to evaluate the supervised learning being used.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
Notes
- 1.
- 2.
There is another type of regression, named ordinal regression, where the dependent variables are of ordinal type, each showing a rank assigned to a sample within the dataset.
- 3.
In fact, it is the inverse of the covariance matrix if the data is normalized.
References
Beck, A., & Teboulle, M. (2009). A fast iterative shrinkage-thresholding algorithm for linear inverse problems. SIAM Journal on Imaging Sciences, 2(1), 183–202.
Bhattacharyya, S., Jha, S., Tharakunnel, K., & Westland, J. C. (2011). Data mining for credit card fraud: A comparative study. Decision Support Systems, 50(3), 602–613.
Cunningham, P., & Delany, S. J. (2020). k-nearest neighbour classifiers. ar**v preprint ar**v:2004.04523.
Drucker, H., Burges, C. J. C., Kaufman, L., Smola, A. J., & Vapnik, V. (1997). Support vector regression machines. In Advances in neural information processing systems (pp. 155–161).
Guyon, I., & Elisseeff, A. (2003). An introduction to variable and feature selection. Journal of Machine Learning Research, 3, 1157–1182.
Han, J., Pei, J., & Kamber, M. (2011). Data mining: Concepts and techniques. Elsevier.
Hastie, T., Tibshirani, R., & Wainwright, M. (2015). Statistical learning with sparsity: The lasso and generalizations. CRC Press.
He, H., & Garcia, E. A. (2009). Learning from imbalanced data. IEEE Transactions on Knowledge and Data Engineering, 21(9), 1263–1284.
Hoerl, A. E., Kannard, R. W., & Baldwin, K. F. (1975). Ridge regression: Some simulations. Communications in Statistics-Theory and Methods, 4(2), 105–123.
Kim, S.-J., Koh, K., Lustig, M., Boyd, S., & Gorinevsky, D. (2007). An interior-point method for large-scale l1-regularized least squares. IEEE Journal of Selected Topics in Signal Processing, 1(4), 606–617.
Mitchell, T. M. (1999). Machine learning and data mining. Communications of the ACM, 42(11), 30–36.
Mohammadi, M. (2019). A projection neural network for the generalized lasso. IEEE Transactions on Neural Networks and Learning Systems, 31, 2217–2221.
Mohammadi, M., Mousavi, S. H., & Effati, S. (2019). Generalized variant support vector machine. IEEE Transactions on Systems, Man, and Cybernetics: Systems, 51, 2798–2809.
Paasch, C. A. W. (2008). Credit card fraud detection using artificial neural networks tuned by genetic algorithms. Hong Kong University of Science and Technology (Hong Kong).
Rish, I., et al. (2001). An empirical study of the naive bayes classifier. IJCAI 2001 Workshop on Empirical Methods in Artificial Intelligence, 3, 41–46.
Samuel, A. L. (1959). Some studies in machine learning using the game of checkers. IBM Journal of Research and Development, 3(3), 210–229.
Shawe-Taylor, J., Cristianini, N., et al. (2004). Kernel methods for pattern analysis. Cambridge University Press.
Stone, M. (1974). Cross-validatory choice and assessment of statistical predictions. Journal of the Royal Statistical Society: Series B (Methodological), 36(2), 111–133.
Suykens, J. A. K., & Vandewalle, J. (1999). Least squares support vector machine classifiers. Neural Processing Letters, 9(3), 293–300.
Whitrow, C., Hand, D. J., Juszczak, P., Weston, D., & Adams, N. M. (2009). Transaction aggregation as a strategy for credit card fraud detection. Data Mining and Knowledge Discovery, 18(1), 30–55.
Wright, R. E. (1995). Logistic regression. In L. G. Grimm & P. R. Yarnold (Eds.), Reading and understanding multivariate statistics (pp. 217–244). American Psychological Association.
Further Reading
Aldridge, I. (2013). High-frequency trading: A practical guide to algorithmic strategies and trading systems (Vol. 604). John Wiley & Sons.
Géron, A. (2019). Hands-on machine learning with Scikit-Learn, Keras, and TensorFlow: Concepts, tools, and techniques to build intelligent systems. O’Reilly Media.
Mitchell, T. M. (1997). Machine learning (Vol. 45(37), pp. 870–877). McGraw Hill.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2023 Springer Nature Switzerland AG
About this chapter
Cite this chapter
Mohammadi, M., Di Nucci, D. (2023). Supervised Machine Learning in a Nutshell. In: Liebregts, W., van den Heuvel, WJ., van den Born, A. (eds) Data Science for Entrepreneurship. Classroom Companion: Business. Springer, Cham. https://doi.org/10.1007/978-3-031-19554-9_6
Download citation
DOI: https://doi.org/10.1007/978-3-031-19554-9_6
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-19553-2
Online ISBN: 978-3-031-19554-9
eBook Packages: Business and ManagementBusiness and Management (R0)